cs.LG(2024-12-05)

📊 共 21 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (9 🔗2) 支柱九:具身大模型 (Embodied Foundation Models) (8) 支柱一:机器人控制 (Robot Control) (3) 支柱八:物理动画 (Physics-based Animation) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (9 篇)

#题目一句话要点标签🔗
1 Integrated Sensing and Communications for Low-Altitude Economy: A Deep Reinforcement Learning Approach 提出深度强化学习方法以优化低空经济中的集成感知与通信系统 reinforcement learning deep reinforcement learning DRL
2 Hierarchical Multi-Agent DRL Based Dynamic Cluster Reconfiguration for UAV Mobility Management 提出一种基于分层多智能体DRL的无人机动态集群重配置方法,用于优化移动性管理。 reinforcement learning deep reinforcement learning DRL
3 Action Mapping for Reinforcement Learning in Continuous Environments with Constraints 提出基于动作映射的强化学习方法,提升约束连续动作空间环境下的训练效率。 reinforcement learning deep reinforcement learning DRL
4 Multi-Preference Optimization: Generalizing DPO via Set-Level Contrasts 提出多偏好优化方法以解决直接偏好优化的局限性 DPO direct preference optimization
5 Closed-Loop Supervised Fine-Tuning of Tokenized Traffic Models 提出CAT-K闭环微调策略,提升Token化交通模型在交通仿真中的性能 reinforcement learning behavior cloning large language model
6 Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy 提出Marvel框架以加速安全在线强化学习 reinforcement learning policy learning
7 Disentangled Representation Learning for Causal Inference with Instruments 提出基于解耦表示学习的工具变量因果推断方法,解决潜在混淆变量问题。 representation learning
8 BEFL: Balancing Energy Consumption in Federated Learning for Mobile Edge IoT 提出BEFL框架以解决移动边缘物联网中的能耗不平衡问题 reinforcement learning imitation learning
9 ELEMENT: Episodic and Lifelong Exploration via Maximum Entropy 提出ELEMENT框架,通过最大熵探索实现高效的持续终身强化学习 reinforcement learning offline reinforcement learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (8 篇)

#题目一句话要点标签🔗
10 Revisiting Federated Fine-Tuning: A Single Communication Round is Enough for Foundation Models 针对大模型联邦微调,提出单轮通信即可达到多轮通信性能的方法 foundation model
11 BigDocs: An Open Dataset for Training Multimodal Models on Document and Code Tasks BigDocs: 开放的文档和代码多模态模型训练数据集与基准测试 multimodal
12 A large language model-type architecture for high-dimensional molecular potential energy surfaces 提出基于图神经网络的大语言模型架构,用于高维分子势能面预测。 large language model
13 SceneDiffuser: Efficient and Controllable Driving Simulation Initialization and Rollout SceneDiffuser:高效可控的自动驾驶仿真初始化与推演 large language model multimodal
14 Improving LLM Group Fairness on Tabular Data via In-Context Learning 通过上下文学习提升LLM在表格数据上的群体公平性 large language model chain-of-thought
15 Learning Symmetry-Independent Jet Representations via Jet-Based Joint Embedding Predictive Architecture 提出基于Jet的联合嵌入预测架构(J-JEPA),学习对称无关的Jet表征,用于高能物理中的Jet分析。 foundation model
16 WinTSR: A Windowed Temporal Saliency Rescaling Method for Interpreting Time Series Deep Learning Models 提出WinTSR,通过窗口化时间显著性重缩放解释时间序列深度学习模型 foundation model
17 SKIM: Any-bit Quantization Pushing The Limits of Post-Training Quantization SKIM:提出任意比特量化方法,突破后训练量化的性能极限 large language model

🔬 支柱一:机器人控制 (Robot Control) (3 篇)

#题目一句话要点标签🔗
18 Mind the Gap: Towards Generalizable Autonomous Penetration Testing via Domain Randomization and Meta-Reinforcement Learning 提出GAP框架,通过域随机化和元强化学习实现通用自主渗透测试 sim-to-real domain randomization reinforcement learning
19 Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting 提出基于自回归特征和优势加权的精细化行为基础模型,提升零样本泛化能力。 humanoid locomotion reinforcement learning
20 GRAM: Generalization in Deep RL with a Robust Adaptation Module 提出GRAM,通过鲁棒适应模块提升深度强化学习在复杂环境下的泛化能力 quadruped locomotion reinforcement learning

🔬 支柱八:物理动画 (Physics-based Animation) (1 篇)

#题目一句话要点标签🔗
21 Samudra: An AI Global Ocean Emulator for Climate Samudra:构建基于AI的全球海洋模拟器,用于气候研究。 spatiotemporal

⬅️ 返回 cs.LG 首页 · 🏠 返回主页