cs.LG (2025-08-27)

📊 29 papers in total | 🔗 3 with code

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (15 🔗2) | Pillar 2: RL Algorithms & Architecture (11 🔗1) | Pillar 1: Robot Control (2) | Pillar 8: Physics-based Animation (1)

🔬 Pillar 9: Embodied Foundation Models (15 papers)

# | Title | One-Sentence Summary | Tags | 🔗
1 | Pre-trained knowledge elevates large language models beyond traditional chemical reaction optimizers | Uses pre-trained knowledge to improve large language model performance in chemical reaction optimization | large language model
2 | ECG-Soup: Harnessing Multi-Layer Synergy for ECG Foundation Models | Proposes ECG-Soup to improve the performance of ECG foundation models | foundation model
3 | Cross-Platform E-Commerce Product Categorization and Recategorization: A Multimodal Hierarchical Classification Approach | Proposes a multimodal hierarchical classification framework for e-commerce product categorization | multimodal
4 | FinCast: A Foundation Model for Financial Time-Series Forecasting | Proposes FinCast to address the complexity of financial time-series forecasting | foundation model
5 | A Systematic Review on the Generative AI Applications in Human Medical Genomics | Systematically reviews generative AI applications in medical genomics | large language model, multimodal
6 | SCAR: A Characterization Scheme for Multi-Modal Dataset | Proposes the SCAR scheme to characterize the properties of multimodal datasets | foundation model, multimodal
7 | Robustness is Important: Limitations of LLMs for Data Fitting | Reveals the fragility and limitations of LLMs in data fitting | large language model, foundation model
8 | The LLM as a Network Operator: A Vision for Generative AI in the 6G Radio Access Network | Proposes an LLM-RAN operator to address the management complexity of future 6G wireless networks | large language model
9 | LLM-QUBO: An End-to-End Framework for Automated QUBO Transformation from Natural Language Problem Descriptions | Proposes the LLM-QUBO framework to automate QUBO transformation for optimization problems | large language model
10 | Symphony: A Decentralized Multi-Agent Framework for Scalable Collective Intelligence | Proposes Symphony to address the limitations of centralized multi-agent systems | large language model
11 | Linear-Time Demonstration Selection for In-Context Learning via Gradient Estimation | Proposes a linear-time demonstration selection algorithm to improve in-context learning | chain-of-thought
12 | CrystalICL: Enabling In-Context Learning for Crystal Generation | Proposes CrystalICL to address few-shot learning in crystal generation | large language model
13 | Learning to Refine: Self-Refinement of Parallel Reasoning in LLMs | Proposes a generative self-refinement method to improve the reasoning ability of large language models | large language model
14 | Generative Models for Synthetic Data: Transforming Data Mining in the GenAI Era | Proposes generative models to address data scarcity and privacy issues | large language model
15 | MobText-SISA: Efficient Machine Unlearning for Mobility Logs with Spatio-Temporal and Natural-Language Data | Proposes MobText-SISA to address machine unlearning for mobility logs | multimodal

🔬 Pillar 2: RL Algorithms & Architecture (11 papers)

# | Title | One-Sentence Summary | Tags | 🔗
16 | Counterfactual Reward Model Training for Bias Mitigation in Multimodal Reinforcement Learning | Proposes a counterfactual reward model to mitigate bias in multimodal reinforcement learning | reinforcement learning, RLHF, representation learning
17 | Data-Efficient Symbolic Regression via Foundation Model Distillation | Proposes the EQUATE framework for symbolic regression on small datasets | distillation, foundation model
18 | Adaptive Scaling of Policy Constraints for Offline Reinforcement Learning | Proposes adaptive scaling of policy constraints to address hyperparameter tuning in offline reinforcement learning | reinforcement learning, offline RL, offline reinforcement learning
19 | Dynamics-Aligned Latent Imagination in Contextual World Models for Zero-Shot Generalization | Proposes DALI to address environment adaptation in zero-shot generalization | reinforcement learning, world model, dreamer
20 | Encouraging Good Processes Without the Need for Good Answers: Reinforcement Learning for LLM Agent Planning | Proposes the RLTR framework to address the insufficient planning ability of LLM agents | reinforcement learning, large language model
21 | Learning Game-Playing Agents with Generative Code Optimization | Proposes a generative code optimization method for learning game-playing agents | reinforcement learning, deep reinforcement learning, large language model
22 | The Role of Teacher Calibration in Knowledge Distillation | Proposes a teacher model calibration method to improve knowledge distillation | distillation
23 | Reinforcement Learning for Search Tree Size Minimization in Constraint Programming: New Results on Scheduling Benchmarks | A reinforcement-learning-based method for minimizing search tree size in constraint programming | reinforcement learning
24 | Interestingness First Classifiers | Proposes the EUREKA framework for building interesting classifiers | Eureka, large language model
25 | MicroLad: 2D-to-3D Microstructure Reconstruction and Generation via Latent Diffusion and Score Distillation | Proposes MicroLad to address 3D microstructure reconstruction | distillation
26 | PoolFlip: A Multi-Agent Reinforcement Learning Security Environment for Cyber Defense | Proposes PoolFlip to address decision automation in cyber defense | reinforcement learning

🔬 Pillar 1: Robot Control (2 papers)

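# | Title | One-Sentence Summary | Tags | 🔗
27 | Multi-Agent Reinforcement Learning in Intelligent Transportation Systems: A Comprehensive Survey | Surveys the applications and challenges of multi-agent reinforcement learning in intelligent transportation systems | sim-to-real, reinforcement learning
28 | Pruning Strategies for Backdoor Defense in LLMs | Proposes attention-head pruning strategies to defend against backdoor attacks in large language models | manipulation, reinforcement learning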

🔬 Pillar 8: Physics-based Animation (1 paper)

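# | Title | One-Sentence Summary | Tags | 🔗
29 | Experimental End-to-End Optimization of Directly Modulated Laser-based IM/DD Transmission | Optimizes the transmission performance of directly modulated lasers using data-driven models | PULSE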
