cs.LG(2025-12-01)

📊 共 7 篇论文

🎯 兴趣领域导航

支柱九:具身大模型 (Embodied Foundation Models) (4) 支柱二:RL算法与架构 (RL & Architecture) (3)

🔬 支柱九:具身大模型 (Embodied Foundation Models) (4 篇)

#题目一句话要点标签🔗
1 Do Large Language Models Walk Their Talk? Measuring the Gap Between Implicit Associations, Self-Report, and Behavioral Altruism 评估大型语言模型利他行为:揭示内隐认知、自我报告与实际行为间的差距 large language model
2 RE-LLM: Integrating Large Language Models into Renewable Energy Systems RE-LLM:集成大语言模型到可再生能源系统,提升能源模型可解释性 large language model
3 AlignSAE: Concept-Aligned Sparse Autoencoders 提出AlignSAE,通过概念对齐的稀疏自编码器实现LLM内部知识的可控干预。 large language model
4 Zero-Overhead Introspection for Adaptive Test-Time Compute ZIP-RC:为LLM配备零开销自省能力,实现自适应测试时计算。 large language model

🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)

#题目一句话要点标签🔗
5 Forecasting in Offline Reinforcement Learning for Non-stationary Environments 提出FORL框架,解决离线强化学习在非平稳环境中因状态偏移导致的性能下降问题。 reinforcement learning offline RL offline reinforcement learning
6 Stabilizing Reinforcement Learning with LLMs: Formulation and Practices 提出基于LLM的强化学习新公式,解决训练不稳定问题并提供稳定训练方案。 reinforcement learning large language model
7 Agentic Policy Optimization via Instruction-Policy Co-Evolution 提出INSPO,通过指令-策略协同进化优化Agentic策略,提升多轮推理能力。 reinforcement learning large language model

⬅️ 返回 cs.LG 首页 · 🏠 返回主页