cs.LG (2025-12-22)
📊 8 papers in total | 🔗 2 with code
🎯 Interest Area Navigation
🔬 Pillar 9: Embodied Foundation Models (4 papers)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
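| 1 | R-GenIMA: Integrating Neuroimaging and Genetics with Interpretable Multimodal AI for Alzheimer's Disease Progression | R-GenIMA: an interpretable multimodal AI model for Alzheimer's disease progression that combines neuroimaging and genetic information | large language model, multimodal | | |
| 2 | When Less is More: 8-bit Quantization Improves Continual Learning in Large Language Models | Quantization improves continual learning in large language models: INT8 outperforms FP16 | large language model | ✅ | |
| 3 | Learning Hierarchical Procedural Memory for LLM Agents through Bayesian Selection and Contrastive Refinement | MACLA: learns hierarchical procedural memory for LLM agents through Bayesian selection and contrastive refinement | large language model | | |
| 4 | Brain-Grounded Axes for Reading and Steering LLM States | Proposes a method for reading and steering LLM states based on human brain activity, enabling controllability at the neurophysiological level | large language model | | |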
🔬 Pillar 2: RL Algorithms & Architecture (4 papers)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 5 | Lag Operator SSMs: A Geometric Framework for Structured State Space Modeling | Proposes a geometric framework for structured state space modeling based on the lag operator, simplifying SSM design | Mamba, SSM, state space model | | |
| 6 | Scaling Online Distributionally Robust Reinforcement Learning: Sample-Efficient Guarantees with General Function Approximation | Proposes an online distributionally robust reinforcement learning algorithm that addresses the mismatch between training and deployment environments | reinforcement learning | | |
| 7 | Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies | Proposes Bottom-up Policy Optimization (BuPO), improving LLM performance on complex reasoning tasks | reinforcement learning, large language model | ✅ | |
| 8 | Learning Through Little Eyes: Attribute Discrimination Beyond Objects | Learning from an infant's perspective: a study of attribute discrimination beyond objects | contrastive learning, egocentric | | |