cs.LG (2025-05-10)
📊 3 papers in total
🎯 Interest Area Navigation
Pillar 1: Robot Control (1)
Pillar 2: RL & Architecture (1)
Pillar 9: Embodied Foundation Models (1)
🔬 Pillar 1: Robot Control (1 paper)
| # | Title | Key Point | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach | Proposes video-enhanced offline reinforcement learning to address insufficient environment interaction | manipulation, reinforcement learning, offline RL | | |
🔬 Pillar 2: RL & Architecture (1 paper)
| # | Title | Key Point | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 2 | Model Steering: Learning with a Reference Model Improves Generalization Bounds and Scaling Laws | Proposes a model steering method to improve model generalization and scaling | contrastive learning, foundation model | | |
🔬 Pillar 9: Embodied Foundation Models (1 paper)
| # | Title | Key Point | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 3 | Improving Block-Wise LLM Quantization by 4-bit Block-Wise Optimal Float (BOF4): Analysis and Variations | Proposes BOF4 to optimize block-wise quantization and reduce LLM memory requirements | large language model | | |