cs.LG(2024-09-06)
📊 共 7 篇论文
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (3)
支柱二:RL算法与架构 (RL & Architecture) (3)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | OPAL: Outlier-Preserved Microscaling Quantization Accelerator for Generative Large Language Models | 提出OPAL以解决大语言模型的内存和带宽挑战 | large language model | ||
| 2 | Detecting Buggy Contracts via Smart Testing | SmartSys:基于自决策大模型的智能合约漏洞检测系统 | foundation model | ||
| 3 | CubicML: Automated ML for Large ML Systems Co-design with ML Prediction of Performance | CubicML:利用机器学习自动优化大规模机器学习系统训练性能 | large language model |
🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 4 | Stacked Universal Successor Feature Approximators for Safety in Reinforcement Learning | 提出堆叠通用后继特征近似器(SUSFAS)以提升强化学习安全性 | reinforcement learning SAC distillation | ||
| 5 | Gaussian-Mixture-Model Q-Functions for Reinforcement Learning by Riemannian Optimization | 提出基于黎曼优化的GMM Q函数,用于强化学习策略评估。 | reinforcement learning | ||
| 6 | Contrastive Learning-based User Identification with Limited Data on Smart Textiles | 提出基于对比学习的智能纺织品用户识别方法,解决数据量有限难题 | contrastive learning |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 7 | Goal-Reaching Policy Learning from Non-Expert Observations via Effective Subgoal Guidance | 提出基于子目标引导的策略学习方法,解决非专家观测数据下的长时程目标达成问题 | manipulation policy learning |