cs.LG(2024-09-06)

📊 共 7 篇论文

🎯 兴趣领域导航

支柱九:具身大模型 (Embodied Foundation Models) (3) 支柱二:RL算法与架构 (RL & Architecture) (3) 支柱一:机器人控制 (Robot Control) (1)

🔬 支柱九:具身大模型 (Embodied Foundation Models) (3 篇)

#题目一句话要点标签🔗
1 OPAL: Outlier-Preserved Microscaling Quantization Accelerator for Generative Large Language Models 提出OPAL以解决大语言模型的内存和带宽挑战 large language model
2 Detecting Buggy Contracts via Smart Testing SmartSys:基于自决策大模型的智能合约漏洞检测系统 foundation model
3 CubicML: Automated ML for Large ML Systems Co-design with ML Prediction of Performance CubicML:利用机器学习自动优化大规模机器学习系统训练性能 large language model

🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)

#题目一句话要点标签🔗
4 Stacked Universal Successor Feature Approximators for Safety in Reinforcement Learning 提出堆叠通用后继特征近似器(SUSFAS)以提升强化学习安全性 reinforcement learning SAC distillation
5 Gaussian-Mixture-Model Q-Functions for Reinforcement Learning by Riemannian Optimization 提出基于黎曼优化的GMM Q函数,用于强化学习策略评估。 reinforcement learning
6 Contrastive Learning-based User Identification with Limited Data on Smart Textiles 提出基于对比学习的智能纺织品用户识别方法,解决数据量有限难题 contrastive learning

🔬 支柱一:机器人控制 (Robot Control) (1 篇)

#题目一句话要点标签🔗
7 Goal-Reaching Policy Learning from Non-Expert Observations via Effective Subgoal Guidance 提出基于子目标引导的策略学习方法,解决非专家观测数据下的长时程目标达成问题 manipulation policy learning

⬅️ 返回 cs.LG 首页 · 🏠 返回主页