cs.LG(2024-09-02)

📊 共 7 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (5) 支柱九:具身大模型 (Embodied Foundation Models) (2 🔗2)

🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)

#题目一句话要点标签🔗
1 Imitating Language via Scalable Inverse Reinforcement Learning 提出基于逆强化学习的语言模型微调方法,提升生成质量和多样性。 reinforcement learning imitation learning inverse reinforcement learning
2 Large Language Models versus Classical Machine Learning: Performance in COVID-19 Mortality Prediction Using High-Dimensional Tabular Data 对比LLM与传统机器学习在COVID-19死亡率预测中的性能,发现传统方法更优 predictive model large language model
3 Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy Optimization 提出PPO-DAP,通过扩散模型提升PPO在连续控制任务中的样本效率和探索能力 reinforcement learning PPO
4 Revisiting Safe Exploration in Safe Reinforcement learning 提出基于EMCC的SafeRL方法,解决传统方法在安全探索中的风险问题 reinforcement learning
5 Real-Time Recurrent Learning using Trace Units in Reinforcement Learning 提出基于Trace Units的实时循环学习方法,提升强化学习在部分可观测环境中的性能。 reinforcement learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (2 篇)

#题目一句话要点标签🔗
6 Efficient and Scalable Estimation of Tool Representations in Vector Space 提出高效可扩展的工具表示估计方法以解决LLM工具检索问题 large language model
7 ToolACE: Winning the Points of LLM Function Calling ToolACE:通过自进化Agent流程生成高质量函数调用数据,显著提升LLM工具使用能力 large language model

⬅️ 返回 cs.LG 首页 · 🏠 返回主页