cs.AI(2024-10-19)
📊 共 16 篇论文 | 🔗 2 篇有代码
🎯 兴趣领域导航
🔬 支柱九:具身大模型 (Embodied Foundation Models) (11 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 12 | GDPO: Learning to Directly Align Language Models with Diversity Using GFlowNets | 提出GDPO:利用GFlowNets学习直接对齐语言模型并提升多样性 | reinforcement learning RLHF DPO | ||
| 13 | Cooperation and Fairness in Multi-Agent Reinforcement Learning | 提出基于Min-Max公平目标分配的MARL方法,提升多智能体导航的公平性和效率。 | reinforcement learning | ||
| 14 | Augmented Lagrangian-Based Safe Reinforcement Learning Approach for Distribution System Volt/VAR Control | 提出基于增广拉格朗日的安全强化学习方法,解决配电系统电压/无功控制问题 | reinforcement learning | ||
| 15 | A Novel Reinforcement Learning Model for Post-Incident Malware Investigations | 提出一种新型强化学习模型,用于优化恶意软件事件后的调查取证。 | reinforcement learning | ||
| 16 | Improving Pronunciation and Accent Conversion through Knowledge Distillation And Synthetic Ground-Truth from Native TTS | 提出基于知识蒸馏和原生TTS合成数据的口音转换与发音改进方法 | distillation |