cs.AI(2024-05-29)
📊 共 17 篇论文 | 🔗 2 篇有代码
🎯 兴趣领域导航
🔬 支柱九:具身大模型 (Embodied Foundation Models) (11 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (6 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 12 | LLMs Meet Multimodal Generation and Editing: A Survey | 综述LLM在多模态生成与编辑中的应用,涵盖图像、视频、3D和音频等领域。 | world model large language model multimodal | ✅ | |
| 13 | One-Shot Safety Alignment for Large Language Models via Optimal Dualization | 提出基于最优对偶化的大语言模型单样本安全对齐方法,提升安全性和效率。 | reinforcement learning RLHF large language model | ||
| 14 | Exploring the impact of traffic signal control and connected and automated vehicles on intersections safety: A deep reinforcement learning approach | 提出基于深度强化学习的交通信号控制与自动驾驶协同优化方案,提升交叉口安全性 | reinforcement learning deep reinforcement learning | ||
| 15 | Dr-LLaVA: Visual Instruction Tuning with Symbolic Clinical Grounding | Dr-LLaVA:利用符号临床基础进行视觉指令调优,提升医学VLM的临床推理能力 | reinforcement learning RLHF multimodal | ||
| 16 | Efficient Learning in Chinese Checkers: Comparing Parameter Sharing in Multi-Agent Reinforcement Learning | 在六人跳棋中,全参数共享的多智能体强化学习优于独立和部分共享架构 | reinforcement learning | ✅ | |
| 17 | Why Reinforcement Learning in Energy Systems Needs Explanations | 强调能源系统中强化学习模型可解释性的必要性 | reinforcement learning |