cs.AI(2024-06-11)
📊 共 5 篇论文
🎯 兴趣领域导航
支柱二:RL算法与架构 (RL & Architecture) (2)
支柱九:具身大模型 (Embodied Foundation Models) (2)
支柱六:视频提取与匹配 (Video Extraction) (1)
🔬 支柱二:RL算法与架构 (RL & Architecture) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | 3D-Properties: Identifying Challenges in DPO and Charting a Path Forward | 揭示DPO训练挑战:3D属性问题及改进方案 | PPO preference learning DPO | ||
| 2 | Learning Reward and Policy Jointly from Demonstration and Preference Improves Alignment | 提出AIHF,通过联合学习奖励和策略,提升人类对齐效果 | reinforcement learning RLHF DPO |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 3 | Large Language Models for Constrained-Based Causal Discovery | 利用大语言模型进行约束式因果发现,提升因果图构建效率 | large language model chain-of-thought | ||
| 4 | Autograding Mathematical Induction Proofs with Natural Language Processing | 利用自然语言处理自动评估数学归纳法证明,提升教学反馈效率。 | large language model |
🔬 支柱六:视频提取与匹配 (Video Extraction) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 5 | The MuSe 2024 Multimodal Sentiment Analysis Challenge: Social Perception and Humor Recognition | MuSe 2024挑战赛:多模态情感分析,关注社会感知与幽默识别 | HuMoR multimodal |