cs.AI(2024-06-11)

📊 共 5 篇论文

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (2) 支柱九:具身大模型 (Embodied Foundation Models) (2) 支柱六:视频提取与匹配 (Video Extraction) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (2 篇)

#题目一句话要点标签🔗
1 3D-Properties: Identifying Challenges in DPO and Charting a Path Forward 揭示DPO训练挑战:3D属性问题及改进方案 PPO preference learning DPO
2 Learning Reward and Policy Jointly from Demonstration and Preference Improves Alignment 提出AIHF,通过联合学习奖励和策略,提升人类对齐效果 reinforcement learning RLHF DPO

🔬 支柱九:具身大模型 (Embodied Foundation Models) (2 篇)

#题目一句话要点标签🔗
3 Large Language Models for Constrained-Based Causal Discovery 利用大语言模型进行约束式因果发现,提升因果图构建效率 large language model chain-of-thought
4 Autograding Mathematical Induction Proofs with Natural Language Processing 利用自然语言处理自动评估数学归纳法证明,提升教学反馈效率。 large language model

🔬 支柱六:视频提取与匹配 (Video Extraction) (1 篇)

#题目一句话要点标签🔗
5 The MuSe 2024 Multimodal Sentiment Analysis Challenge: Social Perception and Humor Recognition MuSe 2024挑战赛:多模态情感分析,关注社会感知与幽默识别 HuMoR multimodal

⬅️ 返回 cs.AI 首页 · 🏠 返回主页