cs.CL (2025-01-28)
📊 24 papers in total | 🔗 3 with code
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (18 🔗1)
Pillar 2: RL & Architecture (5 🔗2)
Pillar 6: Video Extraction (1)
🔬 Pillar 9: Embodied Foundation Models (18 papers)
🔬 Pillar 2: RL & Architecture (5 papers)
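| # | Title | One-sentence summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 19 | CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs | Proposes CHiP, a cross-modal hierarchical direct preference optimization method that mitigates hallucination in multimodal LLMs. | DPO, direct preference optimization, large language model | ✅ | |
| 20 | Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction | MERA: uses large language models for clinical diagnosis prediction, addressing data scarcity and the large candidate disease space. | contrastive learning, large language model | | |
| 21 | Multimodal Magic Elevating Depression Detection with a Fusion of Text and Audio Intelligence | Proposes a multimodal depression detection model based on a teacher-student architecture, improving the fusion of text and audio features. | teacher-student, multimodal | | |
| 22 | xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking | Proposes xJailbreak, which uses representation-space-guided reinforcement learning to carry out interpretable LLM jailbreak attacks. | reinforcement learning, large language model | ✅ | |
| 23 | COS(M+O)S: Curiosity and RL-Enhanced MCTS for Exploring Story Space via Language Models | COS(M+O)S: MCTS enhanced with curiosity and reinforcement learning, used to explore the story space of language models. | reinforcement learning, chain-of-thought | | |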
🔬 Pillar 6: Video Extraction (1 paper)
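| # | Title | One-sentence summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 24 | Why Do We Laugh? Annotation and Taxonomy Generation for Laughable Contexts in Spontaneous Text Conversation | Proposes an LLM-assisted method for classifying laughable contexts, aimed at making conversational AI interact more naturally. | HuMoR | | |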