cs.CL(2025-07-27)
📊 共 17 篇论文 | 🔗 3 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (13 🔗2)
支柱二:RL算法与架构 (RL & Architecture) (3)
支柱六:视频提取与匹配 (Video Extraction) (1 🔗1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (13 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 14 | SGPO: Self-Generated Preference Optimization based on Self-Improver | 提出SGPO:基于自提升器的自生成偏好优化,无需人工标注数据对齐LLM。 | policy learning DPO direct preference optimization | ||
| 15 | Sem-DPO: Mitigating Semantic Inconsistency in Preference Optimization for Prompt Engineering | 提出Sem-DPO,通过语义一致性约束优化提示工程,提升文本到图像生成质量。 | DPO direct preference optimization | ||
| 16 | Diversity-Enhanced Reasoning for Subjective Questions | 提出MultiRole-R1框架,通过增强视角和token多样性提升主观问题推理能力。 | reinforcement learning reward shaping chain-of-thought |
🔬 支柱六:视频提取与匹配 (Video Extraction) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 17 | Multi-Stage Verification-Centric Framework for Mitigating Hallucination in Multi-Modal RAG | 提出多阶段验证中心框架,缓解多模态RAG中的幻觉问题 | egocentric | ✅ |