cs.CL (2025-10-05)
📊 22 papers total | 🔗 2 with code
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (16 🔗1)
Pillar 2: RL & Architecture (5 🔗1)
Pillar 6: Video Extraction (1)
🔬 Pillar 9: Embodied Foundation Models (16 papers)
🔬 Pillar 2: RL & Architecture (5 papers)
| # | Title | One-Sentence Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 17 | Exploring Chain-of-Thought Reasoning for Steerable Pluralistic Alignment | Explores chain-of-thought reasoning to achieve steerable pluralistic alignment. | reinforcement learning, large language model, chain-of-thought | | |
| 18 | Pushing on Multilingual Reasoning Models with Language-Mixed Chain-of-Thought | Proposes Language-Mixed CoT to improve multilingual reasoning models in settings such as Korean. | distillation, chain-of-thought | ✅ | |
| 19 | Teaching LLM to be Persuasive: Reward-Enhanced Policy Optimization for Alignment from Heterogeneous Rewards | Proposes the REPO framework, which optimizes LLMs with heterogeneous rewards to improve persuasiveness in online travel price-negotiation scenarios. | reinforcement learning, PPO, DPO | | |
| 20 | PoLi-RL: A Point-to-List Reinforcement Learning Framework for Conditional Semantic Textual Similarity | Proposes the PoLi-RL framework to address the difficulty of reinforcement learning training for conditional semantic textual similarity. | reinforcement learning, large language model | | |
| 21 | AgriGPT-VL: Agricultural Vision-Language Understanding Suite | AgriGPT-VL: an agricultural vision-language understanding suite that addresses the scarcity of domain-specific models. | reinforcement learning, large language model, multimodal | | |
🔬 Pillar 6: Video Extraction (1 paper)
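| # | Title | One-Sentence Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 22 | Visual Lifelog Retrieval through Captioning-Enhanced Interpretation | Proposes the CIVIL system, which uses captioning-enhanced visual lifelog retrieval to address memory retrieval from a first-person perspective. | first-person view | | |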