cs.CL(2026-03-09)
📊 共 20 篇论文 | 🔗 4 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (12 🔗2)
支柱二:RL算法与架构 (RL & Architecture) (7 🔗2)
支柱五:交互与反应 (Interaction & Reaction) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (12 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (7 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 13 | SmartThinker: Progressive Chain-of-Thought Length Calibration for Efficient Large Language Model Reasoning | SmartThinker:通过渐进式CoT长度校准提升大语言模型推理效率 | reward design large language model chain-of-thought | ✅ | |
| 14 | Revealing Behavioral Plasticity in Large Language Models: A Token-Conditional Perspective | 提出Token条件强化学习(ToCoRL),实现大语言模型行为模式的精准控制与迁移。 | reinforcement learning large language model | ||
| 15 | High-Fidelity Pruning for Large Language Models | 提出基于信息熵的LLM高保真剪枝方法,提升部署效率 | distillation large language model | ✅ | |
| 16 | TildeOpen LLM: Leveraging Curriculum Learning to Achieve Equitable Language Representation | TildeOpen LLM:利用课程学习实现公平的语言表征 | curriculum learning large language model | ||
| 17 | Toward Robust LLM-Based Judges: Taxonomic Bias Evaluation and Debiasing Optimization | 提出JudgeBiasBench基准,并优化LLM评判偏见,提升自动化评估可靠性。 | reinforcement learning contrastive learning large language model | ||
| 18 | ConflictBench: Evaluating Human-AI Conflict via Interactive and Visually Grounded Environments | 提出ConflictBench,用于评估人机交互中基于视觉环境的冲突对齐问题 | world model large language model | ||
| 19 | Aligning to Illusions: Choice Blindness in Human and AI Feedback | 提出选择盲目性研究以挑战人类反馈在RLHF中的假设 | reinforcement learning RLHF |
🔬 支柱五:交互与反应 (Interaction & Reaction) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 20 | AdaCultureSafe: Adaptive Cultural Safety Grounded by Cultural Knowledge in Large Language Models | AdaCultureSafe:基于文化知识自适应提升大语言模型的文化安全性 | ReMoS large language model |