cs.CL(2025-06-23)

📊 共 25 篇论文 | 🔗 7 篇有代码

🎯 兴趣领域导航

支柱九:具身大模型 (Embodied Foundation Models) (18 🔗5) 支柱二:RL算法与架构 (RL & Architecture) (3 🔗2) 支柱一:机器人控制 (Robot Control) (3) 支柱五:交互与反应 (Interaction & Reaction) (1)

🔬 支柱九:具身大模型 (Embodied Foundation Models) (18 篇)

#题目一句话要点标签🔗
1 STU-PID: Steering Token Usage via PID Controller for Efficient Large Language Model Reasoning 提出STU-PID以解决大语言模型推理效率问题 large language model chain-of-thought
2 RWESummary: A Framework and Test for Choosing Large Language Models to Summarize Real-World Evidence (RWE) Studies 提出RWESummary框架以评估大语言模型在RWE研究总结中的表现 large language model foundation model
3 Parallel Continuous Chain-of-Thought with Jacobi Iteration 提出并行连续思维链方法以提升推理效率 large language model chain-of-thought
4 MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis 提出MedTVT-R1以解决多疾病诊断的挑战 large language model multimodal
5 Benchmarking the Pedagogical Knowledge of Large Language Models 提出教学知识基准以评估大型语言模型的教育能力 large language model
6 Is There a Case for Conversation Optimized Tokenizers in Large Language Models? 提出对话优化的分词器以提升大型语言模型的效率 large language model
7 TReB: A Comprehensive Benchmark for Evaluating Table Reasoning Capabilities of Large Language Models 提出TReB基准以评估大型语言模型的表格推理能力 large language model
8 Enhancing Document Retrieval in COVID-19 Research: Leveraging Large Language Models for Hidden Relation Extraction 提出Covrelex-SE系统以提升COVID-19研究文献检索效率 large language model
9 A Survey of AIOps in the Era of Large Language Models 综述大语言模型在AIOps中的应用与挑战 large language model
10 Less Data Less Tokens: Multilingual Unification Learning for Efficient Test-Time Reasoning in LLMs 提出L²多语言统一学习以解决大语言模型测试时推理效率问题 large language model chain-of-thought
11 Quantifying Fairness in LLMs Beyond Tokens: A Semantic and Statistical Perspective 提出FiSCo框架以解决LLMs公平性评估问题 large language model
12 OMEGA: Can LLMs Reason Outside the Box in Math? Evaluating Exploratory, Compositional, and Transformative Generalization 提出OMEGA基准以评估LLMs在数学推理中的创新能力 chain-of-thought
13 CommVQ: Commutative Vector Quantization for KV Cache Compression 提出CommVQ以解决长上下文LLM推理中的KV缓存瓶颈问题 large language model
14 From Web Search towards Agentic Deep Research: Incentivizing Search with Reasoning Agents 提出基于推理代理的深度研究方法以提升信息检索能力 large language model
15 Existing LLMs Are Not Self-Consistent For Simple Tasks 提出不一致性度量与自动化方法以解决LLM自洽性问题 large language model
16 The Anatomy of Speech Persuasion: Linguistic Shifts in LLM-Modified Speeches 提出一种新方法分析大型语言模型对演讲说服力的理解 large language model
17 Reply to "Emergent LLM behaviors are observationally equivalent to data leakage" 澄清LLM群体中自组织与模型依赖的动态研究 large language model
18 Team LA at SCIDOCA shared task 2025: Citation Discovery via relation-based zero-shot retrieval 提出基于关系的零-shot检索方法以解决引用发现问题 large language model

🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)

#题目一句话要点标签🔗
19 ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs 提出ReasonFlux-PRM以解决长链推理中的奖励评估问题 reinforcement learning distillation large language model
20 LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning 提出LongWriter-Zero以解决超长文本生成问题 reinforcement learning large language model
21 USAD: Universal Speech and Audio Representation via Distillation 提出USAD以解决音频表示学习的领域特定问题 representation learning distillation

🔬 支柱一:机器人控制 (Robot Control) (3 篇)

#题目一句话要点标签🔗
22 How Large Language Models play humans in online conversations: a simulated study of the 2016 US politics on Reddit 评估大型语言模型在2016年美国政治讨论中的表现 manipulation large language model
23 Language Models Might Not Understand You: Evaluating Theory of Mind via Story Prompting 提出StorySim框架以评估语言模型的心智理论能力 manipulation world model large language model
24 Broken Tokens? Your Language Model can Secretly Handle Non-Canonical Tokenizations 研究非标准分词对语言模型性能的影响 manipulation

🔬 支柱五:交互与反应 (Interaction & Reaction) (1 篇)

#题目一句话要点标签🔗
25 The Open Proof Corpus: A Large-Scale Study of LLM-Generated Mathematical Proofs 提出开放证明语料库以推动数学证明生成研究 IMoS large language model

⬅️ 返回 cs.CL 首页 · 🏠 返回主页