cs.CL（2025-06-23）

📊 共 25 篇论文 | 🔗 7 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (18 🔗5) 支柱二：RL算法与架构 (RL & Architecture) (3 🔗2) 支柱一：机器人控制 (Robot Control) (3) 支柱五：交互与反应 (Interaction & Reaction) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (18 篇)

#	题目	一句话要点	标签	🔗	⭐
1	STU-PID: Steering Token Usage via PID Controller for Efficient Large Language Model Reasoning	提出STU-PID以解决大语言模型推理效率问题	large language model chain-of-thought
2	RWESummary: A Framework and Test for Choosing Large Language Models to Summarize Real-World Evidence (RWE) Studies	提出RWESummary框架以评估大语言模型在RWE研究总结中的表现	large language model foundation model
3	Parallel Continuous Chain-of-Thought with Jacobi Iteration	提出并行连续思维链方法以提升推理效率	large language model chain-of-thought	✅
4	MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis	提出MedTVT-R1以解决多疾病诊断的挑战	large language model multimodal	✅
5	Benchmarking the Pedagogical Knowledge of Large Language Models	提出教学知识基准以评估大型语言模型的教育能力	large language model
6	Is There a Case for Conversation Optimized Tokenizers in Large Language Models?	提出对话优化的分词器以提升大型语言模型的效率	large language model
7	TReB: A Comprehensive Benchmark for Evaluating Table Reasoning Capabilities of Large Language Models	提出TReB基准以评估大型语言模型的表格推理能力	large language model
8	Enhancing Document Retrieval in COVID-19 Research: Leveraging Large Language Models for Hidden Relation Extraction	提出Covrelex-SE系统以提升COVID-19研究文献检索效率	large language model
9	A Survey of AIOps in the Era of Large Language Models	综述大语言模型在AIOps中的应用与挑战	large language model
10	Less Data Less Tokens: Multilingual Unification Learning for Efficient Test-Time Reasoning in LLMs	提出L²多语言统一学习以解决大语言模型测试时推理效率问题	large language model chain-of-thought
11	Quantifying Fairness in LLMs Beyond Tokens: A Semantic and Statistical Perspective	提出FiSCo框架以解决LLMs公平性评估问题	large language model
12	OMEGA: Can LLMs Reason Outside the Box in Math? Evaluating Exploratory, Compositional, and Transformative Generalization	提出OMEGA基准以评估LLMs在数学推理中的创新能力	chain-of-thought
13	CommVQ: Commutative Vector Quantization for KV Cache Compression	提出CommVQ以解决长上下文LLM推理中的KV缓存瓶颈问题	large language model	✅
14	From Web Search towards Agentic Deep Research: Incentivizing Search with Reasoning Agents	提出基于推理代理的深度研究方法以提升信息检索能力	large language model	✅
15	Existing LLMs Are Not Self-Consistent For Simple Tasks	提出不一致性度量与自动化方法以解决LLM自洽性问题	large language model	✅
16	The Anatomy of Speech Persuasion: Linguistic Shifts in LLM-Modified Speeches	提出一种新方法分析大型语言模型对演讲说服力的理解	large language model
17	Reply to "Emergent LLM behaviors are observationally equivalent to data leakage"	澄清LLM群体中自组织与模型依赖的动态研究	large language model
18	Team LA at SCIDOCA shared task 2025: Citation Discovery via relation-based zero-shot retrieval	提出基于关系的零-shot检索方法以解决引用发现问题	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (3 篇)

#	题目	一句话要点	标签	🔗	⭐
19	ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs	提出ReasonFlux-PRM以解决长链推理中的奖励评估问题	reinforcement learning distillation large language model	✅
20	LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning	提出LongWriter-Zero以解决超长文本生成问题	reinforcement learning large language model	✅
21	USAD: Universal Speech and Audio Representation via Distillation	提出USAD以解决音频表示学习的领域特定问题	representation learning distillation

🔬 支柱一：机器人控制 (Robot Control) (3 篇)

#	题目	一句话要点	标签	🔗	⭐
22	How Large Language Models play humans in online conversations: a simulated study of the 2016 US politics on Reddit	评估大型语言模型在2016年美国政治讨论中的表现	manipulation large language model
23	Language Models Might Not Understand You: Evaluating Theory of Mind via Story Prompting	提出StorySim框架以评估语言模型的心智理论能力	manipulation world model large language model
24	Broken Tokens? Your Language Model can Secretly Handle Non-Canonical Tokenizations	研究非标准分词对语言模型性能的影响	manipulation

🔬 支柱五：交互与反应 (Interaction & Reaction) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
25	The Open Proof Corpus: A Large-Scale Study of LLM-Generated Mathematical Proofs	提出开放证明语料库以推动数学证明生成研究	IMoS large language model

⬅️ 返回 cs.CL 首页 · 🏠 返回主页