| 16 | Machine Bullshit: Characterizing the Emergent Disregard for Truth in Large Language Models | Proposes a "machine bullshit" framework and accompanying metrics, characterizing the emergent disregard for truth in large language models | reinforcement learning, RLHF, large language model | |
| 17 | Towards Interpretable Time Series Foundation Models | Proposes an instruction-tuned small language model for interpretable time series analysis | distillation, foundation model, multimodal | |
| 18 | Distilling Empathy from Large Language Models | Proposes a two-stage fine-tuning approach built on prompt engineering to distill the empathy of large language models into small language models | distillation, large language model | |
| 19 | Exploring the Limits of Model Compression in LLMs: A Knowledge Distillation Study on QA Tasks | Compresses LLMs via knowledge distillation, exploring the limits of model compression on QA tasks | distillation, large language model | |
| 20 | The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs | Reveals the synergy dilemma between long-CoT SFT and RL for reasoning VLMs, probing the bottlenecks of post-training techniques | reinforcement learning, multimodal, chain-of-thought | |
| 21 | RLEP: Reinforcement Learning with Experience Replay for LLM Reasoning | RLEP: a reinforcement learning method that strengthens LLM reasoning through experience replay | reinforcement learning, large language model | ✅ |
| 22 | SAGE: A Visual Language Model for Anomaly Detection via Fact Enhancement and Entropy-aware Alignment | Proposes SAGE, which tackles industrial anomaly detection with VLMs via fact enhancement and entropy-aware alignment | DPO, direct preference optimization, multimodal | ✅ |
| 23 | Not All Preferences are What You Need for Post-Training: Selective Alignment Strategy for Preference Optimization | Proposes Selective-DPO, a selective alignment strategy that improves the efficiency and accuracy of LLM preference optimization | DPO, distillation, large language model | ✅ |
| 24 | Teaching LLM to Reason: Reinforcement Learning from Algorithmic Problems without Code | TeaR: improves LLM reasoning via reinforcement learning on algorithmic problems, with no code writing required | reinforcement learning, distillation | |
| 25 | PLAN-TUNING: Post-Training Language Models to Learn Step-by-Step Planning for Complex Problem Solving | Proposes PLAN-TUNING, which improves small models on complex problem solving by having them imitate planning processes | reinforcement learning, large language model | |