cs.CL（2026-05-07）

📊 共 43 篇论文 | 🔗 3 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (29 🔗2) 支柱二：RL算法与架构 (RL & Architecture) (11 🔗1) 支柱八：物理动画 (Physics-based Animation) (1) 支柱一：机器人控制 (Robot Control) (1) 支柱四：生成式动作 (Generative Motion) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (29 篇)

#	题目	一句话要点	标签	🔗
1	Litespark Inference on Consumer CPUs: Custom SIMD Kernels for Ternary Neural Networks	Litespark-Inference：面向消费级CPU的三元神经网络定制SIMD推理加速	large language model
2	The Cost of Context: Mitigating Textual Bias in Multimodal Retrieval-Augmented Generation	提出BAIR框架以解决多模态生成中的文本偏差问题	large language model multimodal
3	TableVista: Benchmarking Multimodal Table Reasoning under Visual and Structural Complexity	提出TableVista基准测试，揭示多模态大模型在复杂视觉与结构化表格推理中的性能瓶颈	foundation model multimodal
4	Decomposing the Basic Abilities of Large Language Models: Mitigating Cross-Task Interference in Multi-Task Instruct-Tuning	提出BADIT框架：通过基本能力分解与正交化LoRA专家缓解多任务指令微调中的跨任务干扰	large language model
5	Reflections and New Directions for Human-Centered Large Language Models	提出以人为中心的大语言模型（HCLLM）框架，实现全生命周期的价值对齐与责任部署	large language model
6	Towards Emotion Consistency Analysis of Large Language Models in Emotional Conversational Contexts	分析大语言模型在情感对话语境下的逻辑一致性与虚假信念易感性	large language model
7	Uncovering Entity Identity Confusion in Multimodal Knowledge Editing	揭示多模态知识编辑中的实体身份混淆问题，并提出基于I-E绑定约束的改进策略	multimodal
8	BioTool: A Comprehensive Tool-Calling Dataset for Enhancing Biomedical Capabilities of Large Language Models	提出BioTool数据集以增强大语言模型在生物医学领域的工具调用能力	large language model	✅
9	Negative Before Positive: Asymmetric Valence Processing in Large Language Models	揭示大语言模型中情感效价的非对称处理机制：基于激活修补与干预的深度分析	large language model
10	IntentGrasp: A Comprehensive Benchmark for Intent Understanding	提出IntentGrasp基准与意图微调（IFT）方法，显著提升大语言模型的意图理解能力	large language model
11	Cognitive Agent Compilation for Explicit Problem Solver Modeling	提出认知智能体编译（CAC）框架，通过显式建模实现教育场景下的可解释与可控问题求解	large language model
12	MELD: Multi-Task Equilibrated Learning Detector for AI-Generated Text	提出MELD多任务平衡学习检测器，通过辅助监督与对抗蒸馏提升AI生成文本检测的鲁棒性	large language model
13	Hallucination as an Anomaly: Dynamic Intervention via Probabilistic Circuits	提出PCNET，通过概率电路动态干预LLM幻觉问题，提升生成真实性。	large language model
14	One Turn Too Late: Response-Aware Defense Against Hidden Malicious Intent in Multi-Turn Dialogue	提出TurnGate防御框架，通过响应感知机制识别多轮对话中的隐蔽恶意意图	large language model	✅
15	SmellBench: Evaluating LLM Agents on Architectural Code Smell Repair	提出SmellBench评估框架，量化评估大模型智能体在架构级代码异味修复中的能力	large language model
16	Can LLMs Take Retrieved Information with a Grain of Salt?	提出基于交互设计的上下文确定性校准策略，显著提升LLM对检索信息置信度的判别与响应能力。	large language model
17	EMO: Pretraining Mixture of Experts for Emergent Modularity	提出EMO预训练框架，通过文档级约束实现混合专家模型（MoE）的涌现式模块化	large language model
18	Cited but Not Verified: Parsing and Evaluating Source Attribution in LLM Deep Research Agents	提出首个LLM深度研究代理引用评估框架，揭示了引用质量与事实准确性之间的严重脱节。	large language model
19	Algospeak, Hiding in the Open: The Trade-off Between Legible Meaning and Detection Avoidance	提出Algospeak评估框架，量化语言规避策略在内容可理解性与检测逃逸间的权衡。	large language model
20	Efficient Pre-Training with Token Superposition	提出Token叠加训练（TST）方法，通过两阶段训练显著提升大模型预训练效率	large language model
21	STALE: Can LLM Agents Know When Their Memories Are No Longer Valid?	提出STALE基准与CUPMem框架，解决LLM智能体在动态环境下的记忆失效与状态更新难题。	large language model
22	SEQUOR: A Multi-Turn Benchmark for Realistic Constraint Following	提出SEQUOR基准测试，揭示大模型在长多轮对话中遵循复杂约束的性能瓶颈	instruction following
23	Quantifying the Statistical Effect of Rubric Modifications on Human-Autorater Agreement	量化评估准则修改对人机评分一致性的统计影响，优化LLM作为裁判的评价效能	instruction following
24	UniPrefill: Universal Long-Context Prefill Acceleration via Block-wise Dynamic Sparsification	提出UniPrefill框架，通过块级动态稀疏化实现通用长上下文预填充加速	large language model
25	Navigating by Old Maps: The Pitfalls of Static Mechanistic Localization in LLM Post-Training	揭示大模型后训练中静态机制定位的局限性，提出电路演化分析框架以应对参数动态更新挑战	large language model
26	From Articles to Premises: Building PrimeFacts, an Extraction Methodology and Resource for Fact-Checking Evidence	提出PrimeFacts数据集与提取框架，通过LLM去语境化重写事实核查证据以提升自动验证性能。	large language model
27	Minimizing Modality Gap from the Input Side: Your Speech LLM Can Be a Prosody-Aware Text LLM	提出TextPro-SLM：通过输入端对齐策略缩小语音大模型模态鸿沟	large language model
28	Evaluation Awareness in Language Models Has Limited Effect on Behaviour	实证研究表明：大型推理模型中的“评估意识”对模型行为的影响极其有限	chain-of-thought
29	A Few Good Clauses: Comparing LLMs vs Domain-Trained Small Language Models on Structured Contract Extraction	提出领域专用小型语言模型Olava Extract，以低成本实现超越前沿大模型的合同结构化抽取能力。	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (11 篇)

#	题目	一句话要点	标签	🔗
30	UniSD: Towards a Unified Self-Distillation Framework for Large Language Models	提出UniSD框架以解决自蒸馏在大语言模型中的挑战	contrastive learning distillation feature matching
31	StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction	StraTA：通过策略轨迹抽象激励Agentic强化学习，提升长程决策能力	reinforcement learning large language model
32	MANTRA: Synthesizing SMT-Validated Compliance Benchmarks for Tool-Using LLM Agents	MANTRA：合成SMT验证的合规性基准，用于工具型LLM Agent	world model world models large language model
33	Continuous Latent Diffusion Language Model	提出Cola DLM：一种连续潜在扩散语言模型，用于高效灵活的文本生成。	representation learning large language model
34	When2Speak: A Dataset for Temporal Participation and Turn-Taking in Multi-Party Conversations for Large Language Models	提出When2Speak数据集与四阶段生成流水线，解决大语言模型在多方对话中的介入时机决策问题。	reinforcement learning reward shaping large language model
35	Milestone-Guided Policy Learning for Long-Horizon Language Agents	提出里程碑引导的策略学习框架BEACON，解决长程语言智能体训练中的信用分配与样本效率难题。	reinforcement learning policy learning reward shaping	✅
36	Estimating the Black-box LLM Uncertainty with Distribution-Aligned Adversarial Distillation	提出分布对齐对抗蒸馏（DisAAD）框架，实现黑盒大模型的高效不确定性量化	distillation large language model
37	Beyond Negative Rollouts: Positive-Only Policy Optimization with Implicit Negative Gradients	提出POPO框架：通过仅正样本策略优化实现大语言模型推理能力的提升	reinforcement learning PPO large language model
38	Rethinking RL for LLM Reasoning: It's Sparse Policy Selection, Not Capability Learning	提出ReasonMaxxer方法：通过稀疏策略选择替代强化学习以提升大模型推理能力	reinforcement learning large language model
39	A$^2$TGPO: Agentic Turn-Group Policy Optimization with Adaptive Turn-level Clipping	提出A$^2$TGPO算法，通过自适应轮次裁剪优化智能体大模型的强化学习过程奖励分配。	reinforcement learning large language model
40	MemReranker: Reasoning-Aware Reranking for Agent Memory Retrieval	提出MemReranker重排序模型，通过多阶段知识蒸馏增强智能体记忆检索的推理能力	contrastive learning distillation

🔬 支柱八：物理动画 (Physics-based Animation) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
41	MIST: Multimodal Interactive Speech-based Tool-calling Conversational Assistants for Smart Homes	提出MIST多模态交互式语音工具调用数据集，以解决智能家居场景下复杂时空约束与动态状态追踪难题。	spatiotemporal large language model multimodal

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
42	Lightweight Stylistic Consistency Profiling: Robust Detection of LLM-Generated Textual Content for Multimedia Moderation	提出LiSCP轻量级风格一致性分析方法，实现多媒体内容中LLM生成文本的鲁棒检测	manipulation large language model multimodal

🔬 支柱四：生成式动作 (Generative Motion) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
43	Bridging Passive and Active: Enhancing Conversation Starter Recommendation via Active Expression Modeling	提出PA-Bridge框架，通过主动表达建模打破对话推荐中的反馈循环与回声室效应	penetration large language model

⬅️ 返回 cs.CL 首页 · 🏠 返回主页

cs.CL（2026-05-07）

🎯 兴趣领域导航

🔬 支柱九：具身大模型 (Embodied Foundation Models) (29 篇)

🔬 支柱二：RL算法与架构 (RL & Architecture) (11 篇)

🔬 支柱八：物理动画 (Physics-based Animation) (1 篇)

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

🔬 支柱四：生成式动作 (Generative Motion) (1 篇)

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册

👤 用户管理