cs.CL（2026-05-12）

📊 共 45 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (26 🔗2) 支柱二：RL算法与架构 (RL & Architecture) (14) 支柱一：机器人控制 (Robot Control) (4) 支柱七：动作重定向 (Motion Retargeting) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (26 篇)

#	题目	一句话要点	标签	🔗
1	Checkup2Action: A Multimodal Clinical Check-up Report Dataset for Patient-Oriented Action Card Generation	提出Checkup2Action数据集，用于评估多模态临床报告生成患者导向行动卡片的能力	large language model multimodal
2	Human-Grounded Multimodal Benchmark with 900K-Scale Aggregated Student Response Distributions from Japan's National Assessment of Academic Ability	提出Gakucho基准，用于评估多模态大语言模型在真实日本K-12学业评估中的表现。	large language model multimodal	✅
3	Scalable Token-Level Hallucination Detection in Large Language Models	提出TokenHD，实现大规模语言模型中token级别幻觉检测，无需步骤分割。	large language model
4	From Token to Token Pair: Efficient Prompt Compression for Large Language Models in Clinical Prediction	提出MedTPE方法，用于临床预测中LLM的EHR序列高效压缩。	large language model
5	Question Difficulty Estimation for Large Language Models via Answer Plausibility Scoring	提出Q-DAPS方法，通过答案合理性评分估计大语言模型问题难度	large language model
6	Pretraining Exposure Explains Popularity Judgments in Large Language Models	通过预训练数据暴露度解释大语言模型中的流行度判断	large language model
7	Correcting Selection Bias in Sparse User Feedback for Large Language Model Quality Estimation: A Multi-Agent Hierarchical Bayesian Approach	提出多代理层次贝叶斯方法以纠正稀疏用户反馈中的选择偏差	large language model
8	Large Language Models for Causal Relations Extraction in Social Media: A Validation Framework for Disaster Intelligence	提出基于专家知识的评估框架，验证大型语言模型在灾害情报中提取因果关系的能力。	large language model
9	Reconstruction of Personally Identifiable Information from Supervised Finetuned Models	提出COVA算法，用于从监督微调模型中重建个人身份信息（PII）。	large language model instruction following
10	Towards Visually-Guided Movie Subtitle Translation for Indic Languages	提出视觉引导的电影字幕翻译方法，提升印地语等低资源语言的翻译质量	multimodal visual grounding
11	Task-Adaptive Embedding Refinement via Test-time LLM Guidance	提出基于测试时LLM指导的任务自适应嵌入精炼方法，提升零样本检索和分类性能。	instruction following	✅
12	ORBIT: Preserving Foundational Language Capabilities in GenRetrieval via Origin-Regulated Merging	ORBIT：通过源头调控合并，在生成式检索中保留基础语言能力	large language model
13	Stories in Space: In-Context Learning Trajectories in Conceptual Belief Space	提出概念信念空间，以几何视角理解LLM上下文学习中的信念动态变化	large language model
14	Training-Inference Consistent Segmented Execution for Long-Context LLMs	提出训练-推理一致的分段执行框架，提升长文本LLM的效率和可扩展性	large language model
15	Three Regimes of Context-Parametric Conflict: A Predictive Framework and Empirical Validation	提出上下文参数冲突三阶段框架，预测并验证大语言模型知识更新行为	large language model
16	Taming Extreme Tokens: Covariance-Aware GRPO with Gaussian-Kernel Advantage Reweighting	提出基于协方差感知的GRPO方法，通过高斯核优势重加权稳定大语言模型推理能力。	large language model
17	The Algorithmic Caricature: Auditing LLM-Generated Political Discourse Across Crisis Events	提出算法漫画方法，通过对比真实与生成政治言论，评估LLM在危机事件中生成内容的人口真实性。	large language model
18	Mitigating Context-Memory Conflicts in LLMs through Dynamic Cognitive Reconciliation Decoding	提出DCRD动态认知协调解码，缓解大语言模型中的上下文-记忆冲突	large language model
19	Geometric Factual Recall in Transformers	揭示Transformer中几何事实记忆机制，突破参数线性增长瓶颈	chain-of-thought
20	Latent Causal Void: Explicit Missing-Context Reconstruction for Misinformation Detection	提出隐性因果空洞(LCV)，通过显式重建缺失上下文来提升信息检测性能。	large language model
21	Metaphor Is Not All Attention Needs	诗意越狱并非仅依赖注意力机制，而是源于风格不规则性对LLM处理方式的改变	large language model
22	Do Language Models Encode Knowledge of Linguistic Constraint Violations?	提出稀疏自编码器以检测语言模型中的语法约束违规特征	large language model
23	Safety-Oriented Evaluation of Language Understanding Systems for Air Traffic Control	提出安全导向评估框架以解决空中交通管制语言理解系统的可靠性问题	large language model
24	Robust LLM Unlearning Against Relearning Attacks: The Minor Components in Representations Matter	提出MCU方法，通过优化表征中的次要成分，增强LLM抗重学习攻击的卸载能力	large language model
25	StoicLLM: Preference Optimization for Philosophical Alignment in Small Language Models	StoicLLM：小模型哲学对齐的偏好优化方法	large language model
26	Freeze Deep, Train Shallow: Interpretable Layer Allocation for Continued Pre-Training	提出LayerTracer以解决大语言模型继续预训练中的层分配问题	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (14 篇)

#	题目	一句话要点	标签
27	OmniThoughtVis: A Scalable Distillation Pipeline for Deployable Multimodal Reasoning Models	提出OmniThoughtVis，用于可部署多模态推理模型的可扩展蒸馏流水线。	distillation large language model multimodal
28	Combining On-Policy Optimization and Distillation for Long-Context Reasoning in Large Language Models	提出dGRPO方法，结合On-Policy优化与知识蒸馏，提升大语言模型长文本推理能力。	reinforcement learning distillation large language model
29	A Study on Hidden Layer Distillation for Large Language Model Pre-Training	研究大型语言模型预训练中的隐层蒸馏方法，探索其性能瓶颈与潜在价值	distillation large language model
30	Qwen-Scope: Turning Sparse Features into Development Tools for Large Language Models	Qwen-Scope：将稀疏特征转化为大语言模型开发工具	reinforcement learning large language model
31	Enhancing Multilingual Counterfactual Generation through Alignment-as-Preference Optimization	提出Macro框架，通过偏好对齐优化提升多语言反事实生成效果	DPO direct preference optimization large language model
32	TextSeal: A Localized LLM Watermark for Provenance & Distillation Protection	TextSeal：一种用于溯源和蒸馏保护的局部化LLM水印方案	distillation large language model
33	SkillGraph: Skill-Augmented Reinforcement Learning for Agents via Evolving Skill Graphs	提出SkillGraph，通过演化技能图增强智能体强化学习能力	reinforcement learning large language model
34	On Predicting the Post-training Potential of Pre-trained LLMs	提出RuDE框架，通过判别而非生成方式预测预训练LLM的后训练潜力，提升模型选择效率。	reinforcement learning large language model foundation model
35	Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation	揭示On-Policy蒸馏效率之谜：提出EffOPD加速训练并保持性能。	distillation large language model
36	TokenRatio: Principled Token-Level Preference Optimization via Ratio Matching	TokenRatio：通过比例匹配实现原则性的Token级别偏好优化	DPO direct preference optimization instruction following
37	SAGE: Scalable Automated Robustness Augmentation for LLM Knowledge Evaluation	SAGE：一种可扩展的自动化鲁棒性增强框架，用于评估LLM知识	reinforcement learning large language model
38	Learning Agentic Policy from Action Guidance	提出ActGuide-RL，利用行动指导提升LLM智能体策略探索能力，降低对SFT数据的依赖。	reinforcement learning large language model
39	StepCodeReasoner: Aligning Code Reasoning with Stepwise Execution Traces via Reinforcement Learning	StepCodeReasoner：通过强化学习对齐代码推理与逐步执行轨迹	reinforcement learning
40	Agent-BRACE: Decoupling Beliefs from Actions in Long-Horizon Tasks via Verbalized State Uncertainty	Agent-BRACE：通过语言化的状态不确定性解耦信念与动作，解决长时程任务中的挑战。	reinforcement learning large language model

🔬 支柱一：机器人控制 (Robot Control) (4 篇)

#	题目	一句话要点	标签
41	When Emotion Becomes Trigger: Emotion-style dynamic Backdoor Attack Parasitising Large Language Models	提出Paraesthesia，一种基于情感风格的动态后门攻击方法，寄生于大型语言模型。	manipulation large language model instruction following
42	YFPO: A Preliminary Study of Yoked Feature Preference Optimization with Neuron-Guided Rewards for Mathematical Reasoning	YFPO：一种神经元引导奖励的Yoked特征偏好优化方法，用于提升数学推理能力。	manipulation preference learning large language model
43	PreScam: A Benchmark for Predicting Scam Progression from Early Conversations	PreScam：一个基于早期对话预测诈骗进展的基准数据集	manipulation
44	Is Child-Directed Language Optimized for Word Learning? A Computational Study of Verb Meaning Acquisition	通过计算模型研究儿童语言对动词学习的优化效果	manipulation

🔬 支柱七：动作重定向 (Motion Retargeting) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
45	Predicting Disagreement with Human Raters in LLM-as-a-Judge Difficulty Assessment without Using Generation-Time Probability Signals	提出一种无需生成概率信号的LLM难度评估不一致性预测方法	geometric consistency large language model

⬅️ 返回 cs.CL 首页 · 🏠 返回主页

cs.CL（2026-05-12）

🎯 兴趣领域导航

🔬 支柱九：具身大模型 (Embodied Foundation Models) (26 篇)

🔬 支柱二：RL算法与架构 (RL & Architecture) (14 篇)

🔬 支柱一：机器人控制 (Robot Control) (4 篇)

🔬 支柱七：动作重定向 (Motion Retargeting) (1 篇)

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册

👤 用户管理