cs.CL(2025-09-29)

📊 共 55 篇论文 | 🔗 9 篇有代码

🎯 兴趣领域导航

支柱九:具身大模型 (Embodied Foundation Models) (38 🔗8) 支柱二:RL算法与架构 (RL & Architecture) (13 🔗1) 支柱八:物理动画 (Physics-based Animation) (1) 支柱七:动作重定向 (Motion Retargeting) (1) 支柱一:机器人控制 (Robot Control) (1) 支柱六:视频提取与匹配 (Video Extraction) (1)

🔬 支柱九:具身大模型 (Embodied Foundation Models) (38 篇)

#题目一句话要点标签🔗
1 Multimodal Large Language Models Meet Multimodal Emotion Recognition and Reasoning: A Survey 综述多模态大语言模型在情感识别与推理中的应用与挑战 large language model multimodal
2 Metaphor identification using large language models: A comparison of RAG, prompt engineering, and fine-tuning 利用大型语言模型进行隐喻识别:比较RAG、提示工程和微调方法 large language model chain-of-thought
3 Towards Structured Knowledge: Advancing Triple Extraction from Regional Trade Agreements using Large Language Models 利用大型语言模型从区域贸易协定中提取结构化知识三元组 large language model
4 Learning to Parallel: Accelerating Diffusion Large Language Models via Learnable Parallel Decoding 提出Learn2PD以解决大语言模型推理速度瓶颈问题 large language model
5 Pretraining Large Language Models with NVFP4 提出NVFP4训练方法,实现4-bit精度下大规模语言模型的稳定高效预训练。 large language model
6 GateMABSA: Aspect-Image Gated Fusion for Multimodal Aspect-based Sentiment Analysis 提出GateMABSA模型,通过门控多模态融合解决多模态情感分析中噪声过滤和跨模态对齐问题。 multimodal
7 Understanding the Dilemma of Unlearning for Large Language Models 提出unPact框架,揭示大语言模型不可靠的知识遗忘现象与机理。 large language model
8 Sanitize Your Responses: Mitigating Privacy Leakage in Large Language Models 提出Self-Sanitize框架,缓解大语言模型中的隐私泄露问题。 large language model
9 CDT: A Comprehensive Capability Framework for Large Language Models Across Cognition, Domain, and Task 提出CDT框架,从认知、领域和任务三维度全面评估大语言模型能力。 large language model
10 AlignX: Advancing Multilingual Large Language Models with Multilingual Representation Alignment AlignX:通过多语言表示对齐提升多语言大语言模型性能 large language model
11 DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models DiffuGuard:揭示并修复扩散大语言模型中固有的安全漏洞 large language model
12 MobileLLM-R1: Exploring the Limits of Sub-Billion Language Model Reasoners with Open Training Recipes MobileLLM-R1:通过开放训练方案探索十亿参数以下语言模型推理能力的极限 large language model chain-of-thought
13 InfLLM-V2: Dense-Sparse Switchable Attention for Seamless Short-to-Long Adaptation 提出InfLLM-V2:一种稠密-稀疏可切换注意力机制,实现模型从短序列到长序列的无缝适应。 large language model chain-of-thought
14 AdaThink-Med: Medical Adaptive Thinking with Uncertainty-Guided Length Calibration AdaThink-Med:提出一种不确定性引导长度校准的医学自适应思考框架 large language model chain-of-thought
15 Dual Mechanisms of Value Expression: Intrinsic vs. Prompted Values in LLMs 揭示LLM中内在与提示价值观表达的双重机制,并分析其差异性。 large language model instruction following
16 Calibrating Verbalized Confidence with Self-Generated Distractors 提出DINCO,通过自生成干扰项校准LLM的置信度,提升可靠性。 large language model
17 Not Wrong, But Untrue: LLM Overconfidence in Document-Based Queries LLM在文档问答中过度自信:揭示新闻场景下的幻觉问题与溯源挑战 large language model
18 The Rise of AfricaNLP: Contributions, Contributors, and Community Impact (2005-2025) AfricaNLP贡献分析:追踪非洲自然语言处理研究进展与社区影响 large language model
19 Fingerprinting LLMs via Prompt Injection LLMPrint:利用Prompt注入为LLM构建鲁棒指纹,实现模型溯源 large language model
20 Generative Value Conflicts Reveal LLM Priorities ConflictScope:揭示LLM在价值冲突下的优先级偏好 large language model
21 From Internal Representations to Text Quality: A Geometric Approach to LLM Evaluation 利用内部表征几何特性评估LLM文本质量,实现无参考文本质量评估。 large language model
22 Investigating Language and Retrieval Bias in Multilingual Previously Fact-Checked Claim Detection 研究多语言预训练模型在跨语言事实核查中的语言和检索偏差 large language model
23 Learning from Convenience Samples: A Case Study on Fine-Tuning LLMs for Survey Non-response in the German Longitudinal Election Study 微调LLM解决调查非回应问题,利用便利样本提升选举研究准确性 large language model
24 Hyperdimensional Probe: Decoding LLM Representations via Vector Symbolic Architectures 提出超维探针,通过向量符号架构解码大型语言模型表征 large language model
25 How Well Do LLMs Imitate Human Writing Style? 提出一种快速免训练框架,用于评估大型语言模型模仿人类写作风格的能力 large language model
26 BOE-XSUM: Extreme Summarization in Clear Language of Spanish Legal Decrees and Notifications BOE-XSUM:发布西班牙法律公文的明晰语言极端摘要数据集,并验证LLM微调有效性 large language model
27 Expanding Computation Spaces of LLMs at Inference Time 提出一种推理时扩展LLM计算空间的方法,提升问题解决能力 chain-of-thought
28 SemShareKV: Efficient KVCache Sharing for Semantically Similar Prompts via Token-Level LSH Matching SemShareKV:通过Token级LSH匹配为语义相似Prompt高效共享KVCache large language model
29 Hallucination is Inevitable for LLMs with the Open World Assumption 重新审视大语言模型幻觉现象:开放世界假设下的必然产物 large language model
30 ProxyAttn: Guided Sparse Attention via Representative Heads ProxyAttn:通过代表性注意力头引导的稀疏注意力机制,加速长文本处理。 large language model
31 Think Twice, Generate Once: Safeguarding by Progressive Self-Reflection 提出渐进式自反思(PSR)方法,提升大语言模型生成内容的安全性。 large language model
32 MemGen: Weaving Generative Latent Memory for Self-Evolving Agents MemGen:为自进化Agent构建生成式潜在记忆,提升推理能力 large language model
33 Bias Mitigation or Cultural Commonsense? Evaluating LLMs with a Japanese Dataset 提出SOBACO:评估日语LLM社会偏见与文化常识的统一基准 large language model
34 HarmMetric Eval: Benchmarking Metrics and Judges for LLM Harmfulness Assessment 提出HarmMetric Eval,用于全面评估LLM有害性评估指标与判别器的质量。 large language model
35 MAS$^2$: Self-Generative, Self-Configuring, Self-Rectifying Multi-Agent Systems 提出MAS$^2$,一种自生成、自配置、自校正的多智能体系统,提升复杂任务性能。 large language model
36 Training Dynamics of Parametric and In-Context Knowledge Utilization in Language Models 研究训练条件对语言模型参数化知识和上下文知识利用的影响 large language model
37 Beyond Manuals and Tasks: Instance-Level Context Learning for LLM Agents 提出实例级上下文学习方法,提升LLM Agent在复杂任务中的表现 large language model
38 SimuHome: A Temporal- and Environment-Aware Benchmark for Smart Home LLM Agents SimuHome:面向智能家居LLM代理的时间与环境感知基准测试 large language model

🔬 支柱二:RL算法与架构 (RL & Architecture) (13 篇)

#题目一句话要点标签🔗
39 GRPO-MA: Multi-Answer Generation in GRPO for Stable and Efficient Chain-of-Thought Training GRPO-MA:通过多答案生成提升GRPO在CoT训练中的稳定性和效率 reinforcement learning large language model multimodal
40 RFG: Test-Time Scaling for Diffusion Large Language Model Reasoning with Reward-Free Guidance 提出RFG:一种免奖励引导的扩散大语言模型推理测试时缩放方法 reinforcement learning large language model
41 SeaPO: Strategic Error Amplification for Robust Preference Optimization of Large Language Models SeaPO:通过策略性误差放大增强大语言模型偏好优化的鲁棒性 preference learning large language model
42 Let LLMs Speak Embedding Languages: Generative Text Embeddings via Iterative Contrastive Refinement 提出GIRCSE,利用生成式LLM迭代优化文本嵌入,显著提升语义表征能力。 representation learning large language model instruction following
43 Aligning Multilingual Reasoning with Verifiable Semantics from a High-Resource Expert Model 提出PB-RLSVR框架,利用高资源专家模型提升多语言LLM的推理能力。 reinforcement learning PPO large language model
44 InfoAgent: Advancing Autonomous Information-Seeking Agents InfoAgent:通过创新数据合成和自建搜索工具,提升自主信息搜寻Agent能力 reinforcement learning large language model
45 Towards Trustworthy Lexical Simplification: Exploring Safety and Efficiency with Small LLMs 提出一种基于小型LLM的安全高效词汇简化框架,并探索安全过滤策略。 distillation large language model
46 Circuit Distillation 提出电路蒸馏方法,通过对齐模型内部表征实现算法能力迁移 distillation
47 Socratic-Zero : Bootstrapping Reasoning via Data-Free Agent Co-evolution Socratic-Zero:通过无数据Agent协同进化引导LLM推理能力 distillation large language model
48 AdaDetectGPT: Adaptive Detection of LLM-Generated Text with Statistical Guarantees AdaDetectGPT:利用统计保证自适应检测LLM生成文本 Mamba large language model
49 Alternatives To Next Token Prediction In Text Generation -- A Survey 综述:探索文本生成中下一词预测的替代方案,应对LLM的固有缺陷。 flow matching large language model
50 Reinforcement Mid-Training 提出强化中训练(RMT)框架,提升大语言模型性能并加速训练。 reinforcement learning large language model
51 Beyond Repetition: Text Simplification and Curriculum Learning for Data-Constrained Pretraining 针对数据受限的预训练,提出基于文本简化和课程学习的优化方法。 curriculum learning

🔬 支柱八:物理动画 (Physics-based Animation) (1 篇)

#题目一句话要点标签🔗
52 Evaluating Spatiotemporal Consistency in Automatically Generated Sewing Instructions 提出一种基于树结构的自动评估指标,用于评估LLM生成的缝纫步骤指令的时空一致性。 spatiotemporal

🔬 支柱七:动作重定向 (Motion Retargeting) (1 篇)

#题目一句话要点标签🔗
53 LatentEvolve: Self-Evolving Test-Time Scaling in Latent Space 提出LatentEvolve,通过潜空间自进化测试时缩放提升大语言模型推理能力。 latent optimization large language model

🔬 支柱一:机器人控制 (Robot Control) (1 篇)

#题目一句话要点标签🔗
54 EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering EasySteer:基于vLLM的高性能、可扩展LLM引导统一框架 manipulation large language model

🔬 支柱六:视频提取与匹配 (Video Extraction) (1 篇)

#题目一句话要点标签🔗
55 Probing the Limits of Stylistic Alignment in Vision-Language Models 研究视觉-语言模型风格对齐的极限,探索幽默和浪漫风格所需的最少偏好数据。 HuMoR

⬅️ 返回 cs.CL 首页 · 🏠 返回主页