cs.CL(2026-01-08)

📊 共 39 篇论文 | 🔗 9 篇有代码

🎯 兴趣领域导航

支柱九:具身大模型 (Embodied Foundation Models) (29 🔗6) 支柱二:RL算法与架构 (RL & Architecture) (9 🔗3) 支柱一:机器人控制 (Robot Control) (1)

🔬 支柱九:具身大模型 (Embodied Foundation Models) (29 篇)

#题目一句话要点标签🔗
1 CRANE: Causal Relevance Analysis of Language-Specific Neurons in Multilingual Large Language Models CRANE:通过因果相关性分析多语言大模型中特定语言神经元 large language model language conditioned
2 V-FAT: Benchmarking Visual Fidelity Against Text-bias V-FAT基准测试揭示多模态大语言模型中文本偏差下的视觉保真度问题。 large language model multimodal visual grounding
3 See, Explain, and Intervene: A Few-Shot Multimodal Agent Framework for Hateful Meme Moderation 提出基于生成式AI和少量样本学习的多模态框架,用于检测、解释和干预仇恨表情包。 multimodal
4 BanglaLorica: Design and Evaluation of a Robust Watermarking Algorithm for Large Language Models in Bangla Text Generation BanglaLorica:针对孟加拉语LLM文本生成,提出一种鲁棒的水印算法并进行评估 large language model
5 Can Large Language Models Resolve Semantic Discrepancy in Self-Destructive Subcultures? Evidence from Jirai Kei 提出子文化对齐求解器以解决自毁亚文化行为检测问题 large language model
6 Learning from Mistakes: Negative Reasoning Samples Enhance Out-of-Domain Generalization 利用负样本推理提升大语言模型领域外泛化能力 large language model chain-of-thought
7 THaLLE-ThaiLLM: Domain-Specialized Small LLMs for Finance and Thai -- Technical Report THaLLE-ThaiLLM:面向金融和泰语的领域专用小型LLM,通过模型合并实现多功能性。 large language model instruction following
8 Measuring and Fostering Peace through Machine Learning and Artificial Intelligence 利用机器学习和人工智能测量并促进和平 large language model
9 RelayLLM: Efficient Reasoning via Collaborative Decoding RelayLLM:提出一种基于协同解码的高效推理框架,显著降低大语言模型的计算成本。 large language model
10 CuMA: Aligning LLMs with Sparse Cultural Values via Demographic-Aware Mixture of Adapters 提出CuMA,通过人口统计学感知的适配器混合模型对齐LLM与稀疏文化价值观 large language model
11 Differential syntactic and semantic encoding in LLMs 通过分析LLM内部表征,揭示句法和语义信息的差异化编码方式 large language model
12 Prior-Informed Zeroth-Order Optimization with Adaptive Direction Alignment for Memory-Efficient LLM Fine-Tuning 提出先验引导的零阶优化方法,高效微调大规模语言模型 large language model
13 SampoNLP: A Self-Referential Toolkit for Morphological Analysis of Subword Tokenizers SampoNLP:一种自参照工具包,用于亚词分词器的形态分析 large language model
14 Agent-as-a-Judge 提出Agent-as-a-Judge框架,提升复杂AI评估的可靠性与可验证性 large language model
15 Belief in Authority: Impact of Authority in Multi-Agent Evaluation Framework 首个多智能体评估框架研究:角色权威性偏见对智能体交互的影响分析 large language model
16 NC2C: Automated Convexification of Generic Non-Convex Optimization Problems NC2C:利用LLM自动凸化通用非凸优化问题,提升求解效率。 large language model
17 PILOT-Bench: A Benchmark for Legal Reasoning in the Patent Domain with IRAC-Aligned Classification Tasks 提出PILOT-Bench:一个专利领域法律推理的IRAC对齐分类基准 large language model
18 RiskAtlas: Exposing Domain-Specific Risks in LLMs through Knowledge-Graph-Guided Harmful Prompt Generation RiskAtlas:通过知识图谱引导的有害提示生成,揭示LLM在特定领域的风险 large language model
19 DSC2025 -- ViHallu Challenge: Detecting Hallucination in Vietnamese LLMs DSC2025 ViHallu Challenge:首个越南语LLM幻觉检测大规模共享任务。 large language model
20 ToolGate: Contract-Grounded and Verified Tool Execution for LLMs ToolGate:面向LLM工具执行的、基于合约验证的安全框架 large language model
21 Semantically Orthogonal Framework for Citation Classification: Disentangling Intent and Content 提出SOFT框架,解耦引用意图与内容类型,提升引文分类效果 large language model
22 Multi-Disciplinary Dataset Discovery from Citation-Verified Literature Contexts 提出一种基于引文语境的多学科数据集发现框架,提升数据集检索召回率。 large language model
23 GenProve: Learning to Generate Text with Fine-Grained Provenance GenProve:提出一种生成文本并提供细粒度来源信息的框架,解决LLM幻觉问题。 large language model
24 Faithful Summarisation under Disagreement via Belief-Level Aggregation 提出基于信念层聚合的框架,解决意见型摘要中现有方法忽略观点冲突的问题。 large language model
25 Mind2Report: A Cognitive Deep Research Agent for Expert-Level Commercial Report Synthesis 提出Mind2Report,模拟商业分析师,合成专家级商业报告 large language model
26 Fame Fades, Nature Remains: Disentangling the Character Identity of Role-Playing Agents 提出角色身份解耦框架,区分参数化和属性化身份,提升角色扮演Agent的真实性。 large language model
27 PRISM: A Unified Framework for Post-Training LLMs Without Verifiable Rewards PRISM:一种无需可验证奖励的LLM后训练统一框架 large language model
28 Thunder-KoNUBench: A Corpus-Aligned Benchmark for Korean Negation Understanding 提出Thunder-KoNUBench以解决韩语否定理解问题 large language model
29 LinguaGame: A Linguistically Grounded Game-Theoretic Paradigm for Multi-Agent Dialogue Generation LinguaGame:一种基于语言学和博弈论的多智能体对话生成范式 large language model

🔬 支柱二:RL算法与架构 (RL & Architecture) (9 篇)

#题目一句话要点标签🔗
30 Hán Dān Xué Bù (Mimicry) or Qīng Chū Yú Lán (Mastery)? A Cognitive Perspective on Reasoning Distillation in Large Language Models 推理蒸馏无法有效传递大语言模型的认知结构,导致功能对齐崩溃 reinforcement learning distillation large language model
31 Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking Qwen3-VL-Embedding和Qwen3-VL-Reranker:用于多模态检索和排序的统一框架 representation learning distillation foundation model
32 SemPA: Improving Sentence Embeddings of Large Language Models through Semantic Preference Alignment SemPA:通过语义偏好对齐提升大语言模型的句子嵌入表示 DPO direct preference optimization contrastive learning
33 Compositional Steering of Large Language Models with Steering Tokens 提出基于Steering Tokens的组合式大语言模型控制方法,实现多重行为的精准引导。 distillation large language model
34 Aligning Text, Code, and Vision: A Multi-Objective Reinforcement Learning Framework for Text-to-Visualization 提出RL-Text2Vis,利用多目标强化学习框架提升文本到可视化的语义对齐和质量。 reinforcement learning multimodal
35 GRACE: Reinforcement Learning for Grounded Response and Abstention under Contextual Evidence GRACE:基于上下文证据,用于可信回复与拒绝的强化学习框架 reinforcement learning large language model
36 AM$^3$Safety: Towards Data Efficient Alignment of Multi-modal Multi-turn Safety for MLLMs AM$^3$Safety:面向多模态大语言模型,提升多轮对话安全性的数据高效对齐框架 reinforcement learning RLHF large language model
37 Text as a Universal Interface for Transferable Personalization 提出AlignXplore+,利用文本作为通用接口实现可迁移的个性化大语言模型。 reinforcement learning large language model
38 RAAR: Retrieval Augmented Agentic Reasoning for Cross-Domain Misinformation Detection 提出RAAR框架,通过检索增强的Agent协同推理解决跨领域虚假信息检测难题。 reinforcement learning large language model

🔬 支柱一:机器人控制 (Robot Control) (1 篇)

#题目一句话要点标签🔗
39 Can AI-Generated Persuasion Be Detected? Persuaficial Benchmark and AI vs. Human Linguistic Differences 提出Persuaficial基准以检测AI生成的说服文本 manipulation large language model

⬅️ 返回 cs.CL 首页 · 🏠 返回主页