cs.CL（2025-02-10）

📊 共 45 篇论文 | 🔗 9 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (36 🔗7) 支柱二：RL算法与架构 (RL & Architecture) (9 🔗2)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (36 篇)

#	题目	一句话要点	标签	🔗
1	From No to Know: Taxonomy, Challenges, and Opportunities for Negation Understanding in Multimodal Foundation Models	提出多模态否定理解分类法，应对多模态大模型在否定语义理解上的挑战。	foundation model multimodal
2	Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation	提出ProverGen框架，结合LLM与符号证明器生成高质量一阶逻辑推理数据集ProverQA。	large language model chain-of-thought	✅
3	Structural Reformation of Large Language Model Neuron Encapsulation for Divergent Information Aggregation	提出结构化神经元封装，提升大语言模型信息聚合与逻辑推理能力	large language model
4	Multi-turn Evaluation of Anthropomorphic Behaviours in Large Language Models	提出多轮评估方法，用于衡量大型语言模型中拟人化行为的程度。	large language model
5	Specializing Large Language Models to Simulate Survey Response Distributions for Global Populations	提出一种基于微调LLM的方法，用于模拟全球人口的调查响应分布。	large language model
6	Demystifying Singular Defects in Large Language Models	揭示大语言模型奇异缺陷：基于奇异向量分析高范数Token现象	large language model	✅
7	Boosting Self-Efficacy and Performance of Large Language Models via Verbal Efficacy Stimulations	通过语言效能刺激提升大型语言模型的自我效能与表现	large language model
8	Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training	Hephaestus：通过持续预训练提升大语言模型智能体的基础能力	large language model
9	A Survey of Theory of Mind in Large Language Models: Evaluations, Representations, and Safety Risks	综述大型语言模型中的心理理论：评估、表征与安全风险	large language model
10	Systematic Outliers in Large Language Models	深入分析LLM中的系统性异常值，揭示其成因、功能及对模型的影响	large language model	✅
11	Latent Convergence Modulation in Large Language Models: A Novel Approach to Iterative Contextual Realignment	提出潜在收敛调制方法，提升大型语言模型长文本生成中的上下文一致性。	large language model
12	DebateBench: A Challenging Long Context Reasoning Benchmark For Large Language Models	提出 DebateBench：一个用于评估大型语言模型长文本推理能力的挑战性基准	large language model
13	GuideLLM: Exploring LLM-Guided Conversation with Applications in Autobiography Interviewing	提出GuideLLM，探索LLM引导的对话在自传访谈中的应用	large language model instruction following
14	Non-literal Understanding of Number Words by Language Models	通过链式思考提示，提升大语言模型对数字词汇的非字面理解能力	large language model chain-of-thought
15	ConMeC: A Dataset for Metonymy Resolution with Common Nouns	ConMeC：一个用于普通名词转喻消解的数据集	large language model chain-of-thought	✅
16	Cardiverse: Harnessing LLMs for Novel Card Game Prototyping	Cardiverse：利用大型语言模型进行创新卡牌游戏原型设计	large language model	✅
17	Tokenization Standards for Linguistic Integrity: Turkish as a Benchmark	提出一种新框架以评估土耳其语的分词策略	large language model
18	AIMS.au: A Dataset for the Analysis of Modern Slavery Countermeasures in Corporate Statements	提出AIMS.au数据集，用于分析企业声明中现代奴隶制应对措施	large language model
19	Finding Words Associated with DIF: Predicting Differential Item Functioning using LLMs and Explainable AI	利用LLM和可解释AI预测DIF，发现与DIF相关的词汇以提升评估公平性	large language model
20	Investigating the Zone of Proximal Development of Language Models for In-Context Learning	利用近端发展区理论分析LLM的上下文学习能力，提升推理和微调效果	large language model
21	Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling	提出计算最优的测试时缩放策略，使小模型在复杂任务上超越大模型	large language model
22	In-Context Learning (and Unlearning) of Length Biases	研究表明大语言模型能通过上下文学习长度偏差，并可用于消除模型自身编码的长度偏差。	large language model
23	Transparent NLP: Using RAG and LLM Alignment for Privacy Q&A	提出MultiRAIN对齐RAG系统，提升隐私问答中LLM的透明性和合规性	large language model
24	Do we really have to filter out random noise in pre-training data for language models?	研究表明预训练数据中的随机噪声对语言模型影响有限，并提出局部梯度匹配损失。	multimodal
25	LawGPT: Knowledge-Guided Data Generation and Its Application to Legal LLM	提出知识引导的数据生成框架KgDG，提升开源法律LLM的推理能力	large language model	✅
26	Adaptive Prompting: Ad-hoc Prompt Composition for Social Bias Detection	提出自适应Prompt组合方法，用于提升社交偏见检测任务的性能。	large language model
27	KARMA: Leveraging Multi-Agent LLMs for Automated Knowledge Graph Enrichment	KARMA：利用多智能体LLM自动丰富知识图谱	large language model
28	Can AI Examine Novelty of Patents?: Novelty Evaluation Based on the Correspondence between Patent Claim and Prior Art	提出基于LLM的专利新颖性评估方法，并构建首个相关数据集。	large language model
29	SeaExam and SeaBench: Benchmarking LLMs with Local Multilingual Questions in Southeast Asia	提出SeaExam和SeaBench，用于评估LLM在东南亚本地多语言场景下的能力。	large language model
30	Krutrim LLM: Multilingual Foundational Model for over a Billion People	Krutrim LLM：为十亿人口设计的印度语多语言基础模型	foundation model
31	Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE	Jakiro：利用MoE解耦多头注意力机制，加速推测解码并提升精度。	large language model	✅
32	Emergent Response Planning in LLMs	揭示LLM涌现的响应规划能力：隐藏层编码未来输出属性	large language model
33	Is LLM an Overconfident Judge? Unveiling the Capabilities of LLMs in Detecting Offensive Language with Annotation Disagreement	揭示LLM在处理标注不一致的冒犯性语言检测中的能力与过度自信问题	large language model
34	Scaling Public Health Text Annotation: Zero-Shot Learning vs. Crowdsourcing for Improved Efficiency and Labeling Accuracy	探索LLM零样本学习在公共健康文本标注中的应用，提升效率并评估标注准确性。	large language model
35	LegalViz: Legal Text Visualization by Text To Diagram Generation	提出LegalViz数据集，用于法律文本到易理解图表的生成，提升法律知识可访问性。	large language model
36	LCIRC: A Recurrent Compression Approach for Efficient Long-form Context and Query Dependent Modeling in LLMs	提出LCIRC，通过循环压缩和查询依赖建模高效处理LLM中的长文本上下文。	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (9 篇)

#	题目	一句话要点	标签	🔗
37	RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation Learning	RALLRec：融合表征学习的检索增强大语言模型推荐系统	representation learning large language model	✅
38	IRepair: An Intent-Aware Approach to Repair Data-Driven Errors in Large Language Models	提出IRepair以解决大语言模型中的数据驱动错误问题	direct preference optimization large language model
39	K-ON: Stacking Knowledge On the Head Layer of Large Language Model	K-ON：通过在大型语言模型的头部堆叠知识来解决KG与自然语言的粒度不匹配问题	representation learning large language model
40	Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning	OREAL：探索基于结果奖励的强化学习在数学推理中的极限	reinforcement learning behavior cloning distillation	✅
41	Rationalization Models for Text-to-SQL	提出基于CoT的文本到SQL生成框架，提升复杂查询的执行精度和可解释性。	distillation large language model chain-of-thought
42	Who Taught You That? Tracing Teachers in Model Distillation	提出一种基于词汇特征的教师模型溯源方法，用于识别学生模型的知识来源。	distillation instruction following
43	Optimizing Knowledge Integration in Retrieval-Augmented Generation with Self-Selection	提出Self-Selection RAG框架，通过自选择机制优化检索增强生成中的知识融合。	DPO direct preference optimization large language model
44	Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning	通过强化关键token探索，忽略KL惩罚提升RL微调效果	reinforcement learning large language model
45	C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Generation	提出C-3PO框架，通过轻量级代理优化实现类人检索增强生成	reinforcement learning large language model

⬅️ 返回 cs.CL 首页 · 🏠 返回主页

cs.CL（2025-02-10）

🎯 兴趣领域导航

🔬 支柱九：具身大模型 (Embodied Foundation Models) (36 篇)

🔬 支柱二：RL算法与架构 (RL & Architecture) (9 篇)

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册

👤 用户管理