cs.CL (2025-07-09)

📊 33 papers in total | 🔗 3 with code

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (25 🔗2) · Pillar 2: RL & Architecture (5 🔗1) · Pillar 8: Physics-based Animation (1) · Pillar 1: Robot Control (1) · Pillar 6: Video Extraction (1)

🔬 Pillar 9: Embodied Foundation Models (25 papers)

#	Title	One-Sentence Summary	Tags	🔗
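1	Prompt Perturbations Reveal Human-Like Biases in Large Language Model Survey Responses	Prompt perturbations reveal human-like biases in large language models' survey responses	large language model
2	Elite Polarization in European Parliamentary Speeches: a Novel Measurement Approach Using Large Language Models	Uses large language models for sentiment analysis of political figures, proposing a new approach to measuring elite polarization	large language model
3	Expediting data extraction using a large language model (LLM) and scoping review protocol: a methodological study within a complex scoping review	Accelerates data extraction using a large language model and a scoping review protocol	large language model
4	Enhancing Food-Domain Question Answering with a Multimodal Knowledge Graph: Hybrid QA Generation and Diversity Analysis	Proposes a food-domain question answering framework that integrates a multimodal knowledge graph, improving generation quality and diversity	multimodal
5	Large Language Model for Extracting Complex Contract Information in Industrial Scenes	Proposes an LLM-based method for extracting complex contract information in industrial scenarios	large language model
6	Integrating External Tools with Large Language Models to Improve Accuracy	Proposes the Athena framework, which integrates external tools to significantly improve LLM question-answering accuracy in educational settings	large language model
7	InvestAlign: Overcoming Data Scarcity in Aligning Large Language Models with Investor Decision-Making Processes under Herd Behavior	InvestAlign: addresses data scarcity in aligning LLMs with investor decision-making under herd behavior	large language model	✅
8	ixi-GEN: Efficient Industrial sLLMs through Domain Adaptive Continual Pretraining	ixi-GEN: improves the efficiency of industrial small LLMs through domain-adaptive continual pretraining	large language model, foundation model
9	CRISP: Complex Reasoning with Interpretable Step-based Plans	CRISP: performs complex reasoning with interpretable step-based plans, improving mathematical reasoning and code generation	large language model, chain-of-thought
10	Frontier LLMs Still Struggle with Simple Reasoning Tasks	Frontier LLMs still struggle with simple reasoning tasks	large language model
11	Multi-Agent Retrieval-Augmented Framework for Evidence-Based Counterspeech Against Health Misinformation	Proposes a multi-agent retrieval-augmented framework for generating evidence-based counterspeech against health misinformation	large language model
12	SynthTextEval: Synthetic Text Data Generation and Evaluation for High-Stakes Domains	SynthTextEval: a toolkit for synthetic text generation and evaluation in high-stakes domains	large language model
13	Planted in Pretraining, Swayed by Finetuning: A Case Study on the Origins of Cognitive Biases in LLMs	Finds that cognitive biases in LLMs originate mainly in pretraining, with finetuning having limited influence	large language model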
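14	Exploring LLMs for Predicting Tutor Strategy and Student Outcomes in Dialogues	Explores the ability of LLMs to predict tutor strategy and student outcomes in dialogues	large language model
15	MultiJustice: A Chinese Dataset for Multi-Party, Multi-Charge Legal Prediction	Introduces the MultiJustice dataset for evaluating LLM performance on multi-defendant, multi-charge legal prediction	large language model	✅
16	Developing and Maintaining an Open-Source Repository of AI Evaluations: Challenges and Insights	Proposes a framework for managing an open-source repository of AI evaluations to address evaluation challenges	large language model
17	Adaptive Termination for Multi-round Parallel Reasoning: An Universal Semantic Entropy-Guided Framework	Proposes a semantic entropy-guided adaptive termination framework that improves the efficiency of multi-round parallel reasoning	large language model
18	RAG Safety: Exploring Knowledge Poisoning Attacks to Retrieval-Augmented Generation	Proposes a stealthy knowledge poisoning attack against knowledge graph-enhanced RAG systems	large language model
19	Text to model via SysML: Automated generation of dynamical system computational models from unstructured natural language text via enhanced System Modeling Language diagrams	Proposes a SysML-based method for automatically generating models from text, speeding up the design and deployment of engineered dynamical systems	large language model
20	AblationBench: Evaluating Automated Planning of Ablations in Empirical AI Research	AblationBench: a benchmark suite for evaluating AI-assisted planning of ablation experiments	chain-of-thought
21	Checklist Engineering Empowers Multilingual LLM Judges	Proposes CE-Judge, a checklist engineering-based framework for multilingual LLM evaluation tasks	large language model
22	On the Effect of Uncertainty on Layer-wise Inference Dynamics	Finds that uncertainty in LLM predictions has little effect on layer-wise inference dynamics, though model capability may change this	large language model
23	A Mathematical Theory of Discursive Networks	Builds a mathematical model of discursive networks and uses a mutual review mechanism to improve the reliability of information from large language models	large language model
24	SpindleKV: A Novel KV Cache Reduction Method Balancing Both Shallow and Deep Layers	SpindleKV: a novel KV cache reduction method that balances shallow and deep layers	large language model
25	On the Robustness of Verbal Confidence of LLMs in Adversarial Attacks	Shows that adversarial attacks on the verbal confidence of LLMs can significantly reduce its reliability	large language model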

🔬 Pillar 2: RL & Architecture (5 papers)

#	Title	One-Sentence Summary	Tags	🔗
26	Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings	Proposes a multi-agent translation framework combining contrastive learning with phonetic-semantic embeddings to tackle cross-lingual translation of puns	contrastive learning, HuMoR, large language model
27	SCoRE: Streamlined Corpus-based Relation Extraction using Multi-Label Contrastive Learning and Bayesian kNN	SCoRE: streamlined corpus-based relation extraction using multi-label contrastive learning and Bayesian kNN	contrastive learning, large language model
28	Rethinking Verification for LLM Code Generation: From Generation to Testing	Proposes the SAGA framework, which improves the coverage and quality of test cases for LLM code generation and thereby improves code evaluation	reinforcement learning, large language model
29	FuDoBa: Fusing Document and Knowledge Graph-based Representations with Bayesian Optimisation	FuDoBa: fuses document and knowledge graph representations, using Bayesian optimisation to improve domain-specific document classification	representation learning, large language model
30	Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation	Proposes SambaY, an efficient Decoder-Hybrid-Decoder architecture for reasoning with long generation	reinforcement learning, SSM, state space model	✅

🔬 Pillar 8: Physics-based Animation (1 paper)

#	Title	One-Sentence Summary	Tags	🔗	⭐
31	UniConv: Unifying Retrieval and Response Generation for Large Language Models in Conversations	UniConv unifies retrieval and generation, improving LLM performance in conversational search	UniCon, large language model

🔬 Pillar 1: Robot Control (1 paper)

#	Title	One-Sentence Summary	Tags	🔗	⭐
32	VisualTrap: A Stealthy Backdoor Attack on GUI Agents via Visual Grounding Manipulation	VisualTrap: a stealthy backdoor attack on GUI agents, carried out by manipulating visual grounding	manipulation, visual grounding

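🔬 Pillar 6: Video Extraction (1 paper)

#	Title	One-Sentence Summary	Tags	🔗	⭐
33	A Language-Driven Framework for Improving Personalized Recommendations: Merging LLMs with Traditional Algorithms	Proposes a personalized recommendation framework that incorporates LLMs, improving traditional algorithms' understanding of textual preferences	HuMoR, large language model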
