cs.CL（2025-07-16）

📊 共 28 篇论文 | 🔗 5 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (23 🔗5) 支柱二：RL算法与架构 (RL & Architecture) (3) 支柱一：机器人控制 (Robot Control) (2)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (23 篇)

#	题目	一句话要点	标签	🔗
1	Marco-Bench-MIF: On Multilingual Instruction-Following Capability of Large Language Models	提出 Marco-Bench-MIF，用于评估大语言模型的多语言指令遵循能力。	large language model instruction following	✅
2	Improving Drug Identification in Overdose Death Surveillance using Large Language Models	利用大型语言模型改进药物过量死亡监测中的药物识别	large language model
3	Improving Contextual ASR via Multi-grained Fusion with Large Language Models	提出一种多粒度融合的上下文ASR方法，利用大型语言模型提升关键词识别。	large language model
4	A Comparative Approach to Assessing Linguistic Creativity of Large Language Models and Humans	提出一种评估大型语言模型和人类语言创造力的通用测试方法	large language model
5	Value-Based Large Language Model Agent Simulation for Mutual Evaluation of Trust and Interpersonal Closeness	基于价值的大语言模型智能体模拟，用于互评估信任和人际亲密度	large language model
6	Graph Representations for Reading Comprehension Analysis using Large Language Model and Eye-Tracking Biomarker	利用大语言模型和眼动追踪生物标记，提出基于图表示的阅读理解分析方法。	large language model
7	Tracing Facts or just Copies? A critical investigation of the Competitions of Mechanisms in Large Language Models	探究大语言模型中机制竞争：事实追踪还是简单复制？	large language model
8	Advancing Retrieval-Augmented Generation for Structured Enterprise and Internal Data	提出一种高级RAG框架，用于处理结构化企业内部数据，提升问答性能。	large language model multimodal	✅
9	DyG-RAG: Dynamic Graph Retrieval-Augmented Generation with Event-Centric Reasoning	DyG-RAG：提出事件中心动态图检索增强生成框架，解决时序推理难题。	large language model chain-of-thought	✅
10	Findings of MEGA: Maths Explanation with LLMs using the Socratic Method for Active Learning	MEGA：结合苏格拉底教学法和LLM的数学解释方法，提升学生学习效果	large language model chain-of-thought
11	PARAM-1 BharatGen 2.9B Model	PARAM-1：一个以印度语言多样性为核心的29亿参数语言模型	large language model foundation model
12	A Survey of Deep Learning for Geometry Problem Solving	深度学习赋能几何问题求解：综述与前瞻	large language model multimodal	✅
13	Can We Predict Alignment Before Models Finish Thinking? Towards Monitoring Misaligned Reasoning Models	提出基于CoT激活的线性探针，用于提前预测推理模型对齐状态	large language model
14	Probing for Arithmetic Errors in Language Models	利用语言模型内部激活探测算术错误并指导模型自纠错	chain-of-thought
15	Developing Visual Augmented Q&A System using Scalable Vision Embedding Retrieval & Late Interaction Re-ranker	提出一种可扩展的视觉增强问答系统，利用可扩展的视觉嵌入检索和后期交互重排序器。	multimodal
16	Beyond Single Models: Enhancing LLM Detection of Ambiguity in Requests through Debate	提出多代理辩论框架以增强LLM对请求歧义的检测能力	large language model
17	Chain-of-Descriptions: Improving Code LLMs for VHDL Code Generation and Summarization	提出Chain-of-Descriptions方法，提升代码大模型在VHDL代码生成与摘要任务上的性能	large language model
18	Text-ADBench: Text Anomaly Detection Benchmark based on LLMs Embedding	Text-ADBench：基于LLM嵌入的文本异常检测基准，揭示嵌入质量是关键。	large language model	✅
19	Identifying Algorithmic and Domain-Specific Bias in Parliamentary Debate Summarisation	提出多阶段总结框架，评估LLM在议会辩论总结中的算法和领域偏差。	large language model
20	Iterative Augmentation with Summarization Refinement (IASR) Evaluation for Unstructured Survey data Modeling and Analysis	提出IASR框架，用于评估和优化LLM在非结构化调查数据建模中的增广效果。	large language model
21	TopicImpact: Improving Customer Feedback Analysis with Opinion Units for Topic Modeling and Star-Rating Prediction	TopicImpact：利用观点单元改进客户反馈分析，提升主题建模和星级预测	large language model
22	Toxicity-Aware Few-Shot Prompting for Low-Resource Singlish Translation	提出一种毒性感知的少样本提示框架，用于低资源Singlish翻译，提升毒性内容翻译质量。	large language model
23	BlockBPE: Parallel BPE Tokenization	提出BlockBPE以解决GPU批量推理中的BPE瓶颈问题	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (3 篇)

#	题目	一句话要点	标签
24	Modeling Open-World Cognition as On-Demand Synthesis of Probabilistic Models	提出模型合成架构MSA，模拟开放世界认知中概率模型的按需合成。	world model chain-of-thought
25	Simplifications are Absolutists: How Simplified Language Reduces Word Sense Awareness in LLM-Generated Definitions	简化语言降低LLM生成定义中词义辨析能力，DPO微调可显著改善	direct preference optimization large language model
26	DualReward: A Dynamic Reinforcement Learning Framework for Cloze Tests Distractor Generation	DualReward：一种用于完形填空题干扰项生成的动态强化学习框架	reinforcement learning

🔬 支柱一：机器人控制 (Robot Control) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
27	Evaluating the Ability of Large Language Models to Reason about Cardinal Directions, Revisited	评估大语言模型在基数方向推理能力，发现现有模型仍存在不足	locomotion large language model
28	PoTPTQ: A Two-step Power-of-Two Post-training for LLMs	PoTPTQ：一种用于LLM的二步幂次量化后训练方法	manipulation large language model

⬅️ 返回 cs.CL 首页 · 🏠 返回主页

cs.CL（2025-07-16）

🎯 兴趣领域导航

🔬 支柱九：具身大模型 (Embodied Foundation Models) (23 篇)

🔬 支柱二：RL算法与架构 (RL & Architecture) (3 篇)

🔬 支柱一：机器人控制 (Robot Control) (2 篇)

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册

👤 用户管理