cs.CL（2025-08-19）

📊 共 34 篇论文 | 🔗 4 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (28 🔗3) 支柱二：RL算法与架构 (RL & Architecture) (5 🔗1) 支柱一：机器人控制 (Robot Control) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (28 篇)

#	题目	一句话要点	标签	🔗
1	MME-SCI: A Comprehensive and Challenging Science Benchmark for Multimodal Large Language Models	提出MME-SCI以解决多模态大语言模型评估中的关键挑战	large language model multimodal	✅
2	CyPortQA: Benchmarking Multimodal Large Language Models for Cyclone Preparedness in Port Operation	提出CyPortQA以解决港口飓风应对中的多模态数据整合问题	large language model multimodal
3	Generics and Default Reasoning in Large Language Models	评估大型语言模型在默认推理中的表现与局限性	large language model chain-of-thought
4	Can Large Language Models (LLMs) Describe Pictures Like Children? A Comparative Corpus Study	比较大型语言模型与儿童语言描述的相似性	large language model multimodal
5	Mechanistic Exploration of Backdoored Large Language Model Attention Patterns	探讨后门攻击对大型语言模型注意力模式的影响	large language model
6	A Review of Developmental Interpretability in Large Language Models	综述大型语言模型的开发性可解释性研究进展	large language model
7	ViExam: Are Vision Language Models Better than Humans on Vietnamese Multimodal Exam Questions?	提出ViExam基准以评估视觉语言模型在越南多模态考试中的表现	multimodal
8	Ask Good Questions for Large Language Models	提出Ask-Good-Question框架以解决对话系统中的用户困惑问题	large language model
9	ALIGN: Word Association Learning for Cultural Alignment in Large Language Models	提出ALIGN方法以解决大型语言模型的文化偏见问题	large language model
10	The Promise of Large Language Models in Digital Health: Evidence from Sentiment Analysis in Online Health Communities	利用大型语言模型解决数字健康领域情感分析挑战	large language model
11	MATA (māta): Mindful Assessment of the Telugu Abilities of Large Language Models	提出MATA评估数据集以评估大型语言模型的泰卢固语能力	large language model
12	Scalable Scientific Interest Profiling Using Large Language Models	提出基于大语言模型的科学兴趣画像生成方法	large language model
13	Saudi-Dialect-ALLaM: LoRA Fine-Tuning for Dialectal Arabic Generation	提出LoRA微调方法以解决阿拉伯方言生成问题	large language model foundation model
14	Prompt-Based One-Shot Exact Length-Controlled Generation with LLMs	提出基于提示的一次性精确长度控制生成方法以解决LLMs文本生成问题	large language model instruction following
15	MultiFuzz: A Dense Retrieval-based Multi-Agent System for Network Protocol Fuzzing	提出MultiFuzz以解决传统协议模糊测试的有效性问题	large language model chain-of-thought
16	Sycophancy under Pressure: Evaluating and Mitigating Sycophantic Bias via Adversarial Dialogues in Scientific QA	提出Pressure-Tune以解决科学问答中的谄媚偏见问题	large language model chain-of-thought
17	Zero-knowledge LLM hallucination detection and mitigation through fine-grained cross-model consistency	提出Finch-Zk以解决大型语言模型的幻觉检测与缓解问题	large language model
18	GRILE: A Benchmark for Grammar Reasoning and Explanation in Romanian LLMs	提出GRILE基准以解决罗马尼亚LLMs的语法推理与解释问题	large language model
19	Two Birds with One Stone: Multi-Task Detection and Attribution of LLM-Generated Text	提出DA-MTL框架以解决LLM生成文本的检测与归属问题	large language model
20	Unintended Misalignment from Agentic Fine-Tuning: Risks and Mitigation	提出PING方法以解决大语言模型的安全性问题	large language model
21	DPad: Efficient Diffusion Language Models with Suffix Dropout	提出DPad以解决扩散语言模型的计算效率问题	large language model	✅
22	Prediction is not Explanation: Revisiting the Explanatory Capacity of Mapping Embeddings	挑战传统假设，揭示词嵌入的解释能力局限性	large language model
23	Alvorada-Bench: Can Language Models Solve Brazilian University Entrance Exams?	提出Alvorada-Bench以评估语言模型在巴西大学入学考试中的表现	chain-of-thought
24	Measuring LLM Code Generation Stability via Structural Entropy	通过结构熵评估大型语言模型代码生成的稳定性	large language model
25	Comparing energy consumption and accuracy in text classification inference	评估文本分类推理中的能耗与准确性权衡	large language model
26	ReviewGraph: A Knowledge Graph Embedding Based Framework for Review Rating Prediction with Sentiment Features	提出ReviewGraph框架以解决酒店客户评价评分预测问题	large language model	✅
27	MGT-Prism: Enhancing Domain Generalization for Machine-Generated Text Detection via Spectral Alignment	提出MGT-Prism以解决机器生成文本检测的领域泛化问题	large language model
28	CRISP: Persistent Concept Unlearning via Sparse Autoencoders	提出CRISP以解决大语言模型知识去除问题	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (5 篇)

#	题目	一句话要点	标签	🔗
29	Lexical Hints of Accuracy in LLM Reasoning Chains	提出词汇提示以提高大型语言模型推理链的准确性	reinforcement learning large language model chain-of-thought
30	ProMed: Shapley Information Gain Guided Reinforcement Learning for Proactive Medical LLMs	提出ProMed以解决医疗LLMs反应性不足问题	reinforcement learning large language model
31	Coarse-to-Fine Personalized LLM Impressions for Streamlined Radiology Reports	提出粗到细个性化LLM印象生成框架以解决放射科报告问题	reinforcement learning RLHF large language model
32	Chunks as Arms: Multi-Armed Bandit-Guided Sampling for Long-Context LLM Preference Optimization	提出LongMab-PO以解决长上下文LLM偏好优化问题	DPO direct preference optimization large language model	✅
33	Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR	提出自我博弈与变分问题合成以提升RLVR性能	reinforcement learning large language model

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
34	MMReview: A Multidisciplinary and Multimodal Benchmark for LLM-Based Peer Review Automation	提出MMReview以解决学术同行评审自动化的评估标准缺失问题	manipulation large language model multimodal

⬅️ 返回 cs.CL 首页 · 🏠 返回主页

cs.CL（2025-08-19）

🎯 兴趣领域导航

🔬 支柱九：具身大模型 (Embodied Foundation Models) (28 篇)

🔬 支柱二：RL算法与架构 (RL & Architecture) (5 篇)

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册