cs.CL (2025-03-23)

📊 15 papers in total | 🔗 2 with code

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (11 🔗1) · Pillar 2: RL & Architecture (3) · Pillar 1: Robot Control (1 🔗1)

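🔬 Pillar 9: Embodied Foundation Models (11 papers)

#	Title	Key Point	Tags	🔗	⭐
1	Unmasking Deceptive Visuals: Benchmarking Multimodal Large Language Models on Misleading Chart Question Answering	Proposes the Misleading ChartQA benchmark to address question answering over misleading charts	large language model multimodal
2	MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection	Proposes the MathAgent framework for multimodal mathematical error detection in real-world scenarios	large language model multimodal
3	Mind with Eyes: from Language Reasoning to Multimodal Reasoning	Surveys multimodal reasoning methods, from language-centric to collaborative reasoning, as a reference for human-like cognitive abilities.	multimodal
4	Investigating Recent Large Language Models for Vietnamese Machine Reading Comprehension	For Vietnamese machine reading comprehension, the paper proposes efficiently fine-tuning Llama 3 and Gemma models with QLoRA.	large language model	✅
5	An Empirical Study of the Role of Incompleteness and Ambiguity in Interactions with Large Language Models	Studies how incompleteness and ambiguity affect interactions with large language models, and proposes a neuro-symbolic framework for modeling multi-turn question answering.	large language model
6	STShield: Single-Token Sentinel for Real-Time Jailbreak Detection in Large Language Models	STShield: real-time jailbreak detection for large language models based on a single-token sentinel mechanism	large language model
7	"Whose Side Are You On?" Estimating Ideology of Political and News Content Using Large Language Models and Few-shot Demonstration Selection	Uses large language models and few-shot demonstration selection to estimate the ideological leaning of political and news content	large language model
8	ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices	Proposes the ShED-HD framework for efficient detection of large language model hallucinations on edge devices	large language model
9	LakotaBERT: A Transformer-based Model for Low Resource Lakota Language	LakotaBERT: a Transformer model tailored to the low-resource Lakota language	large language model
10	Instructing the Architecture Search for Spatial-temporal Sequence Forecasting with LLM	Proposes an LLM-guided NAS method for spatial-temporal sequence forecasting that improves search efficiency and effectiveness	large language model
11	WindowKV: Task-Adaptive Group-Wise KV Cache Window Selection for Efficient LLM Inference	Proposes WindowKV, a task-adaptive group-wise KV cache window selection method for efficient LLM inference.	large language model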

🔬 Pillar 2: RL & Architecture (3 papers)

#	Title	Key Point	Tags	🔗	⭐
12	Inverse Reinforcement Learning with Dynamic Reward Scaling for LLM Alignment	Proposes DR-IRL, which improves LLM safety alignment through dynamic reward scaling	reinforcement learning inverse reinforcement learning large language model
13	Understanding the Effects of RLHF on the Quality and Detectability of LLM-Generated Texts	Shows that RLHF improves LLM text quality but also makes the text easier to detect and produces verbose, repetitive content	reinforcement learning RLHF large language model
14	$D^2LoRA$: Data-Driven LoRA Initialization for Low Resource Tasks	Proposes D²LoRA, a data-driven LoRA initialization method that improves fine-tuning efficiency on low-resource tasks.	DPO direct preference optimization large language model

🔬 Pillar 1: Robot Control (1 paper)

#	Title	Key Point	Tags	🔗	⭐
15	GeoBenchX: Benchmarking LLMs in Agent Solving Multistep Geospatial Tasks	GeoBenchX: a benchmark evaluating LLM agents' tool-calling ability on multistep geospatial tasks	manipulation large language model	✅
