cs.CL(2025-05-12)

📊 共 32 篇论文 | 🔗 8 篇有代码

🎯 兴趣领域导航

支柱九:具身大模型 (Embodied Foundation Models) (23 🔗3) 支柱二:RL算法与架构 (RL & Architecture) (8 🔗5) 支柱一:机器人控制 (Robot Control) (1)

🔬 支柱九:具身大模型 (Embodied Foundation Models) (23 篇)

#题目一句话要点标签🔗
1 Learning Dynamics in Continual Pre-Training for Large Language Models 提出CPT缩放法则以优化大语言模型的持续预训练 large language model foundation model
2 Reassessing Large Language Model Boolean Query Generation for Systematic Reviews 系统评审中提出改进的LLM布尔查询生成方法 large language model chain-of-thought
3 EmoMeta: A Multimodal Dataset for Fine-grained Emotion Classification in Chinese Metaphors 提出EmoMeta数据集以解决中文隐喻情感分类问题 multimodal
4 Large Language Models and Arabic Content: A Review 综述大型语言模型在阿拉伯语内容处理中的应用与挑战 large language model
5 Characterizing the Investigative Methods of Fictional Detectives with Large Language Models 提出AI驱动的方法系统化分析虚构侦探的调查手法 large language model
6 On the Superimposed Noise Accumulation Problem in Sequential Knowledge Editing of Large Language Models 提出DeltaEdit以解决大语言模型的噪声累积问题 large language model
7 SAS-Bench: A Fine-Grained Benchmark for Evaluating Short Answer Scoring with Large Language Models 提出SAS-Bench以解决短答案评分中的细粒度评估问题 large language model
8 ViMRHP: A Vietnamese Benchmark Dataset for Multimodal Review Helpfulness Prediction via Human-AI Collaborative Annotation 提出ViMRHP数据集以解决越南语多模态评论有用性预测问题 multimodal
9 One Trigger Token Is Enough: A Defense Strategy for Balancing Safety and Usability in Large Language Models 提出D-STT以解决大型语言模型的安全性与可用性平衡问题 large language model
10 Spoken Language Understanding on Unseen Tasks With In-Context Learning 提出随机类标签的无任务特定微调方法以提升SLU性能 large language model
11 Re$^2$: A Consistency-ensured Dataset for Full-stage Peer Review and Multi-turn Rebuttal Discussions 提出Re^2数据集以解决同行评审和反驳讨论中的数据不足问题 large language model
12 OnPrem.LLM: A Privacy-Conscious Document Intelligence Toolkit 提出OnPrem.LLM以解决敏感数据处理中的隐私问题 large language model
13 Semantic Retention and Extreme Compression in LLMs: Can We Have Both? 提出联合剪枝与量化以提升大语言模型压缩性能 large language model
14 Are LLMs complicated ethical dilemma analyzers? 提出伦理困境基准数据集以评估大型语言模型的伦理推理能力 large language model
15 FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning 提出FalseReject以解决大型语言模型的过度拒绝问题 large language model
16 Benchmarking Retrieval-Augmented Generation for Chemistry 提出ChemRAG-Bench以评估化学领域的检索增强生成方法 large language model
17 Concept-Level Explainability for Auditing & Steering LLM Responses 提出ConceptX以解决大语言模型响应的可解释性问题 large language model
18 ToolACE-DEV: Self-Improving Tool Learning via Decomposition and EVolution 提出ToolACE-DEV以解决工具学习中的自我提升问题 large language model
19 QUPID: Quantified Understanding for Enhanced Performance, Insights, and Decisions in Korean Search Engines 提出QUPID以提升韩国搜索引擎的相关性评估 large language model
20 Domain Regeneration: How well do LLMs match syntactic properties of text domains? 探讨大型语言模型在文本领域语法特性匹配的有效性 large language model
21 JobHop: A Large-Scale Dataset of Career Trajectories 提出JobHop数据集以解决职业轨迹分析问题 large language model
22 Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs 提出SENATOR框架以解决大语言模型知识缺陷问题 large language model
23 HAMLET: Healthcare-focused Adaptive Multilingual Learning Embedding-based Topic Modeling 提出HAMLET以解决医疗领域多语言主题建模问题 large language model

🔬 支柱二:RL算法与架构 (RL & Architecture) (8 篇)

#题目一句话要点标签🔗
24 A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models 提出多维约束框架以提升大语言模型的指令遵循能力 reinforcement learning large language model instruction following
25 SEM: Reinforcement Learning for Search-Efficient Large Language Models 提出SEM框架以优化大语言模型的搜索效率 reinforcement learning large language model
26 DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation 提出DynamicRAG以解决RAG系统中文档重排序问题 reinforcement learning large language model
27 Assessing and Mitigating Medical Knowledge Drift and Conflicts in Large Language Models 提出DriftMedQA基准以解决医疗知识漂移问题 direct preference optimization large language model
28 KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification 提出KDH-MLTC以解决医疗多标签文本分类问题 distillation large language model
29 MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining 提出MiMo-7B以增强语言模型的推理能力 reinforcement learning large language model
30 Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent 提出IKEA以解决大语言模型检索能力不足问题 reinforcement learning large language model
31 On the Robustness of Reward Models for Language Model Alignment 提出批量归零正则化以解决奖励模型的过度优化问题 reinforcement learning RLHF

🔬 支柱一:机器人控制 (Robot Control) (1 篇)

#题目一句话要点标签🔗
32 Must Read: A Systematic Survey of Computational Persuasion 系统性调查计算说服力以应对AI驱动的影响力挑战 manipulation

⬅️ 返回 cs.CL 首页 · 🏠 返回主页