cs.CL(2025-03-27)

📊 共 33 篇论文 | 🔗 9 篇有代码

🎯 兴趣领域导航

支柱九:具身大模型 (Embodied Foundation Models) (31 🔗9) 支柱二:RL算法与架构 (RL & Architecture) (2)

🔬 支柱九:具身大模型 (Embodied Foundation Models) (31 篇)

#题目一句话要点标签🔗
1 Keyword-Oriented Multimodal Modeling for Euphemism Identification 提出关键词导向的多模态隐晦表达识别方法,解决社交媒体内容审核难题。 large language model multimodal
2 Harnessing Chain-of-Thought Metadata for Task Routing and Adversarial Prompt Detection 提出NofT指标,用于任务路由和对抗性提示检测,提升LLM效率与安全性。 large language model chain-of-thought
3 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond 综述高效推理:针对大型推理模型中语言、多模态及其他方面的推理效率提升方法。 multimodal chain-of-thought
4 UGen: Unified Autoregressive Multimodal Model with Progressive Vocabulary Learning UGen:一种基于渐进式词汇学习的统一自回归多模态模型 multimodal
5 Boosting Large Language Models with Mask Fine-Tuning 提出Mask Fine-Tuning,通过掩码微调显著提升大语言模型性能 large language model
6 Local Normalization Distortion and the Thermodynamic Formalism of Decoding Strategies for Large Language Models 通过热力学形式主义分析解码策略,揭示大语言模型局部归一化失真问题 large language model
7 SWI: Speaking with Intent in Large Language Models 提出SWI:通过显式意图提升大语言模型的推理与生成能力 large language model
8 AutoPsyC: Automatic Recognition of Psychodynamic Conflicts from Semi-structured Interviews with Large Language Models AutoPsyC:利用大语言模型自动识别半结构化访谈中的心理动力冲突 large language model
9 Navigating the Risks of Using Large Language Models for Text Annotation in Social Science Research 提出LLM文本标注框架,评估其在社会科学研究中的风险与潜力 large language model
10 JiraiBench: A Bilingual Benchmark for Evaluating Large Language Models' Detection of Human Self-Destructive Behavior Content in Jirai Community JiraiBench:一个双语基准,用于评估大型语言模型对Jirai社区中人类自毁行为内容的检测能力 large language model
11 Evaluating book summaries from internal knowledge in Large Language Models: a cross-model and semantic consistency approach 提出一种跨模型和语义一致性方法,评估大语言模型基于内部知识生成书籍摘要的能力。 large language model
12 OpenHuEval: Evaluating Large Language Model on Hungarian Specifics 提出OpenHuEval,首个面向匈牙利语及特定文化的LLM评测基准。 large language model
13 OmniVox: Zero-Shot Emotion Recognition with Omni-LLMs OmniVox:利用全模态大语言模型实现零样本情感识别 large language model multimodal chain-of-thought
14 Large Language Model Agent: A Survey on Methodology, Applications and Challenges 对大型语言模型智能体的方法、应用与挑战进行综述 large language model
15 Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models 提出OlymMATH奥赛级数学基准,挑战大语言模型复杂推理能力 large language model
16 From User Preferences to Optimization Constraints Using Large Language Models 利用大型语言模型将用户偏好转化为家庭能源优化约束 large language model
17 Leveraging Large Language Models for Risk Assessment in Hyperconnected Logistic Hub Network Deployment 利用大型语言模型进行超互联物流枢纽网络部署中的风险评估 large language model
18 ThinkEdit: Interpretable Weight Editing to Mitigate Overly Short Thinking in Reasoning Models ThinkEdit:通过可解释的权重编辑缓解推理模型中的过度短推理问题 large language model chain-of-thought
19 LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models 提出LLaVA-CMoE,解决LLM在视觉-语言持续学习中的灾难性遗忘和参数效率问题。 large language model multimodal
20 ZJUKLAB at SemEval-2025 Task 4: Unlearning via Model Merging ZJUKLAB提出基于模型融合的LLM敏感内容遗忘方法,在SemEval-2025 Task 4中排名第二。 large language model
21 Effective Skill Unlearning through Intervention and Abstention 提出基于干预和抑制的LLM技能遗忘方法,无需训练且高效。 large language model
22 How do language models learn facts? Dynamics, curricula and hallucinations 研究语言模型学习事实的动态过程,揭示知识获取的阶段性、数据分布影响及幻觉现象。 large language model
23 Shared Global and Local Geometry of Language Model Embeddings 揭示大语言模型嵌入的全局和局部几何相似性,并提出跨模型迁移方法。 large language model
24 Cognitive Prompts Using Guilford's Structure of Intellect Model 利用吉尔福特智力结构模型,提出认知提示工程以提升LLM推理能力 large language model
25 RedditESS: A Mental Health Social Support Interaction Dataset -- Understanding Effective Social Support to Refine AI-Driven Support Tools 提出RedditESS数据集,用于提升AI心理健康支持工具的有效性 large language model
26 ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition 提出ResearchBench,用于评估LLM在科学发现中基于灵感的任务分解能力。 large language model
27 MSPLoRA: A Multi-Scale Pyramid Low-Rank Adaptation for Efficient Model Fine-Tuning MSPLoRA:多尺度金字塔低秩适配,提升模型微调效率 large language model
28 Debate-Driven Multi-Agent LLMs for Phishing Email Detection 提出基于辩论驱动的多Agent LLM钓鱼邮件检测方法 large language model
29 Proof or Bluff? Evaluating LLMs on 2025 USA Math Olympiad 评估LLM在2025年美国数学奥林匹克竞赛中的解题能力:证明还是虚张声势? large language model
30 R-PRM: Reasoning-Driven Process Reward Modeling 提出R-PRM:一种推理驱动的过程奖励建模方法,提升数学推理的准确性和效率。 large language model
31 EmoDebt: Bayesian-Optimized Emotional Intelligence for Strategic Agent-to-Agent Debt Recovery EmoDebt:基于贝叶斯优化的情感智能,用于智能体间债务催收 large language model

🔬 支柱二:RL算法与架构 (RL & Architecture) (2 篇)

#题目一句话要点标签🔗
32 Controlling Large Language Model with Latent Actions CoLA:通过学习紧凑潜在动作空间,提升大型语言模型在强化学习中的可控性和探索能力 reinforcement learning large language model
33 Collab: Controlled Decoding using Mixture of Agents for LLM Alignment Collab:一种基于混合Agent的受控解码方法,用于LLM对齐。 reinforcement learning RLHF large language model

⬅️ 返回 cs.CL 首页 · 🏠 返回主页