cs.AI(2026-04-21)
📊 共 20 篇论文 | 🔗 2 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (11 🔗2)
支柱二:RL算法与架构 (RL & Architecture) (7)
支柱一:机器人控制 (Robot Control) (2)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (11 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (7 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 12 | DT2IT-MRM: Debiased Preference Construction and Iterative Training for Multimodal Reward Modeling | DT2IT-MRM:通过去偏好构建与迭代训练提升多模态奖励模型性能 | RLHF large language model multimodal | ||
| 13 | OLLM: Options-based Large Language Models | OLLM:基于选项的大语言模型,提升数学推理任务的可控性和效率。 | reinforcement learning policy learning large language model | ||
| 14 | Cyber Defense Benchmark: Agentic Threat Hunting Evaluation for LLMs in SecOps | 提出网络安全防御基准,评估LLM在SecOps中威胁狩猎任务的表现 | reinforcement learning large language model TAMP | ||
| 15 | Multi-modal Reasoning with LLMs for Visual Semantic Arithmetic | 提出SAri-RFT,增强LVLM在视觉语义算术任务中的推理能力,应用于机器人领域。 | reinforcement learning large language model | ||
| 16 | Reasoning-Aware AIGC Detection via Alignment and Reinforcement | 提出REVEAL框架,通过对齐和强化推理能力提升AIGC文本检测性能 | reinforcement learning large language model | ||
| 17 | Reinforcement Learning Improves LLM Accuracy and Reasoning in Disease Classification from Radiology Reports | 利用强化学习提升LLM在放射报告疾病分类中的准确性和推理能力 | reinforcement learning | ||
| 18 | Reasoning Structure Matters for Safety Alignment of Reasoning Models | AltTrain:通过改变推理结构实现推理模型安全对齐 | reinforcement learning reward design |
🔬 支柱一:机器人控制 (Robot Control) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 19 | Detecting Data Contamination in Large Language Models | 评估黑盒成员推理攻击在大型语言模型数据污染检测中的可靠性 | manipulation large language model | ||
| 20 | Large Language Models Exhibit Normative Conformity | 揭示大语言模型中的规范性顺从,为LLM多智能体系统决策提供安全保障。 | manipulation large language model |