cs.AI(2026-02-23)
📊 共 17 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (10 🔗1)
支柱二:RL算法与架构 (RL & Architecture) (6)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (10 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (6 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 11 | IR$^3$: Contrastive Inverse Reinforcement Learning for Interpretable Detection and Mitigation of Reward Hacking | IR$^3$:通过对比逆强化学习实现奖励篡改的可解释检测与缓解 | reinforcement learning inverse reinforcement learning RLHF | ||
| 12 | Human-Guided Agentic AI for Multimodal Clinical Prediction: Lessons from the AgentDS Healthcare Benchmark | 提出人机协作的Agentic AI,用于多模态临床预测,并在AgentDS医疗基准上验证有效性。 | MAE multimodal | ||
| 13 | Tri-Subspaces Disentanglement for Multimodal Sentiment Analysis | 提出Tri-Subspace Disentanglement框架,解决多模态情感分析中模态间信息融合不充分的问题。 | MAE multimodal | ||
| 14 | Ada-RS: Adaptive Rejection Sampling for Selective Thinking | Ada-RS:自适应拒绝采样提升工具型LLM选择性推理效率 | DPO large language model chain-of-thought | ||
| 15 | Beyond Mimicry: Toward Lifelong Adaptability in Imitation Learning | 面向终身适应性的模仿学习:超越单纯模仿,实现组合泛化 | imitation learning | ||
| 16 | Meta-Learning and Meta-Reinforcement Learning - Tracing the Path towards DeepMind's Adaptive Agent | 综述元学习与元强化学习,追溯DeepMind自适应Agent的技术演进路径 | reinforcement learning |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 17 | Agentic AI as a Cybersecurity Attack Surface: Threats, Exploits, and Defenses in Runtime Supply Chains | 针对Agentic AI运行时供应链的攻击面研究:威胁、利用与防御 | manipulation large language model |