cs.AI(2025-01-31)
📊 共 21 篇论文
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (14)
支柱二:RL算法与架构 (RL & Architecture) (6)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (14 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (6 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 15 | Jackpot! Alignment as a Maximal Lottery | 提出基于最大彩票的对齐方法,提升LLM在人类反馈学习中的鲁棒性 | reinforcement learning RLHF large language model | ||
| 16 | In Pursuit of Predictive Models of Human Preferences Toward AI Teammates | 探究人类对AI队友偏好的预测模型,用于Hanabi合作博弈 | reinforcement learning predictive model | ||
| 17 | An Empirical Game-Theoretic Analysis of Autonomous Cyber-Defence Agents | 提出基于潜在奖励塑造和多响应预言机的深度强化学习网络攻防博弈分析框架 | reinforcement learning deep reinforcement learning DRL | ||
| 18 | Objective Metrics for Human-Subjects Evaluation in Explainable Reinforcement Learning | 提出基于客观指标的XRL评估方法,用于调试智能体行为和支持人机协作。 | reinforcement learning | ||
| 19 | SHARPIE: A Modular Framework for Reinforcement Learning and Human-AI Interaction Experiments | SHARPIE:用于人机交互强化学习实验的模块化通用框架 | reinforcement learning | ||
| 20 | Enabling Autonomic Microservice Management through Self-Learning Agents | 提出ServiceOdyssey,通过自学习Agent实现微服务自治管理 | curriculum learning large language model |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 21 | Enhancing Model Defense Against Jailbreaks with Proactive Safety Reasoning | 提出Safety Chain-of-Thought,增强LLM防御对抗性攻击的能力 | manipulation large language model chain-of-thought |