cs.AI (2026-04-15)
📊 15 papers in total | 🔗 1 with code
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (8 🔗1)
Pillar 2: RL & Architecture (6)
Pillar 1: Robot Control (1)
🔬 Pillar 9: Embodied Foundation Models (8 papers)
🔬 Pillar 2: RL & Architecture (6 papers)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 9 | Towards Scalable Lightweight GUI Agents via Multi-role Orchestration | LAMO: a multi-role orchestration framework for lightweight GUI agents that improves task scalability | reinforcement learning, distillation, large language model | ||
| 10 | Reward Design for Physical Reasoning in Vision-Language Models | Proposes a GRPO-based reward function design method for physical reasoning in vision-language models | reward design | ||
| 11 | Hierarchical Reinforcement Learning with Runtime Safety Shielding for Power Grid Operation | Proposes a hierarchical reinforcement learning method with runtime safety shielding for power system operation control | reinforcement learning | ||
| 12 | RiskWebWorld: A Realistic Interactive Benchmark for GUI Agents in E-commerce Risk Management | RiskWebWorld: a realistic interactive benchmark for GUI agents in e-commerce risk management | reinforcement learning, foundation model | ||
| 13 | Learning from Change: Predictive Models for Incident Prevention in a Regulated IT Environment | Proposes an interpretable LightGBM-based model for predicting IT change risk, aimed at incident prevention in regulated environments such as finance | predictive model | ||
| 14 | Towards Fine-grained Temporal Perception: Post-Training Large Audio-Language Models with Audio-Side Time Prompt | Proposes an audio-side time prompt to address temporal perception in audio-language models | reinforcement learning, TAMP | ||
🔬 Pillar 1: Robot Control (1 paper)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 15 | Secure and Privacy-Preserving Vertical Federated Learning | Proposes a secure, privacy-preserving vertical federated learning framework suited to different deployment scenarios | MPC | ||