cs.AI(2026-06-03)
📊 共 3 篇论文
🎯 兴趣领域导航
🔬 支柱九:具身大模型 (Embodied Foundation Models) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | R-APS: Compositional Reasoning and In-Context Meta-Learning for Constrained Design via Reflective Adversarial Pareto Search | 提出R-APS以解决长时间规划中的推理失败问题 | large language model | ||
| 2 | Inference-Time Vulnerability Beyond Shallow Safety: Alignment Along Generation Trajectories | 提出生成轨迹对齐方法以增强LLM的安全性 | large language model |
🔬 支柱二:RL算法与架构 (RL & Architecture) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 3 | BiasGRPO: Stabilizing Bias Mitigation in High-Variance Reward Landscapes via Group-Relative Policy Optimization | 提出BiasGRPO以解决高方差奖励环境中的偏见缓解问题 | PPO RLHF DPO |