cs.AI(2026-06-03)

📊 共 3 篇论文

🎯 兴趣领域导航

支柱九:具身大模型 (Embodied Foundation Models) (2) 支柱二:RL算法与架构 (RL & Architecture) (1)

🔬 支柱九:具身大模型 (Embodied Foundation Models) (2 篇)

#题目一句话要点标签🔗
1 R-APS: Compositional Reasoning and In-Context Meta-Learning for Constrained Design via Reflective Adversarial Pareto Search 提出R-APS以解决长时间规划中的推理失败问题 large language model
2 Inference-Time Vulnerability Beyond Shallow Safety: Alignment Along Generation Trajectories 提出生成轨迹对齐方法以增强LLM的安全性 large language model

🔬 支柱二:RL算法与架构 (RL & Architecture) (1 篇)

#题目一句话要点标签🔗
3 BiasGRPO: Stabilizing Bias Mitigation in High-Variance Reward Landscapes via Group-Relative Policy Optimization 提出BiasGRPO以解决高方差奖励环境中的偏见缓解问题 PPO RLHF DPO

⬅️ 返回 cs.AI 首页 · 🏠 返回主页