cs.LG(2025-05-22)

📊 共 4 篇论文

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (3) 支柱一:机器人控制 (Robot Control) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)

#题目一句话要点标签🔗
1 Interactive Post-Training for Vision-Language-Action Models 提出RIPT-VLA以解决VLA模型适应性不足问题 reinforcement learning vision-language-action VLA
2 Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies 提出OGSRL以解决医疗强化学习中的OOD问题 reinforcement learning offline RL offline reinforcement learning
3 Efficient Online RL Fine Tuning with Offline Pre-trained Policy Only 提出PORL方法以解决在线强化学习微调中对Q函数的依赖问题 reinforcement learning offline RL imitation learning

🔬 支柱一:机器人控制 (Robot Control) (1 篇)

#题目一句话要点标签🔗
4 Performance Guaranteed Poisoning Attacks in Federated Learning: A Sliding Mode Approach 提出滑模控制方法以解决联邦学习中的数据投毒攻击问题 manipulation

⬅️ 返回 cs.LG 首页 · 🏠 返回主页