cs.LG(2025-05-22)

📊 共 4 篇论文

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (3) 支柱一:机器人控制 (Robot Control) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)

#题目一句话要点标签🔗
1 Interactive Post-Training for Vision-Language-Action Models 提出RIPT-VLA,通过交互式后训练提升视觉-语言-动作模型的性能。 reinforcement learning vision-language-action VLA
2 Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies 提出OGSRL以解决医疗强化学习中的安全性与有效性问题 reinforcement learning offline RL offline reinforcement learning
3 Efficient Online RL Fine Tuning with Offline Pre-trained Policy Only 提出PORL,仅用离线预训练策略高效微调在线强化学习,无需Q函数。 reinforcement learning offline RL imitation learning

🔬 支柱一:机器人控制 (Robot Control) (1 篇)

#题目一句话要点标签🔗
4 Performance Guaranteed Poisoning Attacks in Federated Learning: A Sliding Mode Approach 提出FedSA:一种基于滑模控制的联邦学习可控投毒攻击方法 manipulation

⬅️ 返回 cs.LG 首页 · 🏠 返回主页