cs.LG(2025-10-11)
📊 共 4 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Reinforcement Fine-Tuning of Flow-Matching Policies for Vision-Language-Action Models | 提出流政策优化算法以提升视觉-语言-动作模型的强化学习效果 | reinforcement learning flow matching vision-language-action | ||
| 2 | RLFR: Extending Reinforcement Learning for LLMs with Flow Environment | 提出RLFR:利用流环境扩展LLM的强化学习,提升推理能力 | reinforcement learning reward shaping large language model | ||
| 3 | Structured Cooperative Multi-Agent Reinforcement Learning: a Bayesian Network Perspective | 提出基于贝叶斯网络的结构化合作多智能体强化学习方法,提升大规模系统效率。 | reinforcement learning |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 4 | INR-Bench: A Unified Benchmark for Implicit Neural Representations in Multi-Domain Regression and Reconstruction | 提出INR-Bench以解决多模态隐式神经表示的评估问题 | multimodal | ✅ |