cs.AI(2025-12-28)
📊 共 18 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (12 🔗1)
支柱二:RL算法与架构 (RL & Architecture) (5)
支柱四:生成式动作 (Generative Motion) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (12 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 13 | SAMP-HDRL: Segmented Allocation with Momentum-Adjusted Utility for Multi-agent Portfolio Management via Hierarchical Deep Reinforcement Learning | SAMP-HDRL:通过分层深度强化学习进行多智能体投资组合管理的动态分段配置方法 | reinforcement learning deep reinforcement learning DRL | ||
| 14 | Benchmark Success, Clinical Failure: When Reinforcement Learning Optimizes for Benchmarks, Not Patients | ChexReason揭示强化学习在医学影像中优化基准测试而非患者的困境 | reinforcement learning large language model | ||
| 15 | Reinforcement Networks: novel framework for collaborative Multi-Agent Reinforcement Learning tasks | 提出Reinforcement Networks框架,解决协作式多智能体强化学习任务中的复杂结构建模与训练问题 | reinforcement learning | ||
| 16 | Audited Skill-Graph Self-Improvement for Agentic LLMs via Verifiable Rewards, Experience Synthesis, and Continual Memory | 提出ASG-SI框架,通过可验证技能图自提升Agentic LLM的安全性与可控性 | reinforcement learning large language model | ||
| 17 | Heterogeneity in Multi-Agent Reinforcement Learning | 提出异构性定义与量化方法,并应用于多智能体动态参数共享,提升MARL性能。 | reinforcement learning |
🔬 支柱四:生成式动作 (Generative Motion) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 18 | Agentic AI for Cyber Resilience: A New Security Paradigm and Its System-Theoretic Foundations | 提出基于Agentic AI的赛博韧性安全范式,解决传统安全架构的局限性。 | penetration large language model foundation model |