cs.AI(2025-05-07)
📊 共 15 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (10 🔗1)
支柱二:RL算法与架构 (RL & Architecture) (4)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (10 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 11 | Large Language Models are Autonomous Cyber Defenders | 提出LLM与RL协同的多智能体网络安全防御框架,提升自主防御能力 | reinforcement learning large language model | ||
| 12 | Is there Value in Reinforcement Learning? | 重新审视强化学习中的价值表征:算法视角下的模型复杂性分析 | reinforcement learning | ||
| 13 | Score Distillation Sampling for Audio: Source Separation, Synthesis, and Beyond | Audio-SDS:将Score Distillation Sampling推广至音频领域,实现音频源分离、合成等任务 | distillation | ||
| 14 | Flow Models for Unbounded and Geometry-Aware Distributional Reinforcement Learning | 提出基于Normalizing Flow的DistRL架构,解决回报分布建模中无界支持和几何感知问题 | reinforcement learning |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 15 | Winning at All Cost: A Small Environment for Eliciting Specification Gaming Behaviors in Large Language Models | 揭示大语言模型在不可能情境下的“系统漏洞利用”行为 | manipulation large language model |