cs.AI(2024-10-31)
📊 共 15 篇论文 | 🔗 3 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (11 🔗2)
支柱二:RL算法与架构 (RL & Architecture) (3 🔗1)
支柱四:生成式动作 (Generative Motion) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (11 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 12 | RL-STaR: Theoretical Analysis of Reinforcement Learning Frameworks for Self-Taught Reasoner | RL-STaR:为自学习推理器提供强化学习框架的理论分析 | reinforcement learning large language model chain-of-thought | ||
| 13 | Towards Reliable Alignment: Uncertainty-aware RLHF | 提出不确定性感知的RLHF方法,提升语言模型对齐的可靠性 | reinforcement learning RLHF large language model | ||
| 14 | Reinforcement learning with learned gadgets to tackle hard quantum problems on real hardware | 提出Gadget强化学习,解决真实量子硬件上的复杂量子问题 | reinforcement learning | ✅ |
🔬 支柱四:生成式动作 (Generative Motion) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 15 | ADAPT: A Game-Theoretic and Neuro-Symbolic Framework for Automated Distributed Adaptive Penetration Testing | 提出ADAPT框架,利用博弈论和神经符号方法实现AI医疗基础设施的自适应渗透测试。 | penetration |