cs.AI(2024-05-31)
📊 共 16 篇论文 | 🔗 2 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (12 🔗2)
支柱二:RL算法与架构 (RL & Architecture) (3)
支柱五:交互与反应 (Interaction & Reaction) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (12 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 13 | Mind the Inconspicuous: Revealing the Hidden Weakness in Aligned LLMs' Refusal Boundaries | 揭示对齐LLM拒绝边界的隐蔽弱点:EOS token引发上下文分割 | reinforcement learning RLHF large language model | ||
| 14 | OpenTensor: Reproducing Faster Matrix Multiplication Discovering Algorithms | OpenTensor:复现并加速矩阵乘法算法发现,提升计算效率 | reinforcement learning deep reinforcement learning DRL | ||
| 15 | ADESSE: Advice Explanations in Complex Repeated Decision-Making Environments | ADESSE:在复杂重复决策环境中提供可解释建议,提升人机协作。 | reinforcement learning deep reinforcement learning |
🔬 支柱五:交互与反应 (Interaction & Reaction) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 16 | Locking Machine Learning Models into Hardware | 提出一种硬件锁定的机器学习模型保护方案,防止未经授权的使用。 | OMOMO |