cs.LG(2024-08-11)
📊 共 5 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
支柱二:RL算法与架构 (RL & Architecture) (2)
支柱九:具身大模型 (Embodied Foundation Models) (2 🔗1)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱二:RL算法与架构 (RL & Architecture) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | SMILES-Mamba: Chemical Mamba Foundation Models for Drug ADMET Prediction | SMILES-Mamba:用于药物ADMET预测的化学Mamba基础模型 | Mamba foundation model | ||
| 2 | CURLing the Dream: Contrastive Representations for World Modeling in Reinforcement Learning | Curled-Dreamer:融合对比学习的DreamerV3,提升视觉强化学习性能 | reinforcement learning world model dreamer |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 3 | Using Retriever Augmented Large Language Models for Attack Graph Generation | 利用检索增强的大语言模型自动生成攻击图,提升网络安全态势感知。 | large language model | ||
| 4 | Post-Training Sparse Attention with Double Sparsity | 提出双重稀疏注意力,通过后训练稀疏化加速大语言模型推理。 | large language model | ✅ |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 5 | A Single Goal is All You Need: Skills and Exploration Emerge from Contrastive RL without Rewards, Demonstrations, or Subgoals | 基于对比学习的无奖励强化学习,实现技能涌现与自主探索 | manipulation |