cs.LG(2024-07-11)
📊 共 14 篇论文
🎯 兴趣领域导航
支柱二:RL算法与架构 (RL & Architecture) (8)
支柱一:机器人控制 (Robot Control) (3)
支柱九:具身大模型 (Embodied Foundation Models) (3)
🔬 支柱二:RL算法与架构 (RL & Architecture) (8 篇)
🔬 支柱一:机器人控制 (Robot Control) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 9 | TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations | 提出基于时序距离感知的无监督目标条件强化学习方法TLDR | locomotion reinforcement learning | ||
| 10 | Improve Load Forecasting in Energy Communities through Transfer Learning using Open-Access Synthetic Profiles | 利用开放合成数据和迁移学习提升能源社区负荷预测精度 | model predictive control | ||
| 11 | An Unsupervised Domain Adaptation Method for Locating Manipulated Region in partially fake Audio | 提出SDE方法,利用专家混合模型解决部分伪造音频跨域篡改定位问题。 | manipulation |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 12 | Mitigating Catastrophic Forgetting in Language Transfer via Model Merging | 提出Branch-and-Merge方法,缓解LLM语言迁移中的灾难性遗忘 | large language model | ||
| 13 | FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision | FlashAttention-3:通过异步和低精度加速Transformer Attention计算。 | large language model | ||
| 14 | Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients | 提出Q-GaLore,结合量化与低秩投影,显著降低LLM训练的内存占用。 | large language model |