cs.LG(2025-04-11)
📊 共 12 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
🔬 支柱九:具身大模型 (Embodied Foundation Models) (8 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 9 | Investigating the Treacherous Turn in Deep Reinforcement Learning | 研究深度强化学习中的“背叛性转向”现象及应对策略 | reinforcement learning deep reinforcement learning DRL | ||
| 10 | Distilling and exploiting quantitative insights from Large Language Models for enhanced Bayesian optimization of chemical reactions | 利用大语言模型知识蒸馏,增强化学反应贝叶斯优化 | preference learning large language model | ||
| 11 | Scaling Up On-Device LLMs via Active-Weight Swapping Between DRAM and Flash | ActiveFlow:通过DRAM与Flash间主动权重交换,扩展端侧LLM部署规模 | distillation large language model | ||
| 12 | Near-Driven Autonomous Rover Navigation in Complex Environments: Extensions to Urban Search-and-Rescue and Industrial Inspection | 扩展神经进化方法,实现复杂环境下自主机器人近距离导航,应用于城市搜救和工业检测。 | reinforcement learning deep reinforcement learning |