cs.LG (2026-01-16)
📊 23 papers in total | 🔗 1 with code
🎯 Interest Area Navigation
Pillar 2: RL & Architecture (9 🔗1)
Pillar 9: Embodied Foundation Models (8)
Pillar 1: Robot Control (4)
Pillar 4: Generative Motion (2)
🔬 Pillar 2: RL & Architecture (9 papers)
🔬 Pillar 9: Embodied Foundation Models (8 papers)
🔬 Pillar 1: Robot Control (4 papers)
| # | Title | One-Sentence Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
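| 18 | Knowledge is Not Enough: Injecting RL Skills for Continual Adaptation | Proposes the PaST framework, which injects reinforcement learning skill vectors to enable continual knowledge adaptation in LLMs | manipulation reinforcement learning large language model | | |
| 19 | Toward Adaptive Grid Resilience: A Gradient-Free Meta-RL Framework for Critical Load Restoration | Proposes an adaptive grid restoration framework based on gradient-free meta-reinforcement learning to improve critical load restoration | model predictive control reinforcement learning | | |
| 20 | QUPID: A Partitioned Quantum Neural Network for Anomaly Detection in Smart Grid | Proposes QUPID, a partitioned quantum neural network for anomaly detection in smart grids | manipulation | | |
| 21 | IMS: Intelligent Hardware Monitoring System for Secure SoCs | Proposes IMS, a neural-network-based hardware monitoring system for securing the AXI bus in SoCs | manipulation | | |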
🔬 Pillar 4: Generative Motion (2 papers)
| # | Title | One-Sentence Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 22 | GenDA: Generative Data Assimilation on Complex Urban Areas via Classifier-Free Diffusion Guidance | Proposes the GenDA framework to address urban wind flow reconstruction | classifier-free guidance sparse sensors | | |
| 23 | TimeMar: Multi-Scale Autoregressive Modeling for Unconditional Time Series Generation | Proposes TimeMar, a multi-scale autoregressive model for unconditional time series generation | VQ-VAE | | |