cs.LG (2024-12-28)

📊 5 papers in total | 🔗 1 with code

🎯 Interest Area Navigation

Pillar 2: RL & Architecture (2 papers, 🔗 1 with code) · Pillar 9: Embodied Foundation Models (1) · Pillar 8: Physics-based Animation (1) · Pillar 1: Robot Control (1)

🔬 Pillar 2: RL & Architecture (2 papers)

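| # | Title | One-sentence summary | Tags | 🔗 |
|---|-------|----------------------|------|----|
| 1 | Efficient and Scalable Deep Reinforcement Learning for Mean Field Control Games | Proposes an efficient and scalable deep reinforcement learning algorithm for solving mean field control games. | reinforcement learning, deep reinforcement learning, PPO | |
| 2 | Imitation Learning from Suboptimal Demonstrations via Meta-Learning An Action Ranker | Proposes ILMAR, which meta-learns an action ranker to perform imitation learning from suboptimal demonstrations. | imitation learning, behavior cloning | |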

🔬 Pillar 9: Embodied Foundation Models (1 paper)

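| # | Title | One-sentence summary | Tags | 🔗 |
|---|-------|----------------------|------|----|
| 3 | DecDEC: A Systems Approach to Advancing Low-Bit LLM Quantization | DecDEC: a systems approach that improves low-bit LLM quantization performance through dynamic residual correction. | large language model | |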

🔬 Pillar 8: Physics-based Animation (1 paper)

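| # | Title | One-sentence summary | Tags | 🔗 |
|---|-------|----------------------|------|----|
| 4 | Transforming CCTV cameras into NO$_2$ sensors at city scale for adaptive policymaking | Uses city surveillance cameras and a graph-based deep model to predict NO₂ concentrations at city scale, supporting adaptive policymaking. | spatiotemporal | |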

🔬 Pillar 1: Robot Control (1 paper)

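| # | Title | One-sentence summary | Tags | 🔗 |
|---|-------|----------------------|------|----|
| 5 | Global Search of Optimal Spacecraft Trajectories using Amortization and Deep Generative Models | Proposes an amortized optimization method based on deep generative models to accelerate the global search of spacecraft trajectories. | trajectory optimization | |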
