cs.LG（2024-12-28）

📊 共 5 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱二：RL算法与架构 (RL & Architecture) (2 🔗1) 支柱九：具身大模型 (Embodied Foundation Models) (1) 支柱八：物理动画 (Physics-based Animation) (1) 支柱一：机器人控制 (Robot Control) (1)

🔬 支柱二：RL算法与架构 (RL & Architecture) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Efficient and Scalable Deep Reinforcement Learning for Mean Field Control Games	提出高效可扩展的深度强化学习算法，求解平均场控制博弈问题	reinforcement learning deep reinforcement learning PPO
2	Imitation Learning from Suboptimal Demonstrations via Meta-Learning An Action Ranker	提出ILMAR，通过元学习动作排序器，从次优演示中进行模仿学习	imitation learning behavior cloning	✅

🔬 支柱九：具身大模型 (Embodied Foundation Models) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
3	DecDEC: A Systems Approach to Advancing Low-Bit LLM Quantization	DecDEC：一种通过动态残差校正提升低比特LLM量化性能的系统方案	large language model

🔬 支柱八：物理动画 (Physics-based Animation) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
4	Transforming CCTV cameras into NO$_2$ sensors at city scale for adaptive policymaking	利用城市监控摄像头和图深度模型实现城市尺度NO₂浓度预测，助力自适应政策制定	spatiotemporal

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
5	Global Search of Optimal Spacecraft Trajectories using Amortization and Deep Generative Models	提出基于深度生成模型的摊销优化方法，加速航天器轨迹全局搜索。	trajectory optimization

⬅️ 返回 cs.LG 首页 · 🏠 返回主页