cs.LG(2025-12-31)
📊 17 papers in total
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (10)
Pillar 2: RL & Architecture (6)
Pillar 8: Physics-based Animation (1)
🔬 Pillar 9: Embodied Foundation Models (10 papers)
🔬 Pillar 2: RL & Architecture (6 papers)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 11 | Sparse Offline Reinforcement Learning with Corruption Robustness | Proposes an actor-critic algorithm based on sparse robust estimation to address data corruption in sparse offline RL. | reinforcement learning offline RL offline reinforcement learning | | |
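| 12 | From Perception to Punchline: Empowering VLM with the Art of In-the-wild Meme | Proposes the HUMOR framework, which enables VLMs to generate in-the-wild memes that are funnier and better aligned with human preferences. | reinforcement learning HuMoR multimodal | | |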
| 13 | Many Minds from One Model: Bayesian Transformers for Population Intelligence | Proposes Population Bayesian Transformers to improve the diversity and decision-making ability of Transformer models. | reinforcement learning large language model | | |
| 14 | ResponseRank: Data-Efficient Reward Modeling through Preference Strength Learning | Introduces ResponseRank, which achieves data-efficient reward modeling through preference strength learning. | reinforcement learning preference learning RLHF | | |
| 15 | Attribution-Guided Distillation of Matryoshka Sparse Autoencoders | Proposes DMSAE, which uses attribution-guided distillation of Matryoshka sparse autoencoders to improve feature consistency and transferability. | distillation | | |
| 16 | Robust Bayesian Dynamic Programming for On-policy Risk-sensitive Reinforcement Learning | Proposes robust Bayesian dynamic programming to handle transition uncertainty in on-policy risk-sensitive reinforcement learning. | reinforcement learning | | |
🔬 Pillar 8: Physics-based Animation (1 paper)
| # | Title | One-line summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
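| 17 | Learning Temporally Consistent Turbulence Between Sparse Snapshots via Diffusion Models | Proposes a temporally consistent turbulence interpolation method based on conditional diffusion models for reconstructing turbulence between sparse snapshots. | spatiotemporal | | |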