cs.LG(2025-12-24)
📊 共 16 篇论文 | 🔗 2 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (9)
支柱二:RL算法与架构 (RL & Architecture) (6 🔗2)
支柱七:动作重定向 (Motion Retargeting) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (9 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (6 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 10 | ReACT-Drug: Reaction-Template Guided Reinforcement Learning for de novo Drug Design | ReACT-Drug:基于反应模板引导的强化学习用于全新药物设计 | reinforcement learning PPO representation learning | ✅ | |
| 11 | dUltra: Ultra-Fast Diffusion Language Models via Reinforcement Learning | 提出dUltra,通过强化学习加速扩散语言模型并行解码,提升推理效率。 | reinforcement learning distillation | ||
| 12 | A Survey of Freshness-Aware Wireless Networking with Reinforcement Learning | 综述:基于强化学习的面向信息新鲜度的无线网络研究 | reinforcement learning | ||
| 13 | Model Merging via Multi-Teacher Knowledge Distillation | 提出SAMerging,通过多教师知识蒸馏实现模型合并,提升泛化性能。 | distillation | ✅ | |
| 14 | MiST: Understanding the Role of Mid-Stage Scientific Training in Developing Chemical Reasoning Models | 提出MiST:通过中阶段科学训练提升化学推理模型性能 | reinforcement learning large language model | ||
| 15 | Shared Representation Learning for High-Dimensional Multi-Task Forecasting under Resource Contention in Cloud-Native Backends | 提出用于云原生后端高维多任务预测的共享表示学习框架,解决资源竞争下的预测难题。 | representation learning |
🔬 支柱七:动作重定向 (Motion Retargeting) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 16 | Temporal Visual Semantics-Induced Human Motion Understanding with Large Language Models | 提出基于大语言模型的时间视觉语义引导的人体运动分割方法 | human motion large language model |