cs.AI (2025-04-10)
📊 12 papers in total | 🔗 1 with code
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (8 🔗1)
Pillar 2: RL & Architecture (3)
Pillar 7: Motion Retargeting (1)
🔬 Pillar 9: Embodied Foundation Models (8 papers)
🔬 Pillar 2: RL & Architecture (3 papers)
| # | Title | One-Sentence Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 9 | 2D-Curri-DPO: Two-Dimensional Curriculum Learning for Direct Preference Optimization | Proposes 2D-Curri-DPO, which uses two-dimensional curriculum learning to optimize the alignment of language models with human preferences. | reinforcement learning DPO direct preference optimization | | |
| 10 | Enhancing Player Enjoyment with a Two-Tier DRL and LLM-Based Agent System for Fighting Games | Proposes a fighting-game agent system based on two-tier DRL and an LLM to increase player enjoyment. | reinforcement learning deep reinforcement learning DRL | | |
| 11 | Genetic Programming with Reinforcement Learning Trained Transformer for Real-World Dynamic Scheduling Problems | Proposes genetic programming with a reinforcement-learning-trained Transformer (GPRT) to solve real-world dynamic scheduling problems. | reinforcement learning | | |
🔬 Pillar 7: Motion Retargeting (1 paper)
| # | Title | One-Sentence Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 12 | S2Vec: Self-Supervised Geospatial Embeddings for the Built Environment | S2Vec: a self-supervised framework for learning geospatial embeddings of the built environment. | spatial relationship multimodal | | |