cs.LG(2025-06-21)
📊 共 15 篇论文 | 🔗 1 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (10 🔗1)
支柱二:RL算法与架构 (RL & Architecture) (4)
支柱一:机器人控制 (Robot Control) (1)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (10 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 11 | Aligning Frozen LLMs by Reinforcement Learning: An Iterative Reweight-then-Optimize Approach | 提出迭代重加权优化(IRO)框架,无需微调即可对齐冻结LLM。 | reinforcement learning RLHF DPO | ||
| 12 | Research on Low-Latency Inference and Training Efficiency Optimization for Graph Neural Network and Large Language Model-Based Recommendation Systems | 针对GNN与LLM混合推荐系统,提出低延迟推理与高效训练的优化方案 | distillation large language model | ||
| 13 | Beyond instruction-conditioning, MoTE: Mixture of Task Experts for Multi-task Embedding Models | 提出MoTE:用于多任务嵌入模型的混合任务专家方法,提升低容量模型性能。 | representation learning contrastive learning | ||
| 14 | Predicting E-commerce Purchase Behavior using a DQN-Inspired Deep Learning Model for enhanced adaptability | 提出DQN启发的深度学习模型,预测电商购买意图并提升适应性。 | reinforcement learning predictive model |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 15 | Accelerating Residual Reinforcement Learning with Uncertainty Estimation | 利用不确定性估计加速残差强化学习,提升样本效率并支持随机策略。 | sim-to-real reinforcement learning |