cs.LG(2024-05-04)

📊 共 7 篇论文

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (5) 支柱九:具身大模型 (Embodied Foundation Models) (2)

🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)

#题目一句话要点标签🔗
1 Guidance Design for Escape Flight Vehicle Using Evolution Strategy Enhanced Deep Reinforcement Learning 提出基于进化策略增强深度强化学习的逃逸飞行器制导方法 reinforcement learning deep reinforcement learning DRL
2 Sub-goal Distillation: A Method to Improve Small Language Agents 提出子目标蒸馏方法,提升小语言模型在交互式任务中的性能。 imitation learning distillation large language model
3 From Generalization Analysis to Optimization Designs for State Space Models 针对状态空间模型,提出基于泛化分析的优化设计方案,提升训练效果。 SSM state space model foundation model
4 Generic Multi-modal Representation Learning for Network Traffic Analysis 提出一种通用的多模态表征学习方法,用于网络流量分析 representation learning MAE
5 Taming Equilibrium Bias in Risk-Sensitive Multi-Agent Reinforcement Learning 提出风险平衡后悔值,解决风险敏感多智能体强化学习中的均衡偏差问题 reinforcement learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (2 篇)

#题目一句话要点标签🔗
6 Beyond Unimodal Learning: The Importance of Integrating Multiple Modalities for Lifelong Learning 提出多模态持续学习基准,探索多模态融合在缓解灾难性遗忘中的作用 multimodal
7 Random Masking Finds Winning Tickets for Parameter Efficient Fine-tuning 提出随机掩码方法,以更少参数高效微调大型语言模型 large language model

⬅️ 返回 cs.LG 首页 · 🏠 返回主页