cs.LG(2024-07-31)

📊 共 9 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (5 🔗1) 支柱九:具身大模型 (Embodied Foundation Models) (3 🔗1) 支柱一:机器人控制 (Robot Control) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)

#题目一句话要点标签🔗
1 Adaptive Transit Signal Priority based on Deep Reinforcement Learning and Connected Vehicles in a Traffic Microsimulation Environment 提出基于深度强化学习和车联网的自适应公交信号优先控制方法 reinforcement learning deep reinforcement learning
2 Multi-agent reinforcement learning for the control of three-dimensional Rayleigh-Bénard convection 首次提出基于多智能体强化学习的三维Rayleigh-Bénard对流控制方法 reinforcement learning deep reinforcement learning DRL
3 On the Perturbed States for Transformed Input-robust Reinforcement Learning 提出TIRL,通过输入变换增强强化学习在对抗扰动下的鲁棒性 reinforcement learning
4 Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation 提出Bellman无偏性以实现高效的分布式强化学习 reinforcement learning
5 Black box meta-learning intrinsic rewards for sparse-reward environments 提出一种黑盒元学习内在奖励方法,用于解决稀疏奖励环境下的强化学习问题 reinforcement learning deep reinforcement learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (3 篇)

#题目一句话要点标签🔗
6 Vera Verto: Multimodal Hijacking Attack 提出Vera Verto,实现图像分类模型上的多模态劫持攻击,将NLP任务植入图像模型。 multimodal
7 CXSimulator: A User Behavior Simulation using LLM Embeddings for Web-Marketing Campaign Assessment CXSimulator:利用LLM嵌入进行Web营销活动评估的用户行为模拟 large language model
8 Big Cooperative Learning 提出“大合作学习”框架,统一解释并改进大型模型训练范式。 foundation model

🔬 支柱一:机器人控制 (Robot Control) (1 篇)

#题目一句话要点标签🔗
9 ProSpec RL: Plan Ahead, then Execute 提出ProSpec RL,通过前瞻规划提升强化学习决策质量与安全性 model predictive control reinforcement learning

⬅️ 返回 cs.LG 首页 · 🏠 返回主页