cs.LG(2025-01-11)

📊 共 5 篇论文

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (3) 支柱一:机器人控制 (Robot Control) (1) 支柱九:具身大模型 (Embodied Foundation Models) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (3 篇)

#题目一句话要点标签🔗
1 Task Delay and Energy Consumption Minimization for Low-altitude MEC via Evolutionary Multi-objective Deep Reinforcement Learning 提出基于演化多目标深度强化学习的低空MEC任务延迟与能耗最小化方法 reinforcement learning deep reinforcement learning DRL
2 Influencing Humans to Conform to Preference Models for RLHF 通过影响人类偏好表达,使之更符合RLHF的偏好模型假设 reinforcement learning RLHF
3 Hierarchical Reinforcement Learning for Optimal Agent Grouping in Cooperative Systems 提出一种层级强化学习方法,解决合作多智能体系统中的最优分组问题。 reinforcement learning

🔬 支柱一:机器人控制 (Robot Control) (1 篇)

#题目一句话要点标签🔗
4 Reinforcement Learning for Enhancing Sensing Estimation in Bistatic ISAC Systems with UAV Swarms 提出基于多智能体强化学习的无人机群ISAC系统感知增强方法 trajectory optimization reinforcement learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (1 篇)

#题目一句话要点标签🔗
5 Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping 提出Ladder Residual架构,通过通信重叠加速大模型并行推理。 large language model

⬅️ 返回 cs.LG 首页 · 🏠 返回主页