cs.LG(2023-12-21)
📊 4 papers in total
🎯 Interest Area Navigation
Pillar 9: Embodied Foundation Models (2)
Pillar 2: RL & Architecture (1)
Pillar 1: Robot Control (1)
🔬 Pillar 9: Embodied Foundation Models (2 papers)
| # | Title | One-Sentence Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Capture the Flag: Uncovering Data Insights with Large Language Models | Automates data insight extraction using large language models. | large language model | | |
| 2 | The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction | Proposes LASER, which improves language model reasoning through layer-selective rank reduction. | large language model | | |
🔬 Pillar 2: RL & Architecture (1 paper)
| # | Title | One-Sentence Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 3 | Deep Reinforcement Learning Based Placement for Integrated Access Backhauling in UAV-Assisted Wireless Networks | Proposes a deep reinforcement learning based UAV placement method to optimize 5G IAB network performance. | reinforcement learning, deep reinforcement learning, DRL | | |
🔬 Pillar 1: Robot Control (1 paper)
| # | Title | One-Sentence Summary | Tags | 🔗 | ⭐ |
|---|---|---|---|---|---|
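| 4 | Diffusion Reward: Learning Rewards via Conditional Video Diffusion | Proposes Diffusion Reward, which learns reward functions via conditional video diffusion models to tackle complex visual reinforcement learning problems. | manipulation, reinforcement learning | | |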