| 1 |
HAVEN: Hierarchical Adversary-aware Visibility-Enabled Navigation with Cover Utilization using Deep Transformer Q-Networks |
提出HAVEN:一种利用深度Transformer Q网络的分层对抗感知导航方法,提升部分可观测环境下的安全性。 |
reinforcement learning point cloud navigation |
|
|
| 2 |
Sample-Efficient Expert Query Control in Active Imitation Learning via Conformal Prediction |
提出CRSAIL,通过保角预测提升主动模仿学习的样本效率,显著降低专家查询次数。 |
imitation learning MuJoCo |
|
|
| 3 |
Hardware-Software Collaborative Computing of Photonic Spiking Reinforcement Learning for Robotic Continuous Control |
提出基于光子脉冲神经网络的硬件-软件协同计算架构,用于机器人连续控制。 |
reinforcement learning |
|
|