| 8 |
Anomalous Decision Discovery using Inverse Reinforcement Learning |
提出基于逆强化学习的异常决策发现框架以解决自动驾驶中的异常检测问题 |
reinforcement learning inverse reinforcement learning |
✅ |
|
| 9 |
CLIP-RL: Surgical Scene Segmentation Using Contrastive Language-Vision Pretraining & Reinforcement Learning |
提出CLIP-RL,利用对比语言-视觉预训练和强化学习进行手术场景分割。 |
reinforcement learning contrastive learning curriculum learning |
|
|
| 10 |
DC-Mamber: A Dual Channel Prediction Model based on Mamba and Linear Transformer for Multivariate Time Series Forecasting |
提出基于Mamba和线性Transformer的双通道预测模型DC-Mamber,用于提升多元时间序列预测精度。 |
Mamba SSM state space model |
|
|
| 11 |
WebSynthesis: World-Model-Guided MCTS for Efficient WebUI-Trajectory Synthesis |
WebSynthesis:利用世界模型引导的MCTS高效合成WebUI交互轨迹 |
world model large language model |
|
|
| 12 |
VOLTRON: Detecting Unknown Malware Using Graph-Based Zero-Shot Learning |
提出基于图的零样本学习框架VOLTRON,用于检测未知恶意软件。 |
Voltron |
|
|