| 13 |
MSGM: A Multi-Scale Spatiotemporal Graph Mamba for EEG Emotion Recognition |
提出MSGM:一种用于脑电情绪识别的多尺度时空图Mamba模型 |
Mamba spatiotemporal |
|
|
| 14 |
Small LLMs Do Not Learn a Generalizable Theory of Mind via Reinforcement Learning |
小规模LLM难以通过强化学习获得可泛化的心理理论能力 |
reinforcement learning large language model |
|
|
| 15 |
Mixture of Autoencoder Experts Guidance using Unlabeled and Incomplete Data for Exploration in Reinforcement Learning |
提出基于自编码专家混合模型的强化学习探索方法,利用非标记和不完整数据指导学习。 |
reinforcement learning generalist agent |
|
|
| 16 |
Long-Short Distance Graph Neural Networks and Improved Curriculum Learning for Emotion Recognition in Conversation |
提出长短距离图神经网络和改进课程学习方法,用于提升对话情绪识别性能。 |
curriculum learning multimodal |
|
|
| 17 |
Automated Design of Structured Variational Quantum Circuits with Reinforcement Learning |
提出基于强化学习的自动变分量子电路设计方法,优化组合优化问题。 |
reinforcement learning PPO |
|
|
| 18 |
Off-Policy Corrected Reward Modeling for Reinforcement Learning from Human Feedback |
提出Off-Policy修正奖励模型(OCRM)以解决RLHF中的过优化问题 |
reinforcement learning RLHF |
✅ |
|
| 19 |
To Label or Not to Label: PALM -- A Predictive Model for Evaluating Sample Efficiency in Active Learning Models |
提出PALM模型,用于预测主动学习模型在不同标注预算下的样本效率。 |
predictive model |
✅ |
|
| 20 |
Reinforcement Learning in hyperbolic space for multi-step reasoning |
提出基于双曲Transformer的强化学习框架,用于解决多步推理问题 |
reinforcement learning |
|
|
| 21 |
Minor Embedding for Quantum Annealing with Reinforcement Learning |
提出基于强化学习的量子退火次嵌入方法,提升问题规模和硬件拓扑的泛化性 |
reinforcement learning |
|
|
| 22 |
LLM Economist: Large Population Models and Mechanism Design in Multi-Agent Generative Simulacra |
LLM Economist:利用多智能体生成模拟环境进行经济政策设计与评估 |
reinforcement learning large language model |
|
|
| 23 |
Data Mixing Agent: Learning to Re-weight Domains for Continual Pre-training |
提出数据混合Agent,通过强化学习自动学习领域重加权策略,提升持续预训练效果。 |
reinforcement learning large language model |
|
|
| 24 |
Red-Team Multi-Agent Reinforcement Learning for Emergency Braking Scenario |
提出红队多智能体强化学习框架,用于挖掘紧急制动场景中的极端工况。 |
reinforcement learning |
|
|