| 1 |
Movable Antenna-Equipped UAV for Data Collection in Backscatter Sensor Networks: A Deep Reinforcement Learning-based Approach |
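Proposes a deep reinforcement learning-based data collection scheme for a movable antenna-equipped UAV, optimizing backscatter sensor network performance. |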
reinforcement learning deep reinforcement learning DRL |
|
|
| 2 |
Natural Language Reinforcement Learning |
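Proposes Natural Language Reinforcement Learning (NLRL), which uses a language value function to improve agents' understanding and active learning ability. |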
reinforcement learning large language model |
|
|
| 3 |
Enhancing Prediction Models with Reinforcement Learning |
Aureus: uses reinforcement learning to enhance prediction models and improve news recommendation system performance. |
reinforcement learning large language model |
|
|
| 4 |
Multi-agent reinforcement learning strategy to maximize the lifetime of Wireless Rechargeable |
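Proposes a multi-agent reinforcement learning-based strategy for maximizing the lifetime of wireless rechargeable sensor networks. |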
reinforcement learning PPO |
|
|
| 5 |
Parameter Efficient Mamba Tuning via Projector-targeted Diagonal-centric Linear Transformation |
Proposes ProDiaL, a parameter-efficient fine-tuning method that targets Mamba's projection layers. |
Mamba SSM |
|
|
| 6 |
CodeSAM: Source Code Representation Learning by Infusing Self-Attention with Multi-Code-View Graphs |
CodeSAM: enhances self-attention with multi-code-view graphs to improve source code representation learning. |
representation learning |
|
|
| 7 |
Umbrella Reinforcement Learning -- computationally efficient tool for hard non-linear problems |
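Proposes Umbrella reinforcement learning, which efficiently solves hard non-linear reinforcement learning problems with sparse rewards. |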
reinforcement learning |
|
|
| 8 |
Trajectory Representation Learning on Road Networks and Grids with Spatio-Temporal Dynamics |
TIGR: a trajectory representation learning model that combines the spatio-temporal dynamics of road networks and grids. |
representation learning |
|
|
| 9 |
Learning to Cooperate with Humans using Generative Agents |
Proposes GAMMA, which uses generative models to learn human cooperation strategies and improve human-AI collaboration performance. |
reinforcement learning behavior cloning |
|
|