| 14 |
Multimodal large language model for wheat breeding: a new exploration of smart breeding |
Proposes a multimodal large language model to address knowledge mining in wheat breeding |
reinforcement learning RLHF large language model |
|
|
| 15 |
S$^2$ALM: Sequence-Structure Pre-trained Large Language Model for Comprehensive Antibody Representation Learning |
Proposes S$^2$ALM, which fuses sequence and structure information for comprehensive antibody representation learning |
representation learning large language model foundation model |
|
|
| 16 |
Engagement-Driven Content Generation with Large Language Models |
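Proposes a reinforcement-learning-based framework that uses large language models to generate content with high social engagement |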
reinforcement learning large language model |
✅ |
|
| 17 |
A Survey On Enhancing Reinforcement Learning in Complex Environments: Insights from Human and LLM Feedback |
Survey: enhancing reinforcement learning in complex environments using human and LLM feedback |
reinforcement learning large language model |
|
|
| 18 |
DRL-Based Optimization for AoI and Energy Consumption in C-V2X Enabled IoV |
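Proposes a DRL-based method for optimizing AoI and energy consumption in C-V2X vehicular networks (IoV), addressing resource conflicts and competing performance objectives |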
reinforcement learning deep reinforcement learning DRL |
|
|
| 19 |
Competence-Aware AI Agents with Metacognition for Unknown Situations and Environments (MUSE) |
Proposes the MUSE framework, which gives AI agents metacognitive abilities to improve their adaptability to unknown environments |
reinforcement learning world model large language model |
|
|
| 20 |
MERLOT: A Distilled LLM-based Mixture-of-Experts Framework for Scalable Encrypted Traffic Classification |
Proposes MERLOT, a distilled-LLM-based mixture-of-experts framework for scalable encrypted traffic classification |
teacher-student distillation large language model |
|
|
| 21 |
Almost Sure Convergence Rates and Concentration of Stochastic Approximation and Reinforcement Learning with Markovian Noise |
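Establishes almost sure convergence rates and concentration bounds for stochastic approximation and reinforcement learning algorithms with Markovian noise |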
reinforcement learning |
|
|
| 22 |
Effective Analog ICs Floorplanning with Relational Graph Neural Networks and Reinforcement Learning |
Proposes an automatic floorplanning method for analog ICs based on relational graph neural networks and reinforcement learning |
reinforcement learning |
|
|
| 23 |
Conditional Distribution Learning for Graph Classification |
Proposes conditional distribution learning (CDL) for semi-supervised graph classification |
representation learning contrastive learning |
|
|