| 1 |
DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training |
提出DUMP框架以解决RL基础LLM后训练中的数据分布调度问题 |
reinforcement learning curriculum learning large language model |
✅ |
|
| 2 |
GenEDA: Towards Generative Netlist Functional Reasoning via Cross-Modal Circuit Encoder-Decoder Alignment |
GenEDA:通过跨模态电路编码器-解码器对齐实现生成式网表功能推理 |
representation learning large language model foundation model |
|
|
| 3 |
Causal integration of chemical structures improves representations of microscopy images for morphological profiling |
MICON:利用化学结构因果整合,提升细胞形态学图谱的表征学习 |
representation learning contrastive learning multimodal |
|
|
| 4 |
Adaptive Insurance Reserving with CVaR-Constrained Reinforcement Learning under Macroeconomic Regimes |
提出基于CVaR约束强化学习的自适应保险准备金方法,应对宏观经济环境变化。 |
reinforcement learning PPO |
|
|
| 5 |
Can LLMs Revolutionize the Design of Explainable and Efficient TinyML Models? |
利用LLM驱动的神经架构搜索设计高效可解释的TinyML模型 |
distillation large language model |
|
|