| 1 |
Universal and Transferable Adversarial Attack on Large Language Models Using Exponentiated Gradient Descent |
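Proposes a universal adversarial attack method based on exponentiated gradient descent to improve the robustness of large language models |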
reinforcement learning RLHF large language model |
|
|
| 2 |
PepThink-R1: LLM for Interpretable Cyclic Peptide Optimization with CoT SFT and Reinforcement Learning |
Proposes PepThink-R1 to address the interpretability problem in cyclic peptide optimization |
reinforcement learning large language model chain-of-thought |
|
|
| 3 |
PGF-Net: A Progressive Gated-Fusion Framework for Efficient Multimodal Sentiment Analysis |
Proposes PGF-Net to address efficiency and interpretability issues in multimodal sentiment analysis |
MAE multimodal |
|
|
| 4 |
CuMoLoS-MAE: A Masked Autoencoder for Remote Sensing Data Reconstruction |
Proposes CuMoLoS-MAE to address uncertainty in remote sensing data reconstruction |
masked autoencoder MAE |
|
|
| 5 |
Aura-CAPTCHA: A Reinforcement Learning and GAN-Enhanced Multi-Modal CAPTCHA System |
Proposes Aura-CAPTCHA to address how easily traditional CAPTCHAs are broken |
reinforcement learning large language model |
|
|
| 6 |
Generative AI Against Poaching: Latent Composite Flow Matching for Wildlife Conservation |
Proposes a generative AI method to combat poaching |
flow matching spatiotemporal |
|
|
| 7 |
Synthetic Adaptive Guided Embeddings (SAGE): A Novel Knowledge Distillation Method |
Proposes SAGE to address the efficiency and generalization problems of traditional distillation methods |
teacher-student distillation |
|
|
| 8 |
Universal Reinforcement Learning in Coalgebras: Asynchronous Stochastic Computation via Conduction |
Proposes universal reinforcement learning to address asynchronous stochastic computation |
reinforcement learning |
|
|
| 9 |
Kernel-based Equalized Odds: A Quantification of Accuracy-Fairness Trade-off in Fair Representation Learning |
Proposes a kernel-based equalized odds criterion to quantify the accuracy-fairness trade-off in fair representation learning |
representation learning |
|
|
| 10 |
Graph Structure Learning with Temporal Graph Information Bottleneck for Inductive Representation Learning |
Proposes the GTGIB framework to address node representation in dynamic networks |
representation learning |
|
|
| 11 |
Source-Guided Flow Matching |
Proposes a source-guided flow matching framework to improve guidance in generative models |
flow matching |
|
|
| 12 |
Federated Distillation on Edge Devices: Efficient Client-Side Filtering for Non-IID Data |
Proposes EdgeFD to address non-IID data filtering on edge devices |
distillation |
|
|
| 13 |
HERAKLES: Hierarchical Skill Compilation for Open-ended LLM Agents |
Proposes the HERAKLES framework to address goal learning for open-ended LLM agents |
reinforcement learning large language model |
|
|
| 14 |
A Comparative Evaluation of Teacher-Guided Reinforcement Learning Techniques for Autonomous Cyber Operations |
Proposes teacher-guided reinforcement learning techniques to improve the efficiency of autonomous cyber operations |
reinforcement learning |
|
|
| 15 |
Beyond ReLU: Chebyshev-DQN for Enhanced Deep Q-Networks |
Proposes Chebyshev-DQN to improve the performance of deep Q-networks |
reinforcement learning deep reinforcement learning |
|
|