cs.LG(2025-07-15)

📊 共 2 篇论文

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (1) 支柱九:具身大模型 (Embodied Foundation Models) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (1 篇)

#题目一句话要点标签🔗
1 AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air AirLLM:基于扩散策略的自适应LoRA,用于无线环境下的LLM远程微调 PPO diffusion policy classifier-free guidance

🔬 支柱九:具身大模型 (Embodied Foundation Models) (1 篇)

#题目一句话要点标签🔗
2 First-Order Error Matters: Accurate Compensation for Quantized Large Language Models 提出FOEM,通过显式补偿一阶梯度误差,显著提升量化大语言模型精度 large language model

⬅️ 返回 cs.LG 首页 · 🏠 返回主页