cs.LG(2025-05-11)

📊 共 8 篇论文 | 🔗 3 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (4) 支柱九:具身大模型 (Embodied Foundation Models) (4 🔗3)

🔬 支柱二:RL算法与架构 (RL & Architecture) (4 篇)

#题目一句话要点标签🔗
1 Knowledge Distillation for Enhancing Walmart E-commerce Search Relevance Using Large Language Models 提出知识蒸馏方法以提升沃尔玛电商搜索相关性 distillation large language model
2 Multi-Objective-Guided Discrete Flow Matching for Controllable Biological Sequence Design 提出多目标引导离散流匹配以解决可控生物序列设计问题 flow matching
3 Scaling Laws and Representation Learning in Simple Hierarchical Languages: Transformers vs. Convolutional Architectures 提出层次语言模型的扩展理论以比较卷积与变换器架构 representation learning
4 Reinforcement Learning (RL) Meets Urban Climate Modeling: Investigating the Efficacy and Impacts of RL-Based HVAC Control 提出基于强化学习的HVAC控制框架以应对城市气候建模挑战 reinforcement learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (4 篇)

#题目一句话要点标签🔗
5 GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance 提出GuidedQuant以解决大语言模型量化中的特征重要性问题 large language model
6 Turning LLM Activations Quantization-Friendly 提出量化友好的激活方法以降低LLM服务成本 large language model
7 MMiC: Mitigating Modality Incompleteness in Clustered Federated Learning 提出MMiC框架以解决多模态联邦学习中的模态不完整问题 multimodal
8 Benign Samples Matter! Fine-tuning On Outlier Benign Samples Severely Breaks Safety 提出Self-Inf-N以识别良性样本中的异常点,提升LLM安全性 large language model

⬅️ 返回 cs.LG 首页 · 🏠 返回主页