cs.LG(2025-02-11)

📊 共 11 篇论文 | 🔗 4 篇有代码

🎯 兴趣领域导航

支柱九:具身大模型 (Embodied Foundation Models) (6 🔗2) 支柱二:RL算法与架构 (RL & Architecture) (5 🔗2)

🔬 支柱九:具身大模型 (Embodied Foundation Models) (6 篇)

#题目一句话要点标签🔗
1 ADMN: A Layer-Wise Adaptive Multimodal Network for Dynamic Input Noise and Compute Resources 提出ADMN,一种层级自适应多模态网络,解决动态输入噪声和计算资源约束问题。 multimodal
2 DarwinLM: Evolutionary Structured Pruning of Large Language Models DarwinLM:通过演化结构化剪枝压缩大型语言模型,提升推理效率。 large language model
3 Automated Capability Discovery via Foundation Model Self-Exploration 提出自动化能力发现框架以评估基础模型的多样化能力 foundation model
4 CIRCUIT: A Benchmark for Circuit Interpretation and Reasoning Capabilities of LLMs 提出CIRCUIT基准数据集,评估LLM在电路推理方面的能力 large language model
5 Towards Efficient Optimizer Design for LLM via Structured Fisher Approximation with a Low-Rank Extension 提出基于结构化Fisher近似和低秩扩展的高效LLM优化器设计方法 large language model
6 EvoFlow: Evolving Diverse Agentic Workflows On The Fly EvoFlow:动态演化多样化Agent工作流,实现异构LLM协同。 large language model

🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)

#题目一句话要点标签🔗
7 DrugImproverGPT: A Large Language Model for Drug Optimization with Fine-Tuning via Structured Policy Optimization DrugImproverGPT:基于结构化策略优化微调的大语言模型,用于药物优化 reinforcement learning large language model
8 Towards Training One-Step Diffusion Models Without Distillation 提出无需教师模型监督的单步扩散模型训练方法,性能超越蒸馏方法。 distillation
9 Advancing Autonomous VLM Agents via Variational Subgoal-Conditioned Reinforcement Learning 提出VSC-RL,通过变分子目标强化学习提升自主VLM Agent在复杂任务中的效率。 reinforcement learning
10 LASP-2: Rethinking Sequence Parallelism for Linear Attention and Its Hybrid LASP-2:重新设计线性注意力序列并行,提升超长序列训练效率 linear attention
11 Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol 针对离线策略评估的模型选择,提出新算法与实验协议。 reinforcement learning offline reinforcement learning

⬅️ 返回 cs.LG 首页 · 🏠 返回主页