cs.LG(2025-05-28)

📊 9 papers in total | 🔗 1 with code

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (5 🔗1) · Pillar 2: RL & Architecture (3) · Pillar 1: Robot Control (1)

🔬 Pillar 9: Embodied Foundation Models (5 papers)

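| # | Title | One-line summary | Tags | 🔗 |
|---|---|---|---|---|
| 1 | SlimLLM: Accurate Structured Pruning for Large Language Models | Proposes SlimLLM for structured pruning of large language models | large language model | |
| 2 | Revisiting Bayesian Model Averaging in the Era of Foundation Models | Proposes linear classifiers based on Bayesian model averaging to improve classification performance | foundation model | |
| 3 | Investigating the effectiveness of multimodal data in forecasting SARS-COV-2 case surges | Proposes a multimodal data fusion method to improve forecasting of SARS-COV-2 case surges | multimodal | |
| 4 | SimuGen: Multi-modal Agentic Framework for Constructing Block Diagram-Based Simulation Models | Proposes SimuGen for generating Simulink models | large language model, multimodal | |
| 5 | FALCON: An ML Framework for Fully Automated Layout-Constrained Analog Circuit Design | Proposes the FALCON framework for fully automated analog circuit design | foundation model | |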

🔬 Pillar 2: RL & Architecture (3 papers)

| # | Title | One-line summary | Tags | 🔗 |
|---|---|---|---|---|
| 6 | Reinforcement Learning for Out-of-Distribution Reasoning in LLMs: An Empirical Study on Diagnosis-Related Group Coding | Proposes DRG-Sapphire for DRG coding from clinical notes | reinforcement learning, large language model | |
| 7 | SDPO: Importance-Sampled Direct Preference Optimization for Stable Diffusion Training | Proposes SDPO to address bias and instability in diffusion model training | preference learning, DPO, direct preference optimization | |
| 8 | A Provable Approach for End-to-End Safe Reinforcement Learning | Proposes a provably lifelong-safe reinforcement learning method to address safety | reinforcement learning | |

🔬 Pillar 1: Robot Control (1 paper)

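| # | Title | One-line summary | Tags | 🔗 |
|---|---|---|---|---|
| 9 | Practical Adversarial Attacks on Stochastic Bandits via Fake Data Injection | Proposes a fake data injection model for adversarial attacks on stochastic bandits | manipulation | |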
