cs.LG(2025-12-25)

📊 共 11 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (5) 支柱九:具身大模型 (Embodied Foundation Models) (4 🔗1) 支柱四:生成式动作 (Generative Motion) (1) 支柱八:物理动画 (Physics-based Animation) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)

#题目一句话要点标签🔗
1 Rethinking Output Alignment For 1-bit Post-Training Quantization of Large Language Models 提出一种面向大语言模型1比特后训练量化的输出对齐方法,提升量化性能。 distillation large language model
2 Horizon Reduction as Information Loss in Offline Reinforcement Learning 揭示离线强化学习中Horizon Reduction导致信息损失的根本原因与结构性失效模式 reinforcement learning offline RL offline reinforcement learning
3 Videos are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations 提出BCV-LR框架,通过视频中的潜在表征实现高效的行为克隆,解决交互样本不足的问题。 reinforcement learning policy learning imitation learning
4 Global-Graph Guided and Local-Graph Weighted Contrastive Learning for Unified Clustering on Incomplete and Noise Multi-View Data 提出全局图引导和局部图加权对比学习框架,解决不完整和噪声多视图聚类问题 contrastive learning
5 AVP-Fusion: Adaptive Multi-Modal Fusion and Contrastive Learning for Two-Stage Antiviral Peptide Identification AVP-Fusion:融合自适应多模态和对比学习的两阶段抗病毒肽识别方法 contrastive learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (4 篇)

#题目一句话要点标签🔗
6 Physic-HM: Restoring Physical Generative Logic in Multimodal Anomaly Detection via Hierarchical Modulation Physic-HM:通过层级调制恢复物理生成逻辑的多模态异常检测 multimodal
7 RefineBridge: Generative Bridge Models Improve Financial Forecasting by Foundation Models RefineBridge:生成桥模型通过基础模型改进金融预测 foundation model
8 nncase: An End-to-End Compiler for Efficient LLM Deployment on Heterogeneous Storage Architectures nncase:用于异构存储架构上高效LLM部署的端到端编译器 large language model
9 Hierarchical Stacking Optimization Using Dirichlet's Process (SoDip): Towards Accelerated Design for Graft Polymerization 提出SoDip框架以解决辐射诱导接枝聚合的可重复性问题 multimodal

🔬 支柱四:生成式动作 (Generative Motion) (1 篇)

#题目一句话要点标签🔗
10 Co-GRPO: Co-Optimized Group Relative Policy Optimization for Masked Diffusion Model 提出Co-GRPO,协同优化Masked Diffusion Model及其解码策略,提升生成质量。 MDM

🔬 支柱八:物理动画 (Physics-based Animation) (1 篇)

#题目一句话要点标签🔗
11 RIPCN: A Road Impedance Principal Component Network for Probabilistic Traffic Flow Forecasting RIPCN:一种用于概率交通流预测的道路阻抗主成分网络 spatiotemporal

⬅️ 返回 cs.LG 首页 · 🏠 返回主页