cs.LG (2024-08-12)

📊 8 papers in total | 🔗 1 with code

🎯 Interest Area Navigation

Pillar 2: RL Algorithms and Architecture (3 🔗1) · Pillar 8: Physics-based Animation (2) · Pillar 9: Embodied Foundation Models (2) · Pillar 1: Robot Control (1)

🔬 Pillar 2: RL Algorithms and Architecture (3 papers)

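# | Title | One-sentence summary | Tags
1 | Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment | Proposes CLAIR and APO to make LLM alignment training more contrastive and controllable, significantly improving model performance. | contrastive learning, large language model
2 | MetMamba: Regional Weather Forecasting with Spatial-Temporal Mamba Model | MetMamba: regional weather forecasting based on a spatial-temporal Mamba model. | Mamba
3 | A Unified Manifold Similarity Measure Enhancing Few-Shot, Transfer, and Reinforcement Learning in Manifold-Distributed Datasets | Proposes a unified manifold similarity measure that enhances few-shot, transfer, and reinforcement learning on manifold-distributed datasets. | reinforcement learning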

🔬 Pillar 8: Physics-based Animation (2 papers)

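# | Title | One-sentence summary | Tags
4 | Cluster-Segregate-Perturb (CSP): A Model-agnostic Explainability Pipeline for Spatiotemporal Land Surface Forecasting Models | Proposes the CSP explainability pipeline for understanding spatiotemporal land surface forecasting models, revealing the relationship between meteorological variables and land surface evolution. | spatiotemporal
5 | Parameters Inference for Nonlinear Wave Equations with Markovian Switching | Proposes a parameter inference method, based on discrete sparse Bayesian learning, for nonlinear wave equations with Markovian switching. | spatiotemporal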

🔬 Pillar 9: Embodied Foundation Models (2 papers)

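# | Title | One-sentence summary | Tags
6 | Neural Networks as Spin Models: From Glass to Hidden Order Through Training | Maps neural networks to spin models, revealing a transition from a spin-glass state to a hidden-order state during training. | PaLM-E
7 | LUT Tensor Core: A Software-Hardware Co-Design for LUT-Based Low-Bit LLM Inference | Proposes LUT Tensor Core, a software-hardware co-design that accelerates LUT-based inference for low-bit LLMs. | large language model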

🔬 Pillar 1: Robot Control (1 paper)

# | Title | One-sentence summary | Tags
8 | Targeted Deep Learning System Boundary Testing | Mimicry: a fine-grained, targeted boundary testing method for deep learning systems. | manipulation
