cs.LG(2024-11-28)

📊 共 10 篇论文 | 🔗 4 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (6 🔗1) 支柱九:具身大模型 (Embodied Foundation Models) (3 🔗2) 支柱五:交互与反应 (Interaction & Reaction) (1 🔗1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (6 篇)

#题目一句话要点标签🔗
1 TEA: Trajectory Encoding Augmentation for Robust and Transferable Policies in Offline Reinforcement Learning 提出轨迹编码增强TEA,提升离线强化学习策略在未知环境中的泛化性 reinforcement learning offline reinforcement learning
2 Puzzle: Distillation-Based NAS for Inference-Optimized LLMs Puzzle:基于蒸馏的NAS优化LLM推理,实现单H100 GPU部署 distillation large language model
3 Pre-Training Graph Contrastive Masked Autoencoders are Strong Distillers for EEG 提出EEG-DisGCMAE,解决低密度脑电图数据下的知识迁移与蒸馏问题 masked autoencoder distillation
4 MSEMG: Surface Electromyography Denoising with a Mamba-based Efficient Network 提出MSEMG:一种基于Mamba的高效网络,用于表面肌电信号降噪。 Mamba state space model
5 LD-EnSF: Synergizing Latent Dynamics with Ensemble Score Filters for Fast Data Assimilation with Sparse Observations LD-EnSF:融合潜在动力学与集成评分滤波,加速稀疏观测下的数据同化 latent dynamics
6 ICLERB: In-Context Learning Embedding and Reranker Benchmark 提出ICLERB基准测试与RLRAIF算法,优化上下文学习的检索增强生成。 reinforcement learning large language model

🔬 支柱九:具身大模型 (Embodied Foundation Models) (3 篇)

#题目一句话要点标签🔗
7 Parameter-Efficient Transfer Learning for Music Foundation Models 针对音乐基础模型,提出参数高效的迁移学习方法,提升下游任务性能。 foundation model
8 On the Role of Discrete Representation in Sparse Mixture of Experts 提出VQMoE,利用向量量化离散表示解决稀疏专家混合模型中的路由不一致问题。 large language model
9 Scaling Particle Collision Data Analysis 提出BBT-Neutron,一种用于大规模粒子碰撞数据分析的通用架构。 large language model

🔬 支柱五:交互与反应 (Interaction & Reaction) (1 篇)

#题目一句话要点标签🔗
10 A Lean Dataset for International Math Olympiad: Small Steps towards Writing Math Proofs for Hard Problems 构建国际数学奥赛Lean证明数据集,推动AI自动数学证明研究 IMoS

⬅️ 返回 cs.LG 首页 · 🏠 返回主页