cs.LG(2025-01-13)

📊 共 17 篇论文

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (10) 支柱九:具身大模型 (Embodied Foundation Models) (5) 支柱一:机器人控制 (Robot Control) (1) 支柱八:物理动画 (Physics-based Animation) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (10 篇)

#题目一句话要点标签🔗
1 Foundation Models at Work: Fine-Tuning for Fairness in Algorithmic Hiring 提出AutoRefine,通过强化学习微调基础模型,解决算法招聘中的公平性问题。 reinforcement learning large language model foundation model
2 Improving DeFi Accessibility through Efficient Liquidity Provisioning with Deep Reinforcement Learning 提出基于深度强化学习的流动性提供策略,优化Uniswap v3的DeFi可访问性。 reinforcement learning deep reinforcement learning DRL
3 Performance Optimization of Ratings-Based Reinforcement Learning 探索超参数优化方法,提升基于人类评价的强化学习性能 reinforcement learning policy learning
4 Subject Representation Learning from EEG using Graph Convolutional Variational Autoencoders 提出基于图卷积变分自编码器的GC-VASE模型,用于脑电信号的个体表征学习。 representation learning contrastive learning
5 Enhancing Online Reinforcement Learning with Meta-Learned Objective from Offline Data 提出GILD,通过元学习离线数据目标函数,提升在线强化学习在稀疏奖励环境下的性能。 reinforcement learning imitation learning
6 Dataset Distillation as Pushforward Optimal Quantization 将数据集蒸馏重构为推前最优量化问题,提升ImageNet性能。 distillation
7 Human-Inspired Multi-Level Reinforcement Learning 提出一种受人类启发的多层次强化学习方法,提升决策优化能力 reinforcement learning
8 TIMRL: A Novel Meta-Reinforcement Learning Framework for Non-Stationary and Multi-Task Environments 提出基于高斯混合模型和Transformer的元强化学习框架,解决非平稳多任务环境下的样本效率问题。 reinforcement learning
9 Combining LLM decision and RL action selection to improve RL policy for adaptive interventions 结合LLM决策与RL动作选择,提升自适应干预的RL策略 reinforcement learning large language model
10 ACCon: Angle-Compensated Contrastive Regularizer for Deep Regression 提出角度补偿对比正则化方法ACCon,提升深度回归任务的性能。 representation learning contrastive learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (5 篇)

#题目一句话要点标签🔗
11 Multimodal semantic retrieval for product search 提出多模态语义检索方法,提升电商产品搜索的购买召回率和相关性准确度。 multimodal
12 LLM360 K2: Building a 65B 360-Open-Source Large Language Model from Scratch LLM360 K2:从零构建650亿参数全开源大语言模型,超越LLaMA large language model
13 Explore the Use of Time Series Foundation Model for Car-Following Behavior Analysis 利用时间序列基础模型分析车辆跟驰行为,提升预测精度和泛化性 foundation model
14 ML Mule: Mobile-Driven Context-Aware Collaborative Learning 提出ML Mule,利用移动设备进行情境感知的协同学习,解决传统联邦学习的局限性。 large language model
15 Wavelet Meets Adam: Compressing Gradients for Memory-Efficient Training 提出梯度小波变换(GWT)以压缩Adam优化器状态,实现大模型高效训练。 large language model

🔬 支柱一:机器人控制 (Robot Control) (1 篇)

#题目一句话要点标签🔗
16 Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards 揭示并缓解基于投票的LLM排行榜的对抗性操纵风险 manipulation large language model

🔬 支柱八:物理动画 (Physics-based Animation) (1 篇)

#题目一句话要点标签🔗
17 Dynami-CAL GraphNet: A Physics-Informed Graph Neural Network Conserving Linear and Angular Momentum for Dynamical Systems Dynami-CAL GraphNet:一种用于动力学系统的、满足线性和角动量守恒的物理信息图神经网络 PULSE

⬅️ 返回 cs.LG 首页 · 🏠 返回主页