cs.LG(2025-01-10)
📊 共 11 篇论文
🎯 兴趣领域导航
支柱二:RL算法与架构 (RL & Architecture) (5)
支柱九:具身大模型 (Embodied Foundation Models) (4)
支柱一:机器人控制 (Robot Control) (1)
支柱四:生成式动作 (Generative Motion) (1)
🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing | 提出基于序贯数据预处理的因果公平强化学习框架,解决多阶段决策中的偏差问题。 | reinforcement learning policy learning | ||
| 2 | A monthly sub-national Harmonized Food Insecurity Dataset for comprehensive analysis and predictive modeling | 构建月度次国家级统一粮食不安全数据集,用于综合分析和预测建模 | predictive model | ||
| 3 | Investigating the Impact of Observation Space Design Choices On Training Reinforcement Learning Solutions for Spacecraft Problems | 研究观测空间设计对强化学习解决航天器问题的性能影响 | reinforcement learning | ||
| 4 | From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training | 提出连续时间扩散采样器以提高神经随机微分方程训练效率 | reinforcement learning differentiable simulation | ||
| 5 | Element-wise Attention Is All You Need | 提出元素级注意力机制以解决自注意力复杂性问题 | state space model linear attention |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 6 | CognoSpeak: an automatic, remote assessment of early cognitive decline in real-world conversational speech | CognoSpeak:一种基于真实对话语音的早期认知衰退自动远程评估系统 | large language model multimodal | ||
| 7 | Using Pre-trained LLMs for Multivariate Time Series Forecasting | 提出基于LLM的多元时间序列预测方法,通过新颖的patching策略实现SOTA水平。 | large language model | ||
| 8 | Aggregating Low Rank Adapters in Federated Fine-tuning | 提出一种新的联邦微调中低秩适配器聚合方法,提升GLUE基准性能 | large language model | ||
| 9 | Model Inversion in Split Learning for Personalized LLMs: New Insights from Information Bottleneck Theory | 针对个性化LLM的分裂学习中模型反演攻击研究,基于信息瓶颈理论提出新型攻击方法。 | large language model |
🔬 支柱一:机器人控制 (Robot Control) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 10 | Real-Time Decision-Making for Digital Twin in Additive Manufacturing with Model Predictive Control using Time-Series Deep Neural Networks | 提出基于时序深度神经网络的MPC数字孪生方法,用于增材制造实时决策。 | MPC model predictive control |
🔬 支柱四:生成式动作 (Generative Motion) (1 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 11 | Do we actually understand the impact of renewables on electricity prices? A causal inference approach | 提出局部部分线性双重机器学习方法,量化可再生能源对电价的因果影响 | penetration |