cs.LG(2024-08-14)
📊 共 12 篇论文 | 🔗 2 篇有代码
🎯 兴趣领域导航
支柱二:RL算法与架构 (RL & Architecture) (5)
支柱九:具身大模型 (Embodied Foundation Models) (4 🔗2)
支柱八:物理动画 (Physics-based Animation) (3)
🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 1 | PolyCL: Contrastive Learning for Polymer Representation Learning via Explicit and Implicit Augmentations | PolyCL:通过显式和隐式增强实现聚合物表征学习的对比学习 | representation learning contrastive learning | ||
| 2 | A Nested Graph Reinforcement Learning-based Decision-making Strategy for Eco-platooning | 提出基于嵌套图强化学习的决策策略,解决大规模混合车队中的拥堵和能耗问题。 | reinforcement learning penetration | ||
| 3 | Off-Policy Reinforcement Learning with High Dimensional Reward | 提出基于高维回报的离线强化学习算法,解决传统方法难以处理的问题 | reinforcement learning DRL | ||
| 4 | Adaptive Behavioral AI: Reinforcement Learning to Enhance Pharmacy Services | 提出自适应行为AI,利用强化学习优化药房服务 | reinforcement learning | ||
| 5 | FedQUIT: On-Device Federated Unlearning via a Quasi-Competent Virtual Teacher | FedQUIT:通过准胜任虚拟教师实现设备端联邦遗忘 | teacher-student distillation |
🔬 支柱九:具身大模型 (Embodied Foundation Models) (4 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 6 | ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area | 提出ChemVLM:化学领域的多模态大型语言模型,提升化学信息理解能力 | large language model multimodal | ✅ | |
| 7 | MedTsLLM: Leveraging LLMs for Multimodal Medical Time Series Analysis | MedTsLLM:利用大型语言模型进行多模态医学时间序列分析 | large language model multimodal | ||
| 8 | Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities | 综述模型合并技术:系统性回顾方法、理论、应用及未来方向 | large language model multimodal | ✅ | |
| 9 | Nonlocal Attention Operator: Materializing Hidden Knowledge Towards Interpretable Physics Discovery | 提出非局部注意力算子(NAO)用于可解释的物理发现,解决逆PDE问题。 | foundation model |
🔬 支柱八:物理动画 (Physics-based Animation) (3 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 10 | Time-inversion of spatiotemporal beam dynamics using uncertainty-aware latent evolution reversal | 提出一种基于不确定性感知潜在演化逆转的时空束动力学时间反演模型 | spatiotemporal | ||
| 11 | Learning-Augmented Competitive Algorithms for Spatiotemporal Online Allocation with Deadline Constraints | 提出学习增强竞争算法以解决时空在线分配问题 | spatiotemporal | ||
| 12 | Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models | 提出基于扩散模型的无监督盲解混响与房间声学估计方法BUDDy | PULSE |