cs.LG(2025-05-14)
📊 共 16 篇论文 | 🔗 4 篇有代码
🎯 兴趣领域导航
支柱九:具身大模型 (Embodied Foundation Models) (9 🔗3)
支柱二:RL算法与架构 (RL & Architecture) (5 🔗1)
支柱八:物理动画 (Physics-based Animation) (2)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (9 篇)
🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 10 | A Multi-Task Foundation Model for Wireless Channel Representation Using Contrastive and Masked Autoencoder Learning | 提出ContraWiMAE,一种用于无线信道表征的对比掩码自编码器基础模型 | representation learning masked autoencoder contrastive learning | ||
| 11 | Adversarial Attack on Large Language Models using Exponentiated Gradient Descent | 提出基于指数梯度下降的对抗攻击方法,有效破解大型语言模型。 | reinforcement learning RLHF large language model | ✅ | |
| 12 | Reinforcement Learning for Individual Optimal Policy from Heterogeneous Data | 提出P4L算法,解决异构数据下的个体最优离线强化学习问题 | reinforcement learning policy learning offline RL | ||
| 13 | Community-based Multi-Agent Reinforcement Learning with Transfer and Active Exploration | 提出基于社群的多智能体强化学习框架,实现知识迁移和主动探索 | reinforcement learning | ||
| 14 | Neural Multivariate Regression: Qualitative Insights from the Unconstrained Feature Model | 利用无约束特征模型分析神经网络多元回归,为模仿学习等任务提供设计指导。 | reinforcement learning imitation learning |
🔬 支柱八:物理动画 (Physics-based Animation) (2 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 15 | Neural models for prediction of spatially patterned phase transitions: methods and challenges | 利用神经网络预测空间模式相变,揭示早期预警信号的局限与泛化能力。 | spatiotemporal | ||
| 16 | Generating time-consistent dynamics with discriminator-guided image diffusion models | 提出时间一致性判别器,引导预训练图像扩散模型生成时序动态 | spatiotemporal |