cs.LG (2024-06-13)

📊 30 papers in total | 🔗 3 with code

🎯 Navigation by interest area

Pillar 2: RL & Architecture (15 🔗1) · Pillar 9: Embodied Foundation Models (13 🔗2) · Pillar 8: Physics-based Animation (2)

🔬 Pillar 2: RL & Architecture (15 papers)

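| # | Title | One-sentence summary | Tags | 🔗 |
|---|-------|----------------------|------|----|
| 1 | DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning | DiffPoGAN: an offline RL method that combines diffusion models with GANs to address extrapolation error. | reinforcement learning, offline RL, offline reinforcement learning | |
| 2 | Is Value Learning Really the Main Bottleneck in Offline RL? | A study of bottlenecks in offline RL: policy extraction and generalization, not value learning alone, are the key factors. | reinforcement learning, policy learning, offline RL | |
| 3 | CUER: Corrected Uniform Experience Replay for Off-Policy Continuous Deep Reinforcement Learning Algorithms | Proposes the CUER algorithm, which uses corrected uniform experience replay to improve off-policy continuous-control deep RL performance. | reinforcement learning, deep reinforcement learning | |
| 4 | A Dual Approach to Imitation Learning from Observations with Offline Datasets | DILO: a dual approach to imitation learning from observations using offline datasets. | policy learning, offline RL, imitation learning | |
| 5 | Cognitively Inspired Energy-Based World Models | Proposes energy-based world models (EBWM) that mimic human cognition to improve the reasoning and planning capabilities of world models. | world model, large language model | |
| 6 | Online Bandit Learning with Offline Preference Data for Improved RLHF | Proposes the warmPref-PS algorithm, which uses offline preference data to improve RLHF. | reinforcement learning, RLHF | |
| 7 | Q-S5: Towards Quantized State Space Models | Q-S5: a study of quantized state space models for edge deployment. | SSM, state space model | |
| 8 | CIMRL: Combining IMitation and Reinforcement Learning for Safe Autonomous Driving | CIMRL: a safe autonomous driving approach that combines imitation learning and reinforcement learning. | reinforcement learning, imitation learning | |
| 9 | You Don't Need Domain-Specific Data Augmentations When Scaling Self-Supervised Learning | In large-scale self-supervised learning, cropping alone as data augmentation is enough to reach SOTA performance. | MAE, foundation model | |
| 10 | Federated Contrastive Learning for Personalized Semantic Communication | Proposes a federated contrastive learning framework for personalized semantic communication, addressing semantic imbalance under heterogeneous data. | contrastive learning | |
| 11 | XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning | Proposes XLand-100B, a large-scale dataset for improving generalization in in-context reinforcement learning. | reinforcement learning | |
| 12 | Current applications and potential future directions of reinforcement learning-based Digital Twins in agriculture | Survey: applications and future directions of reinforcement-learning-driven digital twins in agriculture. | reinforcement learning | |
| 13 | Introducing Diminutive Causal Structure into Graph Representation Learning | Proposes a graph representation learning method based on diminutive causal structures, improving GNN performance on complex graph data. | representation learning | |
| 14 | Hadamard Representations: Augmenting Hyperbolic Tangents in RL | Proposes Hadamard representations to augment hyperbolic tangent activations in RL, mitigating the dead-neuron problem. | reinforcement learning, PPO | |
| 15 | T-JEPA: A Joint-Embedding Predictive Architecture for Trajectory Similarity Computation | Proposes T-JEPA, which improves trajectory similarity computation through a joint-embedding predictive architecture. | representation learning, contrastive learning | |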

🔬 Pillar 9: Embodied Foundation Models (13 papers)

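| # | Title | One-sentence summary | Tags | 🔗 |
|---|-------|----------------------|------|----|
| 16 | FairCoT: Enhancing Fairness in Text-to-Image Generation via Chain of Thought Reasoning with Multimodal Large Language Models | Proposes FairCoT to address fairness issues in text-to-image generation. | large language model, multimodal, chain-of-thought | |
| 17 | Advanced Multimodal Deep Learning Architecture for Image-Text Matching | Proposes an advanced multimodal deep learning architecture to improve the accuracy and efficiency of image-text matching. | multimodal | |
| 18 | DrivAerNet++: A Large-Scale Multimodal Car Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks | DrivAerNet++: a large-scale multimodal car dataset supporting CFD simulations and deep learning benchmarks. | multimodal | |
| 19 | Large Language Model as a Teacher for Zero-shot Tagging at Extreme Scales | LMTX: uses a large language model as a teacher for zero-shot tagging at extreme scales. | large language model | |
| 20 | Weakly-supervised anomaly detection for multimodal data distributions | Proposes a weakly supervised anomaly detection method based on a variational mixture model, addressing anomaly detection under multimodal data distributions. | multimodal | |
| 21 | Can Synthetic Audio From Generative Foundation Models Assist Audio Recognition and Speech Modeling? | Uses audio synthesized by generative models to assist audio recognition and speech modeling. | foundation model | |
| 22 | Reflecting on the State of Rehearsal-free Continual Learning with Pretrained Models | Reveals the actual state and limitations of rehearsal-free continual learning with pretrained models. | foundation model | |
| 23 | Towards an Improved Understanding and Utilization of Maximum Manifold Capacity Representations | Deepens the understanding of maximum manifold capacity representations and optimizes them, improving multi-view self-supervised learning performance. | multimodal | |
| 24 | Separations in the Representational Capabilities of Transformers and Recurrent Architectures | Compares the representational capabilities of Transformers and RNNs, revealing the relationship between model size and task complexity. | foundation model | |
| 25 | GuardAgent: Safeguard LLM Agents by a Guard Agent via Knowledge-Enabled Reasoning | GuardAgent: safeguards LLM agents with a guard agent that uses knowledge-enabled reasoning. | large language model | |
| 26 | Towards Effective Evaluations and Comparisons for LLM Unlearning Methods | An effective evaluation and comparison framework for LLM unlearning methods, improving the robustness and practicality of evaluation. | large language model | |
| 27 | State-Space Modeling in Long Sequence Processing: A Survey on Recurrence in the Transformer Era | Survey: state-space models and recurrence in long-sequence modeling in the Transformer era. | large language model | |
| 28 | LLM-based Knowledge Pruning for Time Series Data Analytics on Edge-computing Devices | Proposes an LLM-based knowledge pruning method for time series analytics on edge devices, improving performance on resource-constrained devices. | large language model | |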

🔬 Pillar 8: Physics-based Animation (2 papers)

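| # | Title | One-sentence summary | Tags | 🔗 |
|---|-------|----------------------|------|----|
| 29 | Financial Assets Dependency Prediction Utilizing Spatiotemporal Patterns | Proposes the Asset Dependency Neural Network (ADNN), which uses spatiotemporal patterns to predict dependencies between financial assets. | spatiotemporal | |
| 30 | Generalizable Implicit Neural Representation As a Universal Spatiotemporal Traffic Data Learner | Proposes a generalizable spatiotemporal implicit neural representation for unified modeling of multi-scale traffic data. | spatiotemporal | |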
