| # | Title | Summary | Keywords | Read |
|---|---|---|---|---|
| 1 | DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning | DiffPoGAN: an offline RL method combining diffusion models with GANs to address extrapolation error. | reinforcement learning, offline RL | |
| 2 | Is Value Learning Really the Main Bottleneck in Offline RL? | A study of offline RL bottlenecks: policy extraction and generalization are the key limits, not value learning alone. | reinforcement learning, policy learning, offline RL | |
| 3 | CUER: Corrected Uniform Experience Replay for Off-Policy Continuous Deep Reinforcement Learning Algorithms | Proposes CUER, which improves off-policy continuous-control deep RL via corrected uniform experience replay. | reinforcement learning, deep reinforcement learning | |
| 4 | A Dual Approach to Imitation Learning from Observations with Offline Datasets | DILO: a dual approach to imitation learning from observations using offline datasets. | policy learning, offline RL, imitation learning | ✅ |
| 5 | Cognitively Inspired Energy-Based World Models | Proposes energy-based world models (EBWM) that mimic human cognition to improve reasoning and planning in world models. | world model, large language model | |
| 6 | Online Bandit Learning with Offline Preference Data for Improved RLHF | Proposes the warmPref-PS algorithm, which leverages offline preference data to improve RLHF. | reinforcement learning, RLHF | |
| 7 | Q-S5: Towards Quantized State Space Models | Q-S5: a study of quantized state space models for edge deployment. | SSM, state space model | |
| 8 | CIMRL: Combining IMitation and Reinforcement Learning for Safe Autonomous Driving | CIMRL: a safe autonomous driving method combining imitation learning with reinforcement learning. | reinforcement learning, imitation learning | |
| 9 | You Don't Need Domain-Specific Data Augmentations When Scaling Self-Supervised Learning | In large-scale self-supervised learning, cropping-only data augmentation is enough to reach SOTA performance. | MAE, foundation model | |
| 10 | Federated Contrastive Learning for Personalized Semantic Communication | Proposes a federated contrastive learning framework for personalized semantic communication, addressing semantic imbalance under heterogeneous data. | contrastive learning | |
| 11 | XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning | Proposes XLand-100B, a large-scale dataset for improving generalization in in-context reinforcement learning. | reinforcement learning | |
| 12 | Current applications and potential future directions of reinforcement learning-based Digital Twins in agriculture | Survey: applications and future directions of reinforcement-learning-based digital twins in agriculture. | reinforcement learning | |
| 13 | Introducing Diminutive Causal Structure into Graph Representation Learning | Proposes a graph representation learning method based on diminutive causal structure, improving GNN performance on complex graph data. | representation learning | |
| 14 | Hadamard Representations: Augmenting Hyperbolic Tangents in RL | Proposes Hadamard representations to augment hyperbolic tangent activations in RL, mitigating the dying-neuron problem. | reinforcement learning, PPO | |
| 15 | T-JEPA: A Joint-Embedding Predictive Architecture for Trajectory Similarity Computation | Proposes T-JEPA, which improves trajectory similarity computation via a joint-embedding predictive architecture. | representation learning, contrastive learning | |