cs.LG（2024-07-18）

📊 共 22 篇论文 | 🔗 4 篇有代码

🎯 兴趣领域导航

支柱二：RL算法与架构 (RL & Architecture) (12 🔗2) 支柱九：具身大模型 (Embodied Foundation Models) (6 🔗2) 支柱八：物理动画 (Physics-based Animation) (2) 支柱一：机器人控制 (Robot Control) (1) 支柱五：交互与反应 (Interaction & Reaction) (1)

🔬 支柱二：RL算法与架构 (RL & Architecture) (12 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Random Latent Exploration for Deep Reinforcement Learning	提出随机潜在空间探索(RLE)算法，提升深度强化学习的探索效率。	reinforcement learning deep reinforcement learning	✅
2	Analyzing and Bridging the Gap between Maximizing Total Reward and Discounted Reward in Deep Reinforcement Learning	提出两种目标对齐方法，解决深度强化学习中总回报与折扣回报差异问题	reinforcement learning deep reinforcement learning
3	Event-Triggered Reinforcement Learning Based Joint Resource Allocation for Ultra-Reliable Low-Latency V2X Communications	提出基于事件触发强化学习的联合资源分配方案，用于超可靠低延迟V2X通信。	reinforcement learning deep reinforcement learning DRL
4	Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review	综述：基于强化学习的扩散模型微调方法，优化生物序列生成任务	reinforcement learning PPO	✅
5	Model-based Policy Optimization using Symbolic World Model	提出基于符号世界模型的策略优化方法，提升机器人学习的样本效率	reinforcement learning world model
6	Instance Selection for Dynamic Algorithm Configuration with Reinforcement Learning: Improving Generalization	提出基于强化学习动态算法配置的实例选择方法，提升泛化性能	reinforcement learning deep reinforcement learning
7	Reconfigurable Intelligent Surface Aided Vehicular Edge Computing: Joint Phase-shift Optimization and Multi-User Power Allocation	提出基于RIS辅助的VEC系统，利用DRL优化相移和功率分配，提升车辆边缘计算性能。	reinforcement learning deep reinforcement learning DRL
8	A reinforcement learning strategy to automate and accelerate h/p-multigrid solvers	提出基于强化学习的h/p-多重网格求解器自动优化策略，提升求解效率与稳定性	reinforcement learning
9	Optimistic Q-learning for average reward and episodic reinforcement learning	提出乐观Q学习算法以解决平均奖励强化学习中的后悔最小化问题	reinforcement learning
10	Data-Driven Estimation of Conditional Expectations, Application to Optimal Stopping and Reinforcement Learning	提出一种纯数据驱动的条件期望估计方法，并应用于最优停止和强化学习。	reinforcement learning
11	HHGT: Hierarchical Heterogeneous Graph Transformer for Heterogeneous Graph Representation Learning	提出HHGT：一种用于异构图表示学习的分层异构图Transformer模型	representation learning
12	PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods	PG-Rainbow：将分布强化学习融入策略梯度方法，提升Atari游戏性能	reinforcement learning

🔬 支柱九：具身大模型 (Embodied Foundation Models) (6 篇)

#	题目	一句话要点	标签	🔗	⭐
13	NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals	NNsight与NDIF： democratize 开放权重基础模型内部机制的研究	large language model foundation model
14	CogniVoice: Multimodal and Multilingual Fusion Networks for Mild Cognitive Impairment Assessment from Spontaneous Speech	CogniVoice：多模态多语种融合网络用于从语音中评估轻度认知障碍	multimodal
15	A Foundation Model for Soccer	提出足球领域的基础模型，用于预测足球比赛中的后续动作。	foundation model	✅
16	TrialEnroll: Predicting Clinical Trial Enrollment Success with Deep & Cross Network and Large Language Models	TrialEnroll：利用深度交叉网络和LLM增强文本特征预测临床试验招募成功率	large language model
17	Reconstruct the Pruned Model without Any Retraining	提出LIAR框架以无须重训练重构剪枝模型	large language model
18	Integrated Hardware Architecture and Device Placement Search	PHAZE：通过联合搜索硬件架构与设备放置策略优化分布式深度学习训练。	large language model	✅

🔬 支柱八：物理动画 (Physics-based Animation) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
19	A Survey on Differential Privacy for SpatioTemporal Data in Transportation Research	综述时空数据差分隐私在交通研究中的应用与挑战	spatiotemporal
20	Automated and Holistic Co-design of Neural Networks and ASICs for Enabling In-Pixel Intelligence	提出基于贝叶斯优化的神经网络与ASIC协同设计方法，实现像素内智能	PULSE

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
21	Deterministic Trajectory Optimization through Probabilistic Optimal Control	提出基于概率最优控制的确定性轨迹优化算法，提升数值稳定性和收敛速度	trajectory optimization

🔬 支柱五：交互与反应 (Interaction & Reaction) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
22	Privacy-preserving gradient-based fair federated learning	提出一种基于同态加密的、保护隐私的梯度公平联邦学习方案	OMOMO

⬅️ 返回 cs.LG 首页 · 🏠 返回主页