cs.LG（2025-01-24）

📊 共 24 篇论文 | 🔗 4 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (12) 支柱二：RL算法与架构 (RL & Architecture) (11 🔗3) 支柱三：空间感知与语义 (Perception & Semantics) (1 🔗1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (12 篇)

#	题目	一句话要点	标签	🔗	⭐
1	DarkMind: Latent Chain-of-Thought Backdoor in Customized LLMs	提出DarkMind以解决定制LLMs中的潜在后门攻击问题	large language model chain-of-thought
2	LLM4DistReconfig: A Fine-tuned Large Language Model for Power Distribution Network Reconfiguration	提出LLM4DistReconfig，一种用于电力配电网络重构的微调大语言模型	large language model
3	Automated Assignment Grading with Large Language Models: Insights From a Bioinformatics Course	利用大型语言模型自动评分：生物信息学课程的实践与启示	large language model
4	SwiftPrune: Hessian-Free Weight Pruning for Large Language Models	SwiftPrune：一种用于大型语言模型的无Hessian矩阵权重剪枝方法	large language model
5	Argos: Agentic Time-Series Anomaly Detection with Autonomous Rule Generation via Large Language Models	提出Argos以解决云基础设施中的时间序列异常检测问题	large language model
6	Internal Activation Revision: Safeguarding Vision Language Models Without Parameter Update	提出内部激活修正方法，无需参数更新即可提升视觉语言模型的安全性	large language model multimodal
7	Feasible Learning	提出可行学习(FL)范式，提升模型在图像分类、回归和偏好优化等任务中的尾部性能。	large language model
8	The Karp Dataset	提出Karp数据集，用于评估和提升大型语言模型在NP完备性规约证明中的数学推理能力。	large language model
9	Wormhole Memory: A Rubik's Cube for Cross-Dialogue Retrieval	提出虫洞记忆模块，实现跨对话记忆检索，优化LLM记忆管理	large language model
10	Locality-aware Fair Scheduling in LLM Serving	提出 locality-aware 的公平调度算法，提升LLM Serving的吞吐和公平性。	large language model
11	Advances in Temporal Point Processes: Bayesian, Neural, and LLM Approaches	综述时间点过程：贝叶斯、深度学习与大语言模型方法	large language model
12	Humanity's Last Exam	提出人类最后考试基准以评估大型语言模型能力	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (11 篇)

#	题目	一句话要点	标签	🔗	⭐
13	Multimodal Prescriptive Deep Learning	提出多模态处方深度学习框架PNN，用于优化医疗决策。	distillation multimodal
14	TFG-Flow: Training-free Guidance in Multimodal Generative Flow	TFG-Flow：用于多模态生成Flow的免训练引导方法，应用于分子设计。	flow matching foundation model multimodal
15	Coordinating Ride-Pooling with Public Transit using Reward-Guided Conservative Q-Learning: An Offline Training and Online Fine-Tuning Reinforcement Learning Framework	提出基于奖励引导的保守Q学习算法，协调拼车与公共交通，提升多模式交通系统效率。	reinforcement learning CQL conservative q-learning
16	Age and Power Minimization via Meta-Deep Reinforcement Learning in UAV Networks	提出基于元深度强化学习的无人机网络AoI与功耗最小化方案	reinforcement learning deep reinforcement learning
17	Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation	提出基于因果效应估计的动作空间缩减方法，提升深度强化学习探索效率。	reinforcement learning deep reinforcement learning	✅
18	ACT-JEPA: Novel Joint-Embedding Predictive Architecture for Efficient Policy Representation Learning	ACT-JEPA：一种高效策略表示学习的联合嵌入预测架构	imitation learning world model representation learning
19	Fat-to-Thin Policy Optimization: Offline RL with Sparse Policies	提出Fat-to-Thin策略优化算法，解决离线强化学习中稀疏策略学习问题	reinforcement learning offline RL offline reinforcement learning	✅
20	A Deep State Space Model for Rainfall-Runoff Simulations	提出基于S4D-FT的深度状态空间模型，用于提升降雨径流模拟精度。	SSM state space model
21	E-Gen: Leveraging E-Graphs to Improve Continuous Representations of Symbolic Expressions	E-Gen：利用E-图改进符号表达式的连续表示	contrastive learning large language model	✅
22	Reinforcement Learning for Efficient Returns Management	提出基于强化学习的在线多背包问题解决方案，优化零售退货管理效率。	reinforcement learning
23	Bi-directional Curriculum Learning for Graph Anomaly Detection: Dual Focus on Homogeneity and Heterogeneity	提出双向课程学习（BCL）策略，提升图异常检测模型性能。	curriculum learning

🔬 支柱三：空间感知与语义 (Perception & Semantics) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
24	TrajFlow: A Generative Framework for Occupancy Density Estimation Using Normalizing Flows	TrajFlow：利用Normalizing Flows进行动态场景下占据密度估计的生成框架	occupancy grid CHOIS	✅

⬅️ 返回 cs.LG 首页 · 🏠 返回主页