cs.LG (2025-01-24)

📊 24 papers in total | 🔗 4 with code

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (12) | Pillar 2: RL & Architecture (11 🔗3) | Pillar 3: Perception & Semantics (1 🔗1)

🔬 Pillar 9: Embodied Foundation Models (12 papers)

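# | Title | One-line summary | Tags | 🔗
1 | DarkMind: Latent Chain-of-Thought Backdoor in Customized LLMs | Proposes DarkMind, a latent chain-of-thought backdoor attack on customized LLMs | large language model, chain-of-thought
2 | LLM4DistReconfig: A Fine-tuned Large Language Model for Power Distribution Network Reconfiguration | Proposes LLM4DistReconfig, a fine-tuned large language model for power distribution network reconfiguration | large language model
3 | Automated Assignment Grading with Large Language Models: Insights From a Bioinformatics Course | Automated grading with large language models: practice and insights from a bioinformatics course | large language model
4 | SwiftPrune: Hessian-Free Weight Pruning for Large Language Models | SwiftPrune: a Hessian-free weight pruning method for large language models | large language model
5 | Argos: Agentic Time-Series Anomaly Detection with Autonomous Rule Generation via Large Language Models | Proposes Argos for time-series anomaly detection in cloud infrastructure | large language model
6 | Internal Activation Revision: Safeguarding Vision Language Models Without Parameter Update | Proposes internal activation revision, which improves the safety of vision language models without parameter updates | large language model, multimodal
7 | Feasible Learning | Proposes the Feasible Learning (FL) paradigm, which improves tail performance on tasks such as image classification, regression, and preference optimization | large language model
8 | The Karp Dataset | Proposes the Karp dataset for evaluating and improving the mathematical reasoning of large language models on NP-completeness reduction proofs | large language model
9 | Wormhole Memory: A Rubik's Cube for Cross-Dialogue Retrieval | Proposes a wormhole memory module that enables cross-dialogue memory retrieval and improves LLM memory management | large language model
10 | Locality-aware Fair Scheduling in LLM Serving | Proposes a locality-aware fair scheduling algorithm that improves throughput and fairness in LLM serving | large language model
11 | Advances in Temporal Point Processes: Bayesian, Neural, and LLM Approaches | Survey of temporal point processes: Bayesian, deep learning, and large language model approaches | large language model
12 | Humanity's Last Exam | Proposes the Humanity's Last Exam benchmark for evaluating large language model capabilities | large language model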

🔬 Pillar 2: RL & Architecture (11 papers)

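# | Title | One-line summary | Tags | 🔗
13 | Multimodal Prescriptive Deep Learning | Proposes PNN, a multimodal prescriptive deep learning framework for optimizing medical decisions | distillation, multimodal
14 | TFG-Flow: Training-free Guidance in Multimodal Generative Flow | TFG-Flow: a training-free guidance method for multimodal generative flow, applied to molecular design | flow matching, foundation model, multimodal
15 | Coordinating Ride-Pooling with Public Transit using Reward-Guided Conservative Q-Learning: An Offline Training and Online Fine-Tuning Reinforcement Learning Framework | Proposes a reward-guided conservative Q-learning algorithm that coordinates ride-pooling with public transit to improve the efficiency of multimodal transportation systems | reinforcement learning, CQL, conservative q-learning
16 | Age and Power Minimization via Meta-Deep Reinforcement Learning in UAV Networks | Proposes a meta-deep reinforcement learning scheme for minimizing AoI and power consumption in UAV networks | reinforcement learning, deep reinforcement learning
17 | Reducing Action Space for Deep Reinforcement Learning via Causal Effect Estimation | Proposes an action-space reduction method based on causal effect estimation that improves exploration efficiency in deep reinforcement learning | reinforcement learning, deep reinforcement learning
18 | ACT-JEPA: Novel Joint-Embedding Predictive Architecture for Efficient Policy Representation Learning | ACT-JEPA: a joint-embedding predictive architecture for efficient policy representation learning | imitation learning, world model, representation learning
19 | Fat-to-Thin Policy Optimization: Offline RL with Sparse Policies | Proposes the Fat-to-Thin policy optimization algorithm for learning sparse policies in offline reinforcement learning | reinforcement learning, offline RL, offline reinforcement learning
20 | A Deep State Space Model for Rainfall-Runoff Simulations | Proposes a deep state space model based on S4D-FT to improve the accuracy of rainfall-runoff simulation | SSM, state space model
21 | E-Gen: Leveraging E-Graphs to Improve Continuous Representations of Symbolic Expressions | E-Gen: uses e-graphs to improve continuous representations of symbolic expressions | contrastive learning, large language model
22 | Reinforcement Learning for Efficient Returns Management | Proposes a reinforcement learning solution to the online multi-knapsack problem to make retail returns management more efficient | reinforcement learning
23 | Bi-directional Curriculum Learning for Graph Anomaly Detection: Dual Focus on Homogeneity and Heterogeneity | Proposes a bi-directional curriculum learning (BCL) strategy that improves the performance of graph anomaly detection models | curriculum learning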

🔬 Pillar 3: Perception & Semantics (1 paper)

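# | Title | One-line summary | Tags | 🔗
24 | TrajFlow: A Generative Framework for Occupancy Density Estimation Using Normalizing Flows | TrajFlow: a generative framework that uses normalizing flows for occupancy density estimation in dynamic scenes | occupancy grid, CHOIS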
