cs.LG(2025-12-07)

📊 共 14 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (7 🔗1) 支柱九:具身大模型 (Embodied Foundation Models) (5 🔗1) 支柱四:生成式动作 (Generative Motion) (1) 支柱一:机器人控制 (Robot Control) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (7 篇)

#题目一句话要点标签🔗
1 Deep Reinforcement Learning for Phishing Detection with Transformer-Based Semantic Features 提出基于Transformer语义特征的QR-DQN深度强化学习方法,用于提升钓鱼网站检测的准确性和泛化性。 reinforcement learning deep reinforcement learning
2 Statistical analysis of Inverse Entropy-regularized Reinforcement Learning 提出基于熵正则化逆强化学习的统计分析框架,解决奖励函数非唯一性问题。 reinforcement learning behavior cloning inverse reinforcement learning
3 Parent-Guided Semantic Reward Model (PGSRM): Embedding-Based Reward Functions for Reinforcement Learning of Transformer Language Models 提出Parent-Guided Semantic Reward Model,用于Transformer语言模型的强化学习。 reinforcement learning PPO RLHF
4 State Diversity Matters in Offline Behavior Distillation 提出状态密度加权离线行为蒸馏算法,提升状态多样性以改善策略学习。 offline RL distillation
5 Always Keep Your Promises: DynamicLRP, A Model-Agnostic Solution To Layer-Wise Relevance Propagation 提出DynamicLRP,一种模型无关的逐层相关性传播解决方案 Mamba multimodal
6 Adaptive Normalization Mamba with Multi Scale Trend Decomposition and Patch MoE Encoding 提出AdaMamba,通过自适应归一化和多尺度趋势分解增强时间序列预测的稳定性和准确性。 Mamba
7 Know your Trajectory -- Trustworthy Reinforcement Learning deployment through Importance-Based Trajectory Analysis 提出基于重要性的轨迹分析方法,提升强化学习部署的可信度 reinforcement learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (5 篇)

#题目一句话要点标签🔗
8 KV-CAR: KV Cache Compression using Autoencoders and KV Reuse in Large Language Models KV-CAR:利用自编码器压缩KV缓存并在大语言模型中复用KV,降低内存占用。 large language model
9 A Novel Multimodal RUL Framework for Remaining Useful Life Estimation with Layer-wise Explanations 提出一种多模态剩余寿命预测框架,结合分层解释性,提升滚动轴承剩余寿命预测的准确性和可信度。 multimodal
10 Block Sparse Flash Attention 提出块稀疏Flash Attention加速长文本推理,保持模型质量。 large language model
11 GradientSpace: Unsupervised Data Clustering for Improved Instruction Tuning GradientSpace:用于改进指令调优的无监督数据聚类方法 large language model
12 GSAE: Graph-Regularized Sparse Autoencoders for Robust LLM Safety Steering 提出图正则化稀疏自编码器GSAE,用于增强LLM安全引导,提升对抗攻击下的鲁棒性。 large language model

🔬 支柱四:生成式动作 (Generative Motion) (1 篇)

#题目一句话要点标签🔗
13 Vector Quantization using Gaussian Variational Autoencoder 提出高斯量化(GQ)方法,无需额外训练即可将高斯VAE转化为高效VQ-VAE VQ-VAE

🔬 支柱一:机器人控制 (Robot Control) (1 篇)

#题目一句话要点标签🔗
14 The Art of Storytelling in Authoritarian Regimes: Crafting State Narratives on Chinese Social Media 提出一种分析框架,用于研究威权政权如何在社交媒体上构建国家叙事 manipulation

⬅️ 返回 cs.LG 首页 · 🏠 返回主页