cs.LG(2025-08-06)

📊 共 36 篇论文 | 🔗 5 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (17 🔗3) 支柱九:具身大模型 (Embodied Foundation Models) (16 🔗2) 支柱八:物理动画 (Physics-based Animation) (1) 支柱三:空间感知与语义 (Perception & Semantics) (1) 支柱五:交互与反应 (Interaction & Reaction) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (17 篇)

#题目一句话要点标签🔗
1 AttriLens-Mol: Attribute Guided Reinforcement Learning for Molecular Property Prediction with Large Language Models 提出AttriLens-Mol以解决分子属性预测中的推理效率问题 reinforcement learning large language model chain-of-thought
2 SVGen: Interpretable Vector Graphics Generation with Large Language Models 提出SVGen以解决自然语言到SVG图形生成的挑战 reinforcement learning curriculum learning large language model
3 Are Large Language Models Dynamic Treatment Planners? An In Silico Study from a Prior Knowledge Injection Angle 利用大型语言模型优化动态治疗方案以改善临床决策 reinforcement learning large language model chain-of-thought
4 Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success 提出VL-DAC以解决现有视觉语言模型训练不足问题 reinforcement learning PPO multimodal
5 Emergent time-keeping mechanisms in a deep reinforcement learning agent performing an interval timing task 提出深度强化学习代理的时间保持机制以解决时间处理问题 reinforcement learning deep reinforcement learning DRL
6 FeDaL: Federated Dataset Learning for Time Series Foundation Models 提出FeDaL以解决时间序列基础模型中的数据集异质性问题 representation learning foundation model
7 Dynamic User-controllable Privacy-preserving Few-shot Sensing Framework 提出PrivCLIP框架以解决用户隐私控制问题 contrastive learning motion generation multimodal
8 MambaITD: An Efficient Cross-Modal Mamba Network for Insider Threat Detection 提出MambaITD以解决内部威胁检测中的多模态融合问题 Mamba state space model
9 Symmetric Behavior Regularized Policy Optimization 提出对称行为正则化策略优化以解决离线强化学习中的分布偏移问题 reinforcement learning offline RL offline reinforcement learning
10 COPO: Consistency-Aware Policy Optimization 提出一致性意识的策略优化以解决强化学习中的梯度消失问题 reinforcement learning reward design large language model
11 Agnostics: Learning to Code in Any Programming Language via Reinforcement with a Universal Learning Environment 提出Agnostics以解决低资源编程语言的后训练问题 reinforcement learning large language model
12 Unified Flow Matching for Long Horizon Event Forecasting 提出统一流匹配框架以解决长时间事件预测问题 flow matching
13 Automatic LLM Red Teaming 提出基于MDP的红队策略以提升LLM安全性 reinforcement learning large language model
14 Communication-Learning Co-Design for Differentially Private Over-the-Air Federated Distillation 提出差分隐私的空中联邦蒸馏框架以提升通信效率与隐私保护 distillation
15 WSS-CL: Weight Saliency Soft-Guided Contrastive Learning for Efficient Machine Unlearning Image Classification 提出WSS-CL以解决高效机器遗忘问题 contrastive learning
16 T3Time: Tri-Modal Time Series Forecasting via Adaptive Multi-Head Alignment and Residual Fusion 提出T3Time以解决多变量时间序列预测中的适应性不足问题 MAE large language model
17 Decoupled Contrastive Learning for Federated Learning 提出解耦对比学习以解决联邦学习中的数据异质性问题 contrastive learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (16 篇)

#题目一句话要点标签🔗
18 Multimodal RAG Enhanced Visual Description 提出轻量级RAG增强视觉描述方法以解决多模态对齐问题 multimodal
19 Explainable Deep Neural Network for Multimodal ECG Signals: Intermediate vs Late Fusion 提出多模态深度神经网络以提高心电图信号分类准确性 multimodal
20 GraphProp: Training the Graph Foundation Models using Graph Properties 提出GraphProp以解决图基础模型的结构性泛化问题 foundation model
21 Decoding the Multimodal Maze: A Systematic Review on the Adoption of Explainability in Multimodal Attention-based Models 系统评估多模态注意力模型的可解释性研究 multimodal
22 Forgetting: A New Mechanism Towards Better Large Language Model Fine-tuning 提出遗忘机制以改善大语言模型的微调效果 large language model
23 Leveraging large language models for SQL behavior-based database intrusion detection 提出基于BERT的SQL异常检测方法以解决数据库入侵问题 large language model
24 PA-RNet: Perturbation-Aware Reasoning Network for Multimodal Time Series Forecasting 提出PA-RNet以解决多模态时间序列预测中的干扰问题 multimodal
25 Retrieval-Augmented Water Level Forecasting for Everglades 提出检索增强水位预测方法以解决生态管理问题 foundation model
26 Provable Post-Training Quantization: Theoretical Analysis of OPTQ and Qronos 提出可证明的后训练量化方法以解决OPTQ和Qronos的理论保证问题 large language model
27 FedHiP: Heterogeneity-Invariant Personalized Federated Learning Through Closed-Form Solutions 提出FedHiP以解决个性化联邦学习中的数据异质性问题 foundation model
28 FlexQ: Efficient Post-training INT6 Quantization for LLM Serving via Algorithm-System Co-Design 提出FlexQ以解决大语言模型量化效率问题 large language model
29 Mockingbird: How does LLM perform in general machine learning tasks? 提出Mockingbird框架以提升LLM在通用机器学习任务中的表现 large language model
30 Empowering Time Series Forecasting with LLM-Agents 提出DCATS以提升时间序列预测的数据质量 large language model
31 Do Vision-Language Models Leak What They Learn? Adaptive Token-Weighted Model Inversion Attacks 提出自适应令牌加权模型反演攻击以解决视觉语言模型隐私泄露问题 visual grounding
32 Fine-tuning for Better Few Shot Prompting: An Empirical Comparison for Short Answer Grading 提出微调方法以改善少量样本提示的短答案评分 large language model
33 Sparse Attention across Multiple-context KV Cache 提出SamKV以解决多上下文KV缓存的稀疏注意力问题 large language model

🔬 支柱八:物理动画 (Physics-based Animation) (1 篇)

#题目一句话要点标签🔗
34 CaPulse: Detecting Anomalies by Tuning in to the Causal Rhythms of Time Series 提出CaPulse以解决时间序列异常检测中的因果机制问题 PULSE

🔬 支柱三:空间感知与语义 (Perception & Semantics) (1 篇)

#题目一句话要点标签🔗
35 Keyword Spotting with Hyper-Matched Filters for Small Footprint Devices 提出一种开放词汇的关键词检测模型以解决小型设备的检测精度问题 open-vocabulary open vocabulary

🔬 支柱五:交互与反应 (Interaction & Reaction) (1 篇)

#题目一句话要点标签🔗
36 Evaluating Selective Encryption Against Gradient Inversion Attacks 提出选择性加密以应对梯度反演攻击问题 OMOMO

⬅️ 返回 cs.LG 首页 · 🏠 返回主页