cs.LG（2025-08-26）

📊 共 25 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱二：RL算法与架构 (RL & Architecture) (12 🔗1) 支柱九：具身大模型 (Embodied Foundation Models) (10) 支柱八：物理动画 (Physics-based Animation) (3)

🔬 支柱二：RL算法与架构 (RL & Architecture) (12 篇)

#	题目	一句话要点	标签	🔗	⭐
1	(DEMO) Deep Reinforcement Learning Based Resource Allocation in Distributed IoT Systems	提出基于深度强化学习的资源分配框架以解决分布式物联网系统问题	reinforcement learning deep reinforcement learning DRL
2	DRMD: Deep Reinforcement Learning for Malware Detection under Concept Drift	提出DRMD以解决恶意软件检测中的概念漂移问题	reinforcement learning deep reinforcement learning DRL
3	HAEPO: History-Aggregated Exploratory Policy Optimization	提出HAEPO以解决长时间任务探索不足的问题	reinforcement learning PPO DPO
4	History Rhymes: Accelerating LLM Reinforcement Learning with RhymeRL	提出RhymeRL以解决大语言模型强化学习中的GPU利用率低下问题	reinforcement learning large language model
5	Re:Frame -- Retrieving Experience From Associative Memory	提出Re:Frame以解决离线强化学习中的专家数据稀缺问题	reinforcement learning offline RL offline reinforcement learning
6	Beyond Tokens: Enhancing RTL Quality Estimation via Structural Graph Learning	提出StructRTL框架以提升RTL设计质量估计	representation learning distillation large language model
7	Latent Variable Modeling in Multi-Agent Reinforcement Learning via Expectation-Maximization for UAV-Based Wildlife Protection	提出基于期望最大化的潜变量建模以解决无人机野生动物保护问题	reinforcement learning PPO
8	Stability and Generalization for Bellman Residuals	提出Bellman残差最小化以解决离线强化学习中的一致性问题	reinforcement learning offline reinforcement learning inverse reinforcement learning
9	Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks	提出混合专家模型的最优稀疏性以提升推理任务性能	reinforcement learning large language model	✅
10	Atrial Fibrillation Prediction Using a Lightweight Temporal Convolutional and Selective State Space Architecture	提出轻量级深度学习模型以实现心房颤动的早期预测	Mamba state space model
11	Revisiting associative recall in modern recurrent models	探讨现代递归模型中的联想回忆问题及其优化策略	Mamba SSM
12	Dual-Distilled Heterogeneous Federated Learning with Adaptive Margins for Trainable Global Prototypes	提出双蒸馏异构联邦学习以解决原型边界收缩问题	contrastive learning distillation

🔬 支柱九：具身大模型 (Embodied Foundation Models) (10 篇)

#	题目	一句话要点	标签	🔗	⭐
13	Grounding the Ungrounded: A Spectral-Graph Framework for Quantifying Hallucinations in Multimodal LLMs	提出谱图框架量化多模态LLM中的幻觉问题	multimodal
14	FFT-MoE: Efficient Federated Fine-Tuning for Foundation Models via Large-scale Sparse MoE under Heterogeneous Edge	提出FFT-MoE以解决异构边缘环境下的联邦微调问题	foundation model
15	The Sound of Risk: A Multimodal Physics-Informed Acoustic Model for Forecasting Market Volatility and Enhancing Market Interpretability	提出多模态物理信息声学模型以增强市场波动预测能力	multimodal
16	Fine-Tuning Vision-Language Models for Neutrino Event Analysis in High-Energy Physics Experiments	提出基于视觉-语言模型的中微子事件分类方法	large language model multimodal
17	Utilizing Training Data to Improve LLM Reasoning for Tabular Understanding	提出LRTab以提升大型语言模型在表格理解中的推理能力	large language model chain-of-thought
18	Understanding Tool-Integrated Reasoning	提出工具集成推理以提升大语言模型能力	large language model
19	APT-LLM: Exploiting Arbitrary-Precision Tensor Core Computing for LLM Acceleration	提出APT-LLM以解决大语言模型加速问题	large language model
20	PAX-TS: Model-agnostic multi-granular explanations for time series forecasting via localized perturbations	提出PAX-TS以解决时间序列预测模型的可解释性问题	large language model
21	Enhancing Model Privacy in Federated Learning with Random Masking and Quantization	提出FedQSN以解决联邦学习中的模型隐私保护问题	large language model
22	Rethinking Caching for LLM Serving Systems: Beyond Traditional Heuristics	提出SISO以优化大语言模型服务系统中的缓存策略	large language model

🔬 支柱八：物理动画 (Physics-based Animation) (3 篇)

#	题目	一句话要点	标签	🔗	⭐
23	GENIE-ASI: Generative Instruction and Executable Code for Analog Subcircuit Identification	提出GENIE-ASI以解决模拟电路子电路识别问题	AMP large language model foundation model
24	Data-Augmented Few-Shot Neural Emulator for Computer-Model System Identification	提出数据增强的少样本神经仿真器以解决计算模型系统识别问题	spatiotemporal
25	Universal Dynamics with Globally Controlled Analog Quantum Simulators	提出全球控制的模拟器以实现普适量子动力学	PULSE

⬅️ 返回 cs.LG 首页 · 🏠 返回主页