cs.LG (2025-04-14)

📊 25 papers in total | 🔗 1 with code

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (11 🔗1) · Pillar 2: RL & Architecture (10) · Pillar 4: Generative Motion (1) · Pillar 7: Motion Retargeting (1) · Pillar 8: Physics-based Animation (1) · Pillar 1: Robot Control (1)

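🔬 Pillar 9: Embodied Foundation Models (11 papers)

#	Title	One-sentence summary	Tags	🔗	⭐
1	Enhancing Ultra-Low-Bit Quantization of Large Language Models Through Saliency-Aware Partial Retraining	Proposes a saliency-aware partial retraining method to improve ultra-low-bit quantization of large language models.	large language model	✅
2	Foundation models for electronic health records: representation dynamics and transferability	Studies the representation dynamics and transferability of foundation models for electronic health records to improve clinical prediction performance.	foundation model
3	Satellite Federated Fine-Tuning for Foundation Models in Space Computing Power Networks	Proposes a satellite federated fine-tuning framework that addresses limited onboard compute and space communication challenges.	foundation model
4	How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients	Uses layer-wise gradient spectral analysis to reveal how instruction and reasoning data quality affects LLM post-training.	large language model, instruction following
5	LLM-based AI Agent for Sizing of Analog and Mixed Signal Circuit	Proposes an LLM-based AI agent for transistor sizing in analog and mixed-signal circuits.	large language model
6	Can LLMs Handle WebShell Detection? Overcoming Detection Challenges with Behavioral Function-Aware Framework	Proposes the BFAD framework, which improves LLM performance on WebShell detection and surpasses traditional methods.	large language model
7	Efficient Process Reward Model Training via Active Learning	Proposes ActPRM, an active learning method that trains process reward models efficiently and reduces annotation cost.	large language model
8	Graph Neural Networks Based Analog Circuit Link Prediction	Proposes GNN-ACLP, which uses graph neural networks for analog circuit link prediction to advance circuit design automation.	large language model
9	FedRecon: Missing Modality Reconstruction in Heterogeneous Distributed Environments	FedRecon: a federated learning method for reconstructing missing modalities in heterogeneous distributed environments.	multimodal
10	KeepKV: Achieving Periodic Lossless KV Cache Compression for Efficient LLM Inference	KeepKV: periodic lossless KV cache compression for efficient LLM inference.	large language model
11	Ember: A Compiler for Efficient Embedding Operations on Decoupled Access-Execute Architectures	Ember: a compiler that optimizes embedding operations for decoupled access-execute architectures.	large language model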

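🔬 Pillar 2: RL & Architecture (10 papers)

#	Title	One-sentence summary	Tags	🔗	⭐
12	On the Value of Cross-Modal Misalignment in Multimodal Representation Learning	Models cross-modal misalignment to improve the performance and interpretability of multimodal representation learning.	representation learning, contrastive learning, multimodal
13	M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models	Proposes M1, a Mamba-based hybrid linear RNN reasoning model that improves test-time compute efficiency.	Mamba, distillation, large language model
14	Achieving Optimal Tissue Repair Through MARL with Reward Shaping and Curriculum Learning	Proposes a MARL-based tissue repair framework that optimizes the repair process through reward shaping and curriculum learning.	reinforcement learning, curriculum learning, reward shaping
15	Adaptive Sensor Steering Strategy Using Deep Reinforcement Learning for Dynamic Data Acquisition in Digital Twins	Proposes an adaptive sensor steering strategy based on deep reinforcement learning for dynamic data acquisition in digital twins.	reinforcement learning, deep reinforcement learning
16	STaRFormer: Semi-Supervised Task-Informed Representation Learning via Dynamic Attention-Based Regional Masking for Sequential Data	STaRFormer: semi-supervised, task-informed representation learning for sequential data via dynamic attention-based regional masking.	representation learning, contrastive learning, spatiotemporal
17	Reasoning without Regret	Proposes the BARS framework to address the limited effectiveness of sparse reward signals.	reward shaping, large language model, chain-of-thought
18	Using Reinforcement Learning to Integrate Subjective Wellbeing into Climate Adaptation Decision Making	Proposes a reinforcement learning framework that integrates subjective wellbeing into climate adaptation decision making.	reinforcement learning
19	AimTS: Augmented Series and Image Contrastive Learning for Time Series Classification	AimTS: improves time series classification through contrastive learning on augmented series and images.	contrastive learning
20	Improving Controller Generalization with Dimensionless Markov Decision Processes	Proposes a reinforcement learning method based on dimensionless MDPs that improves controller generalization across environments.	reinforcement learning, world model
21	Moderate Actor-Critic Methods: Controlling Overestimation Bias via Expectile Loss	Proposes Moderate Actor-Critic methods based on expectile loss to curb Q-function overestimation bias.	reinforcement learning, SAC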

🔬 Pillar 4: Generative Motion (1 paper)

#	Title	One-sentence summary	Tags	🔗	⭐
22	RadarLLM: Empowering Large Language Models to Understand Human Motion from Millimeter-Wave Point Cloud Sequence	RadarLLM: uses large language models to understand human motion from millimeter-wave radar point cloud sequences.	VQ-VAE, human motion, large language model

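🔬 Pillar 7: Motion Retargeting (1 paper)

#	Title	One-sentence summary	Tags	🔗	⭐
23	A Structure-Preserving Framework for Solving Parabolic Partial Differential Equations with Neural Networks	Proposes the Sidecar framework, which improves the physical consistency of neural network solutions to parabolic partial differential equations.	structure preservation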

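🔬 Pillar 8: Physics-based Animation (1 paper)

#	Title	One-sentence summary	Tags	🔗	⭐
24	Air Quality Prediction with A Meteorology-Guided Modality-Decoupled Spatio-Temporal Network	Proposes MDSTNet, a meteorology-guided, modality-decoupled spatio-temporal network for air quality prediction.	spatiotemporal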

🔬 Pillar 1: Robot Control (1 paper)

#	Title	One-sentence summary	Tags	🔗	⭐
25	Undermining Federated Learning Accuracy in EdgeIoT via Variational Graph Auto-Encoders	Proposes a data-independent model manipulation attack that undermines federated learning accuracy in EdgeIoT.	manipulation
