cs.LG（2025-03-05）

📊 共 29 篇论文 | 🔗 3 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (14 🔗2) 支柱二：RL算法与架构 (RL & Architecture) (9 🔗1) 支柱八：物理动画 (Physics-based Animation) (3) 支柱七：动作重定向 (Motion Retargeting) (1) 支柱三：空间感知与语义 (Perception & Semantics) (1) 支柱四：生成式动作 (Generative Motion) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (14 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Multimodal Stock Price Prediction: A Case Study of the Russian Securities Market	提出一种融合新闻文本和时间序列的多模态方法，用于提升俄罗斯股市价格预测精度。	large language model multimodal
2	LLM as GNN: Graph Vocabulary Learning for Text-Attributed Graph Foundation Models	提出PromptGFM，通过图词汇学习实现文本属性图的图基础模型。	large language model foundation model	✅
3	Exploring the Potential of Large Language Models as Predictors in Dynamic Text-Attributed Graphs	提出GraphAgent-Dynamic框架，利用协作LLM解决动态文本属性图预测难题。	large language model foundation model
4	TEDDY: A Family Of Foundation Models For Understanding Single Cell Biology	TEDDY：用于理解单细胞生物学的系列Transformer基础模型	foundation model
5	PAIR: A Novel Large Language Model-Guided Selection Strategy for Evolutionary Algorithms	PAIR：基于大语言模型引导的进化算法选择策略，提升TSP问题求解性能	large language model
6	Bridging Molecular Graphs and Large Language Models	提出Graph2Token，将分子图与大语言模型对齐，实现分子性质预测。	large language model
7	An Optimization Algorithm for Multimodal Data Alignment	提出AlignXpert算法，优化多模态数据对齐，提升跨模态推理能力	multimodal
8	LLM-Powered Prediction of Hyperglycemia and Discovery of Behavioral Treatment Pathways from Wearables and Diet	提出GlucoLens以预测餐后高血糖并发现行为治疗路径	large language model multimodal	✅
9	LEWIS (LayEr WIse Sparsity) -- A Training Free Guided Model Merging Approach	LEWIS：一种免训练的层级稀疏引导模型合并方法	large language model instruction following
10	A Little Depth Goes a Long Way: The Expressive Power of Log-Depth Transformers	提出对数深度Transformer，解决传统Transformer在长序列推理上的表达能力不足问题	chain-of-thought
11	The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems	提出MASK基准，用于区分AI系统中的诚实性与准确性	large language model
12	Memory Injection Attacks on LLM Agents via Query-Only Interaction	提出MINJA：一种针对LLM Agent的查询注入式记忆攻击方法	large language model
13	Mixture of Experts Made Intrinsically Interpretable	提出MoE-X，一种本质上可解释的混合专家语言模型，提升模型可解释性。	large language model
14	Robust Learning of Diverse Code Edits	提出SeleKT算法和NextCoder模型，提升代码语言模型在多样化代码编辑任务中的鲁棒性。	instruction following

🔬 支柱二：RL算法与架构 (RL & Architecture) (9 篇)

#	题目	一句话要点	标签	🔗	⭐
15	Feature Matching Intervention: Leveraging Observational Data for Causal Representation Learning	提出特征匹配干预(FMI)方法，从观测数据中学习因果表示	representation learning feature matching
16	Rebalanced Multimodal Learning with Data-aware Unimodal Sampling	提出数据感知的单模态采样方法，解决多模态学习中的模态不平衡问题。	reinforcement learning multimodal
17	Seldonian Reinforcement Learning for Ad Hoc Teamwork	提出基于Seldonian优化的离线强化学习方法，解决Ad Hoc团队合作中的安全可靠性问题	reinforcement learning offline RL
18	State-offset Tuning: State-based Parameter-Efficient Fine-Tuning for State Space Models	提出State-offset Tuning，一种面向状态空间模型的高效参数微调方法	SSM state space model	✅
19	Towards Understanding Distilled Reasoning Models: A Representational Approach	通过表征分析理解蒸馏推理模型：揭示模型蒸馏对LLM推理能力的影响	distillation large language model
20	Probabilistic Insights for Efficient Exploration Strategies in Reinforcement Learning	针对稀疏奖励强化学习，提出基于概率洞察的高效探索策略	reinforcement learning
21	Chunking the Critic: A Transformer-based Soft Actor-Critic with N-Step Returns	提出基于Transformer的序列条件评论家，结合N步回报，提升SAC在长程任务中的性能。	reinforcement learning SAC
22	DO-IQS: Dynamics-Aware Offline Inverse Q-Learning for Optimal Stopping with Unknown Gain Functions	提出DO-IQS，解决未知增益函数下最优停止的逆向Q学习问题	reinforcement learning inverse reinforcement learning
23	Less is more? Rewards in RL for Cyber Defence	提出基于稀疏奖励的强化学习网络防御方法，提升复杂网络环境下的防御效果。	reinforcement learning deep reinforcement learning

🔬 支柱八：物理动画 (Physics-based Animation) (3 篇)

#	题目	一句话要点	标签	🔗	⭐
24	A Survey of Foundation Models for Environmental Science	综述：环境科学领域的基础模型应用与发展	spatiotemporal foundation model
25	Limits of nonlinear and dispersive fiber propagation for an optical fiber-based extreme learning machine	基于光纤传播的极限学习机：研究非线性与色散效应对性能的限制	PULSE
26	WVEmbs with its Masking: A Method For Radar Signal Sorting	提出基于掩码的宽值嵌入方法WVEmbs，用于复杂雷达信号分选。	PULSE

🔬 支柱七：动作重定向 (Motion Retargeting) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
27	Dimensionality reduction for homological stability and global structure preservation	提出DiRe工具包以解决传统降维方法的局限性	structure preservation

🔬 支柱三：空间感知与语义 (Perception & Semantics) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
28	Is Pre-training Applicable to the Decoder for Dense Prediction?	提出$ imes$Net，实现预训练解码器在密集预测任务中的应用，显著提升性能。	depth estimation monocular depth

🔬 支柱四：生成式动作 (Generative Motion) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
29	Handling Uncertainty in Health Data using Generative Algorithms	RIGA：利用生成对抗网络处理医疗数据中的不确定性和类别不平衡问题	VQ-VAE

⬅️ 返回 cs.LG 首页 · 🏠 返回主页