cs.LG (2025-01-21)

📊 22 papers in total | 🔗 1 with code

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (11) Pillar 2: RL Algorithms & Architecture (8) Pillar 1: Robot Control (2 🔗1) Pillar 6: Video Extraction & Matching (1)

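🔬 Pillar 9: Embodied Foundation Models (11 papers)

#	Title	One-Sentence Summary	Tags	🔗	⭐
1	Large Language Models Meet Graph Neural Networks for Text-Numeric Graph Reasoning	Proposes a text-numeric graph reasoning method that combines large language models with graph neural networks, aimed at scientific discovery.	large language model
2	CroMe: Multimodal Fake News Detection using Cross-Modal Tri-Transformer and Metric Learning	Proposes the CroMe model, which uses a cross-modal Tri-Transformer and metric learning for multimodal fake news detection.	large language model multimodal
3	CDW-CoT: Clustered Distance-Weighted Chain-of-Thoughts Reasoning	Proposes CDW-CoT, which optimizes prompts through clustering and distance weighting to improve LLM performance on complex reasoning tasks.	large language model chain-of-thought
4	How Does the Spatial Distribution of Pre-training Data Affect Geospatial Foundation Models?	Studies how the spatial distribution of pre-training data affects the performance of geospatial foundation models.	foundation model
5	Adaptive PII Mitigation Framework for Large Language Models	Proposes an adaptive PII mitigation framework to address privacy compliance challenges in large language models.	large language model
6	BiMarker: Enhancing Text Watermark Detection for Large Language Models with Bipolar Watermarks	Proposes bipolar watermarks to enhance text watermark detection for large language models.	large language model
7	Linear Feedback Control Systems for Iterative Prompt Optimization in Large Language Models	Proposes an iterative prompt optimization method for LLMs based on linear feedback control, improving output quality.	large language model
8	The Journey Matters: Average Parameter Count over Pre-training Unifies Sparse and Dense Scaling Laws	Proposes a scaling law based on average parameter count that unifies performance prediction for sparse and dense pre-trained LLMs.	large language model
9	Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation	Proposes DTA-Llama, which improves LLM performance and efficiency on complex tasks through parallel tool invocation.	large language model
10	FOCUS: First Order Concentrated Updating Scheme	Proposes the FOCUS optimizer, which improves the stability and speed of LLM training under gradient noise.	large language model
11	ALoFTRAG: Automatic Local Fine Tuning for Retrieval Augmented Generation	ALoFTRAG: an automatic local fine-tuning framework for RAG that improves domain-specific accuracy.	large language model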

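🔬 Pillar 2: RL Algorithms & Architecture (8 papers)

#	Title	One-Sentence Summary	Tags	🔗	⭐
12	Compositional Instruction Following with Language Models and Reinforcement Learning	Proposes CERLLA, which uses a compositional policy representation and a reinforcement-learning semantic parser to improve language-model generalization on compositional instruction-following tasks.	reinforcement learning instruction following language conditioned
13	GLAM: Global-Local Variation Awareness in Mamba-based World Model	GLAM: a Mamba-based world model that improves sample efficiency through global-local variation awareness.	reinforcement learning world model Mamba
14	Explainable AI for Mental Health Emergency Returns: Integrating LLMs with Predictive Modeling	Integrates LLMs with predictive models to improve the accuracy and explainability of predicting return visits to mental health emergency care.	predictive model large language model
15	Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination Dynamics	Proposes a multi-agent reinforcement learning method that integrates agent termination dynamics to address uncertainty.	reinforcement learning policy learning
16	Reinforcement Learning Constrained Beam Search for Parameter Optimization of Paper Drying Under Flexible Constraints	Proposes the RLCBS algorithm for parameter optimization under flexible constraints in reinforcement learning, applied to the paper drying process.	reinforcement learning
17	Community-Aware Temporal Walks: Parameter-Free Representation Learning on Continuous-Time Dynamic Graphs	Proposes CTWalks to address the difficulty of modeling temporal and structural dynamics in representation learning on continuous-time dynamic graphs.	representation learning
18	Group-Agent Reinforcement Learning with Heterogeneous Agents	Proposes a group-agent reinforcement learning framework for heterogeneous agents that accelerates each agent's learning and improves performance.	reinforcement learning
19	Toward Effective Digraph Representation Learning: A Magnetic Adaptive Propagation based Approach	Proposes a magnetic adaptive propagation approach that effectively improves digraph representation learning performance.	representation learning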

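🔬 Pillar 1: Robot Control (2 papers)

#	Title	One-Sentence Summary	Tags	🔗	⭐
20	Audio Texture Manipulation by Exemplar-Based Analogy	Proposes an exemplar-based analogy method for audio texture manipulation, achieving sound transformation through paired speech samples.	manipulation	✅
21	SafePowerGraph-HIL: Real-Time HIL Validation of Heterogeneous GNNs for Bridging Sim-to-Real Gap in Power Grids	SafePowerGraph-HIL: uses heterogeneous graph neural networks and hardware-in-the-loop simulation to bridge the sim-to-real gap in power systems.	sim-to-real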

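🔬 Pillar 6: Video Extraction & Matching (1 paper)

#	Title	One-Sentence Summary	Tags	🔗	⭐
22	Learning Dynamic Representations via An Optimally-Weighted Maximum Mean Discrepancy Optimization Framework for Continual Learning	Proposes the OWMMD framework, which learns representations dynamically by optimizing maximum mean discrepancy, mitigating catastrophic forgetting in continual learning.	feature matching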
