cs.LG（2025-04-25）

📊 共 14 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱二：RL算法与架构 (RL & Architecture) (8 🔗1) 支柱九：具身大模型 (Embodied Foundation Models) (5) 支柱三：空间感知与语义 (Perception & Semantics) (1)

🔬 支柱二：RL算法与架构 (RL & Architecture) (8 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Multimodal graph representation learning for website generation based on visual sketch	提出基于多模态图表示学习的网站生成方法，提升设计到代码的自动化水平。	representation learning multimodal	✅
2	Deep Reinforcement Learning Based Navigation with Macro Actions and Topological Maps	提出基于宏动作和拓扑地图的深度强化学习导航方法，解决复杂环境下的稀疏奖励问题。	reinforcement learning deep reinforcement learning
3	Generalization Capability for Imitation Learning	基于信息论与数据分布特性的模仿学习泛化能力分析框架	imitation learning foundation model
4	Foundations of Safe Online Reinforcement Learning in the Linear Quadratic Regulator: $\sqrt{T}$-Regret	针对安全约束线性二次调节器，提出$\tilde{O}_T(\sqrt{T})$遗憾度的安全在线强化学习算法	reinforcement learning
5	Representation Learning for Distributional Perturbation Extrapolation	提出扰动分布自编码器(PDAE)，用于预测未见扰动下的测量分布。	representation learning
6	Generalization Guarantees for Multi-View Representation Learning and Application to Regularization via Gaussian Product Mixture Prior	针对多视角表征学习，提出基于高斯乘积混合先验的正则化方法，提升泛化性能。	representation learning
7	A Generative Graph Contrastive Learning Model with Global Signal	提出CSG2L框架，通过全局信号生成对比学习样本，提升图学习性能。	contrastive learning
8	Learning from Less: SINDy Surrogates in RL	提出基于SINDy的强化学习代理环境，降低计算成本并保持性能。	reinforcement learning model-based RL

🔬 支柱九：具身大模型 (Embodied Foundation Models) (5 篇)

#	题目	一句话要点	标签	🔗	⭐
9	TLoRA: Tri-Matrix Low-Rank Adaptation of Large Language Models	TLoRA：一种用于大型语言模型的三矩阵低秩自适应方法	large language model
10	A Unified MDL-based Binning and Tensor Factorization Framework for PDF Estimation	提出基于MDL的联合分箱与张量分解框架，用于概率密度函数估计	multimodal
11	NoEsis: Differentially Private Knowledge Transfer in Modular LLM Adaptation	NoEsis：模块化LLM自适应中的差分隐私知识迁移框架	large language model
12	Think, Prune, Train, Improve: Scaling Reasoning without Scaling Models	提出Think, Prune, Train框架，无需扩大模型即可提升LLM推理能力	large language model
13	Streaming, Fast and Slow: Cognitive Load-Aware Streaming for Efficient LLM Serving	提出认知负荷感知的LLM流式服务，提升效率并节约计算资源	large language model

🔬 支柱三：空间感知与语义 (Perception & Semantics) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
14	Three Types of Calibration with Properties and their Semantic and Formal Relationships	提出基于属性的三种校准方法，并分析其语义和形式关系，以应对预测系统校准概念碎片化问题。	semantic map

⬅️ 返回 cs.LG 首页 · 🏠 返回主页