cs.LG（2024-12-12）

📊 共 15 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (9) 支柱二：RL算法与架构 (RL & Architecture) (6 🔗1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (9 篇)

#	题目	一句话要点	标签	🔗	⭐
1	A Wander Through the Multimodal Landscape: Efficient Transfer Learning via Low-rank Sequence Multimodal Adapter	提出Wander：一种低秩序列多模态适配器，用于高效多模态迁移学习	multimodal
2	Toward Foundation Model for Multivariate Wearable Sensing of Physiological Signals	提出NormWear，用于可穿戴生理信号的多变量通用表征学习	foundation model
3	Federated Foundation Models on Heterogeneous Time Series	提出FFTS：一种异构时间序列联邦学习框架，用于训练泛化性强的基础模型	foundation model
4	Explore Theory of Mind: Program-guided adversarial data generation for theory of mind reasoning	提出ExploreToM框架，通过程序引导的对抗数据生成增强LLM的心智理论推理能力	large language model
5	CUAL: Continual Uncertainty-aware Active Learner	提出CUAL以解决持续不确定性感知主动学习问题	foundation model
6	Capturing the Temporal Dependence of Training Data Influence	提出数据价值嵌入方法，捕捉训练数据影响的时序依赖性，解决传统影响函数的局限性。	foundation model
7	CRVQ: Channel-Relaxed Vector Quantization for Extreme Compression of LLMs	提出通道松弛向量量化CRVQ，用于大语言模型极限压缩。	large language model
8	GeLoRA: Geometric Adaptive Ranks For Efficient LoRA Fine-tuning	GeLoRA：提出几何自适应秩LoRA微调方法，提升大语言模型微调效率。	large language model
9	MOPI-HFRS: A Multi-objective Personalized Health-aware Food Recommendation System with LLM-enhanced Interpretation	MOPI-HFRS：结合LLM解释的多目标个性化健康饮食推荐系统	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (6 篇)

#	题目	一句话要点	标签	🔗	⭐
10	Does Representation Matter? Exploring Intermediate Layers in Large Language Models	探索LLM中间层表征质量，发现其优于最终层并揭示架构差异	SSM state space model large language model
11	PickLLM: Context-Aware RL-Assisted Large Language Model Routing	提出PickLLM以解决LLM选择优化问题	reinforcement learning large language model
12	Optimising TinyML with Quantization and Distillation of Transformer and Mamba Models for Indoor Localisation on Edge Devices	针对边缘设备室内定位，提出Transformer和Mamba模型量化与蒸馏的TinyML优化方案	Mamba distillation
13	FAWAC: Feasibility Informed Advantage Weighted Regression for Persistent Safety in Offline Reinforcement Learning	提出FAWAC算法，通过可行性约束优势加权回归实现离线强化学习中的持久安全	reinforcement learning offline reinforcement learning
14	Single-View Graph Contrastive Learning with Soft Neighborhood Awareness	提出SIGNA：一种基于软邻域感知的单视图图对比学习框架，提升节点分类性能。	contrastive learning	✅
15	Optimal Control with Natural Images: Efficient Reinforcement Learning using Overcomplete Sparse Codes	提出基于过完备稀疏编码的强化学习方法，高效解决自然图像序列上的最优控制问题	reinforcement learning

⬅️ 返回 cs.LG 首页 · 🏠 返回主页