cs.LG(2024-12-04)

📊 共 10 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (7) 支柱九:具身大模型 (Embodied Foundation Models) (3 🔗2)

🔬 支柱二:RL算法与架构 (RL & Architecture) (7 篇)

#题目一句话要点标签🔗
1 Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models 通过评估合成数据的质量、多样性和复杂性,深入分析大语言模型生成合成数据的影响。 reinforcement learning large language model
2 Tight PAC-Bayesian Risk Certificates for Contrastive Learning 提出基于PAC-Bayes的对比学习风险证书,解决SimCLR框架下的泛化性保证问题。 representation learning contrastive learning foundation model
3 PathletRL++: Optimizing Trajectory Pathlet Extraction and Dictionary Formation via Reinforcement Learning PathletRL++:通过强化学习优化轨迹Pathlet提取和字典构建,提升轨迹数据表示效率。 reinforcement learning deep reinforcement learning
4 Cluster Specific Representation Learning 提出聚类特定表示学习框架,提升下游任务的泛化性能 representation learning contrastive learning
5 Inverse Delayed Reinforcement Learning 提出逆延迟强化学习框架,从受延迟扰动的专家轨迹中提取奖励特征并恢复策略。 reinforcement learning inverse reinforcement learning
6 Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning 提出Hyper算法,解决强化学习中探索策略对超参数敏感的问题 reinforcement learning
7 AI-Driven Day-to-Day Route Choice 提出基于LLM的出行者建模框架LLMTraveler,用于模拟日常路径选择行为 reinforcement learning large language model

🔬 支柱九:具身大模型 (Embodied Foundation Models) (3 篇)

#题目一句话要点标签🔗
8 Assessing Foundation Models' Transferability to Physiological Signals in Precision Medicine 提出评估框架模型在生理信号精准医学迁移能力的系统性流程 foundation model
9 A Water Efficiency Dataset for African Data Centers 构建非洲数据中心水资源效率数据集,评估LLM推理用水量 large language model
10 ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression ClusterKV:通过语义空间操纵LLM KV缓存,实现可召回的压缩 large language model

⬅️ 返回 cs.LG 首页 · 🏠 返回主页