cs.LG(2024-11-27)

📊 共 21 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱九:具身大模型 (Embodied Foundation Models) (11) 支柱二:RL算法与架构 (RL & Architecture) (5) 支柱一:机器人控制 (Robot Control) (3 🔗1) 支柱八:物理动画 (Physics-based Animation) (1) 支柱四:生成式动作 (Generative Motion) (1)

🔬 支柱九:具身大模型 (Embodied Foundation Models) (11 篇)

#题目一句话要点标签🔗
1 SCoTT: Strategic Chain-of-Thought Tasking for Wireless-Aware Robot Navigation in Digital Twins 提出SCoTT框架,利用视觉语言模型在数字孪生中实现无线感知机器人导航。 chain-of-thought
2 Cyber-Attack Technique Classification Using Two-Stage Trained Large Language Models 提出一种基于两阶段训练的大语言模型网络攻击技术分类方法 large language model
3 Foundation Models in Radiology: What, How, When, Why and Why Not 放射学领域的基础模型综述:定义、训练、应用与挑战 foundation model
4 FastSwitch: Optimizing Context Switching Efficiency in Fairness-aware Large Language Model Serving FastSwitch:优化公平感知大语言模型服务中的上下文切换效率 large language model
5 Energy-Efficient Split Learning for Fine-Tuning Large Language Models in Edge Networks 提出一种节能的分割学习框架,用于在边缘网络中微调大型语言模型。 large language model
6 Multimodal Integration of Longitudinal Noninvasive Diagnostics for Survival Prediction in Immunotherapy Using Deep Learning 提出MMTSimTA网络,利用多模态纵向数据预测免疫疗法癌症患者的生存率 multimodal
7 Visual Error Patterns in Multi-Modal AI: A Statistical Approach 统计建模揭示多模态AI视觉错误模式,提升模型架构 large language model
8 Timing Matters: Enhancing User Experience through Temporal Prediction in Smart Homes 提出Timing-Matters模型,预测智能家居中用户行为的时间,提升用户体验。 TAMP
9 Break the ID-Language Barrier: An Adaption Framework for LLM-based Sequential Recommendation 提出IDLE-Adapter框架,弥合LLM在序列推荐中ID与语言的知识鸿沟 large language model
10 Evaluating and Improving the Robustness of Security Attack Detectors Generated by LLMs 利用RAG和自排序提升LLM生成安全攻击检测器的鲁棒性 large language model
11 Regularized Multi-LLMs Collaboration for Enhanced Score-based Causal Discovery 提出一种正则化多LLM协作框架,增强基于分数的因果发现 large language model

🔬 支柱二:RL算法与架构 (RL & Architecture) (5 篇)

#题目一句话要点标签🔗
12 Robust Offline Reinforcement Learning with Linearly Structured f-Divergence Regularization 提出基于线性结构f-散度正则化的鲁棒离线强化学习方法,提升策略鲁棒性。 reinforcement learning policy learning offline reinforcement learning
13 Unpacking the Individual Components of Diffusion Policy 解构扩散策略:探究各组件对机器人技能学习的贡献 imitation learning diffusion policy
14 Multi-Label Contrastive Learning : A Comprehensive Study 多标签对比学习的综合研究:探索损失函数设计与优化方案 contrastive learning
15 Dynamic Retail Pricing via Q-Learning -- A Reinforcement Learning Framework for Enhanced Revenue Management 提出基于Q-Learning的强化学习框架,用于零售动态定价以提升收益管理 reinforcement learning
16 Scalable Multi-Objective Reinforcement Learning with Fairness Guarantees using Lorenz Dominance 提出基于Lorenz支配的可扩展多目标强化学习算法,保证公平性并应用于交通规划。 reinforcement learning

🔬 支柱一:机器人控制 (Robot Control) (3 篇)

#题目一句话要点标签🔗
17 Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective 提出基于稀疏训练的多任务学习梯度冲突缓解方法,提升模型性能。 manipulation generalist agent
18 One-Step Early Stopping Strategy using Neural Tangent Kernel Theory and Rademacher Complexity 基于神经正切核理论和Rademacher复杂度的神经网络单步早停策略 MPC
19 Dynamic Logistic Ensembles with Recursive Probability and Automatic Subset Splitting for Enhanced Binary Classification 提出动态Logistic集成模型,通过递归概率和自动子集划分增强二分类性能 manipulation

🔬 支柱八:物理动画 (Physics-based Animation) (1 篇)

#题目一句话要点标签🔗
20 SPTTE: A Spatiotemporal Probabilistic Framework for Travel Time Estimation 提出SPTTE时空概率框架,解决出行时间估计中数据稀疏和分布不均问题 spatiotemporal

🔬 支柱四:生成式动作 (Generative Motion) (1 篇)

#题目一句话要点标签🔗
21 Synthetic ECG Generation for Data Augmentation and Transfer Learning in Arrhythmia Classification 利用合成ECG数据增强和迁移学习提升心律失常分类性能 VQ-VAE

⬅️ 返回 cs.LG 首页 · 🏠 返回主页