cs.LG(2023-12-15)

📊 共 17 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (8) 支柱九:具身大模型 (Embodied Foundation Models) (5 🔗2) 支柱八:物理动画 (Physics-based Animation) (3) 支柱一:机器人控制 (Robot Control) (1)

🔬 支柱二:RL算法与架构 (RL & Architecture) (8 篇)

#题目一句话要点标签🔗
1 Toward Computationally Efficient Inverse Reinforcement Learning via Reward Shaping 利用奖励塑造加速逆强化学习的计算效率 reinforcement learning inverse reinforcement learning reward shaping
2 Small Dataset, Big Gains: Enhancing Reinforcement Learning by Offline Pre-Training with Model Based Augmentation 提出基于模型增强的离线预训练方法,提升小数据集强化学习性能 reinforcement learning offline reinforcement learning world model
3 Multi-Objective Reinforcement Learning-based Approach for Pressurized Water Reactor Optimization 提出PEARL方法以优化压水反应堆的多目标问题 reinforcement learning curriculum learning
4 GraphRARE: Reinforcement Learning Enhanced Graph Neural Network with Relative Entropy GraphRARE:利用相对熵和强化学习增强图神经网络,提升异质图上的节点分类性能 reinforcement learning deep reinforcement learning
5 Student as an Inherent Denoiser of Noisy Teacher 提出Peer-Advised KD,利用学生模型内在去噪能力提升噪声教师模型的知识蒸馏效果 distillation large language model
6 Assume-Guarantee Reinforcement Learning 提出一种基于假设-保证的模块化强化学习方法,解决复杂环境下的控制问题。 reinforcement learning
7 Stethoscope-guided Supervised Contrastive Learning for Cross-domain Adaptation on Respiratory Sound Classification 提出听诊器引导的监督对比学习,解决呼吸音分类中的跨域适应问题。 contrastive learning
8 Urban Region Embedding via Multi-View Contrastive Prediction 提出ReCP模型,通过多视角对比预测学习城市区域嵌入表示,提升城市功能理解。 representation learning contrastive learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (5 篇)

#题目一句话要点标签🔗
9 Data-Efficient Multimodal Fusion on a Single GPU 提出FuseMix:一种数据高效的多模态融合方法,显著降低训练成本。 multimodal
10 TSRNet: Simple Framework for Real-time ECG Anomaly Detection with Multimodal Time and Spectrogram Restoration Network TSRNet:一种基于多模态时域与频谱图恢复网络的实时心电图异常检测框架 multimodal
11 Beyond Empirical Windowing: An Attention-Based Approach for Trust Prediction in Autonomous Vehicles 提出基于注意力机制的选择性窗口网络SWAN,用于自动驾驶中的信任预测。 multimodal
12 3FM: Multi-modal Meta-learning for Federated Tasks 提出3FM:一种用于联邦任务的多模态元学习框架,解决模态异构和数据缺失问题。 multimodal
13 Vectorizing string entries for data processing on tables: when are larger language models better? 研究表格数据向量化中,大型语言模型在何种情况下更优 large language model

🔬 支柱八:物理动画 (Physics-based Animation) (3 篇)

#题目一句话要点标签🔗
14 Multi-stage Learning for Radar Pulse Activity Segmentation 提出多阶段学习方法,用于雷达脉冲活动分割与定位,提升电子战系统效能。 PULSE
15 Challenges with unsupervised LLM knowledge discovery 揭示无监督LLM知识发现的局限性:现有方法易提取显著特征而非真实知识 simulated character large language model
16 Accelerating Neural Network Training: A Brief Review 研究加速深度神经网络训练的方法,关注ResNet50、ViT和EfficientNet模型。 AMP

🔬 支柱一:机器人控制 (Robot Control) (1 篇)

#题目一句话要点标签🔗
17 Adversarial Robustness on Image Classification with $k$-means 提出基于k-means的对抗训练方法,提升图像分类聚类算法的鲁棒性。 manipulation

⬅️ 返回 cs.LG 首页 · 🏠 返回主页