cs.LG(2024-12-19)

📊 共 22 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱二:RL算法与架构 (RL & Architecture) (11 🔗1) 支柱九:具身大模型 (Embodied Foundation Models) (7) 支柱八:物理动画 (Physics-based Animation) (4)

🔬 支柱二:RL算法与架构 (RL & Architecture) (11 篇)

#题目一句话要点标签🔗
1 Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment 提出Cal-DPO,通过校准隐式奖励优化语言模型对齐,提升人类偏好对齐效果。 DPO direct preference optimization large language model
2 AdaCred: Adaptive Causal Decision Transformers with Feature Crediting AdaCred:基于特征可信度自适应因果决策Transformer,提升离线强化学习效率 reinforcement learning offline RL offline reinforcement learning
3 GFormer: Accelerating Large Language Models with Optimized Transformers on Gaudi Processors GFormer:利用优化Transformer加速Gaudi处理器上的大语言模型 linear attention large language model
4 MARIA: a Multimodal Transformer Model for Incomplete Healthcare Data 提出MARIA模型,解决医疗多模态数据缺失下的诊断与预测难题。 predictive model multimodal
5 Hierarchical Subspaces of Policies for Continual Offline Reinforcement Learning HiSPO:面向离线持续强化学习,利用分层策略子空间解决导航任务中的知识遗忘问题。 reinforcement learning offline reinforcement learning
6 Entropy Regularized Task Representation Learning for Offline Meta-Reinforcement Learning 提出熵正则化任务表征学习方法,提升离线元强化学习泛化能力 reinforcement learning representation learning
7 Offline Safe Reinforcement Learning Using Trajectory Classification 提出基于轨迹分类的离线安全强化学习方法,解决现有方法保守或违反约束问题 reinforcement learning
8 MIETT: Multi-Instance Encrypted Traffic Transformer for Encrypted Traffic Classification 提出MIETT:一种用于加密流量分类的多示例加密流量Transformer模型 contrastive learning foundation model
9 Knowledge Distillation in RNN-Attention Models for Early Prediction of Student Performance 提出RNN-Attention-KD框架,用于早期预测学生学业表现,助力教育干预。 distillation
10 CLDG: Contrastive Learning on Dynamic Graphs 提出CLDG框架,通过对比学习动态图中的时间平移不变性,提升无监督节点表示。 contrastive learning
11 ST-ReP: Learning Predictive Representations Efficiently for Spatial-Temporal Forecasting 提出ST-ReP模型,高效学习时空预测的表征,提升预测精度和可扩展性 representation learning contrastive learning

🔬 支柱九:具身大模型 (Embodied Foundation Models) (7 篇)

#题目一句话要点标签🔗
12 Time Will Tell: Timing Side Channels via Output Token Count in Large Language Models 利用LLM输出token数量的时间侧信道泄露推理输入敏感信息 large language model
13 Bag of Tricks for Multimodal AutoML with Image, Text, and Tabular Data 针对图像、文本和表格数据的多模态AutoML的经验总结与最佳实践 multimodal
14 Large Language Models on Small Resource-Constrained Systems: Performance Characterization, Analysis and Trade-offs 针对资源受限系统,论文评估并优化大语言模型在Jetson Orin上的性能表现。 large language model
15 Data for Mathematical Copilots: Better Ways of Presenting Proofs for Machine Learning 针对数学Copilot,提出更优的证明呈现方式,以提升机器学习效果。 large language model
16 HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel Languages HPC-Coder-V2:研究低资源并行语言上的代码大语言模型,提升并行代码生成能力。 large language model
17 Rethinking Uncertainty Estimation in Natural Language Generation 提出G-NLL,通过单次贪婪解码实现高效可靠的自然语言生成不确定性估计 large language model
18 GenAIOps for GenAI Model-Agility 提出GenAIOps方法,应对生成式AI模型快速迭代带来的应用质量退化问题。 foundation model

🔬 支柱八:物理动画 (Physics-based Animation) (4 篇)

#题目一句话要点标签🔗
19 LISA: Learning-Integrated Space Partitioning Framework for Traffic Accident Forecasting on Heterogeneous Spatiotemporal Data 提出LISA框架,用于异构时空数据上的交通事故事故预测,实现自适应空间划分。 spatiotemporal
20 A Generative Framework for Probabilistic, Spatiotemporally Coherent Downscaling of Climate Simulation 提出基于扩散模型的生成框架,用于气候模拟的时空一致性降尺度 spatiotemporal
21 GeoPro-Net: Learning Interpretable Spatiotemporal Prediction Models through Statistically-Guided Geo-Prototyping GeoPro-Net:通过统计引导的地理原型学习可解释的时空预测模型 spatiotemporal
22 A Universal Model for Human Mobility Prediction 提出UniMob,一个通用的人类移动预测模型,可同时处理个体轨迹和群体流量预测。 spatiotemporal

⬅️ 返回 cs.LG 首页 · 🏠 返回主页