cs.LG (2024-11-20)

📊 25 papers in total | 🔗 2 with code

🎯 Interest Area Navigation

Pillar 9: Embodied Foundation Models (13) · Pillar 2: RL & Architecture (10 🔗1) · Pillar 1: Robot Control (1) · Pillar 8: Physics-based Animation (1 🔗1)

🔬 Pillar 9: Embodied Foundation Models (13 papers)

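# | Title | One-line summary | Tags | 🔗
1 | LightLLM: A Versatile Large Language Model for Predictive Light Sensing | LightLLM: a versatile large language model for predictive light sensing. | large language model |
2 | Reflections from the 2024 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry | LLM applications in materials science and chemistry: results from the 2024 hackathon and an outlook on future work. | large language model |
3 | Sampling with Adaptive Variance for Multimodal Distributions | Proposes an adaptive-variance sampling algorithm that accelerates sampling from multimodal distributions. | multimodal |
4 | Exploring Large Language Models for Climate Forecasting | Explores the potential and limitations of large language models for climate forecasting. | large language model |
5 | Unlocking Historical Clinical Trial Data with ALIGN: A Compositional Large Language Model System for Medical Coding | ALIGN: a compositional large language model system for medical coding that unlocks historical clinical trial data. | large language model |
6 | M2oE: Multimodal Collaborative Expert Peptide Model | Proposes M2oE, a multimodal collaborative expert peptide model that improves functional peptide prediction on complex tasks. | multimodal |
7 | Federated Continual Learning for Edge-AI: A Comprehensive Survey | The first survey of federated continual learning for Edge-AI, covering knowledge fusion and retention in the dynamic environments of edge devices. | foundation model |
8 | A Collaborative Ensemble Framework for CTR Prediction | Proposes CETNet, a collaborative ensemble training framework that uses multi-model collaborative learning to improve CTR prediction. | foundation model |
9 | Promoting User Data Autonomy During the Dissolution of a Monopolistic Firm | Proposes a framework based on Conscious Data Contribution to address user data autonomy when a monopolistic firm is dissolved. | foundation model |
10 | Quantized symbolic time series approximation | Proposes QABBA, a quantized symbolic time series approximation method that improves storage efficiency while preserving accuracy, applied to time series regression. | large language model |
11 | Compute Optimal Inference and Provable Amortisation Gap in Sparse Autoencoders | Proposes an optimized inference method for sparse autoencoders to improve feature interpretability. | large language model |
12 | Hardware Scaling Trends and Diminishing Returns in Large-Scale Distributed Training | Analyzes the diminishing returns of hardware scaling in large-scale distributed training and discusses optimization strategies. | large language model |
13 | LLMSteer: Improving Long-Context LLM Inference by Steering Attention on Reused Contexts | LLMSteer: improves long-context LLM inference efficiency by steering attention on reused contexts. | large language model |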

🔬 Pillar 2: RL & Architecture (10 papers)

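# | Title | One-line summary | Tags | 🔗
14 | Multimodal large language model for wheat breeding: a new exploration of smart breeding | Proposes a multimodal large language model to address knowledge mining in wheat breeding. | reinforcement learning RLHF large language model |
15 | S$^2$ALM: Sequence-Structure Pre-trained Large Language Model for Comprehensive Antibody Representation Learning | Proposes S$^2$ALM, which fuses sequence and structure information for comprehensive antibody representation learning. | representation learning large language model foundation model |
16 | Engagement-Driven Content Generation with Large Language Models | Proposes a reinforcement-learning-based framework that uses large language models to generate content with high social engagement. | reinforcement learning large language model |
17 | A Survey On Enhancing Reinforcement Learning in Complex Environments: Insights from Human and LLM Feedback | Survey: using human and LLM feedback to enhance reinforcement learning in complex environments. | reinforcement learning large language model |
18 | DRL-Based Optimization for AoI and Energy Consumption in C-V2X Enabled IoV | Proposes a DRL-based method for optimizing AoI and energy consumption in C-V2X vehicular networks, addressing resource conflicts and competing performance objectives. | reinforcement learning deep reinforcement learning DRL |
19 | Competence-Aware AI Agents with Metacognition for Unknown Situations and Environments (MUSE) | Proposes the MUSE framework, which gives AI agents metacognitive abilities to improve adaptation to unknown environments. | reinforcement learning world model large language model |
20 | MERLOT: A Distilled LLM-based Mixture-of-Experts Framework for Scalable Encrypted Traffic Classification | Proposes MERLOT, a mixture-of-experts framework based on a distilled LLM for scalable encrypted traffic classification. | teacher-student distillation large language model |
21 | Almost Sure Convergence Rates and Concentration of Stochastic Approximation and Reinforcement Learning with Markovian Noise | Establishes almost sure convergence rates and concentration bounds for stochastic approximation and reinforcement learning algorithms under Markovian noise. | reinforcement learning |
22 | Effective Analog ICs Floorplanning with Relational Graph Neural Networks and Reinforcement Learning | Proposes an automated floorplanning method for analog ICs based on relational graph neural networks and reinforcement learning. | reinforcement learning |
23 | Conditional Distribution Learning for Graph Classification | Proposes conditional distribution learning (CDL) for semi-supervised graph classification. | representation learning contrastive learning |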

🔬 Pillar 1: Robot Control (1 paper)

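# | Title | One-line summary | Tags | 🔗
24 | Provably Efficient Action-Manipulation Attack Against Continuous Reinforcement Learning | Proposes an effective action-manipulation attack targeting security issues in continuous reinforcement learning. | manipulation reinforcement learning PPO |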

🔬 Pillar 8: Physics-based Animation (1 paper)

# | Title | One-line summary | Tags | 🔗
25 | UniFlow: A Foundation Model for Unified Urban Spatio-Temporal Flow Prediction | Proposes UniFlow to address urban spatio-temporal flow prediction. | spatiotemporal foundation model |
