cs.LG（2024-11-20）

📊 共 25 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (13) 支柱二：RL算法与架构 (RL & Architecture) (10 🔗1) 支柱一：机器人控制 (Robot Control) (1) 支柱八：物理动画 (Physics-based Animation) (1 🔗1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (13 篇)

#	题目	一句话要点	标签	🔗	⭐
1	LightLLM: A Versatile Large Language Model for Predictive Light Sensing	LightLLM：用于预测性光感知的多功能大语言模型	large language model
2	Reflections from the 2024 Large Language Model (LLM) Hackathon for Applications in Materials Science and Chemistry	材料科学与化学领域LLM应用：2024黑客松成果展示与未来展望	large language model
3	Sampling with Adaptive Variance for Multimodal Distributions	提出自适应方差采样算法，加速多峰分布的采样过程。	multimodal
4	Exploring Large Language Models for Climate Forecasting	探索大型语言模型在气候预测中的应用潜力与局限性	large language model
5	Unlocking Historical Clinical Trial Data with ALIGN: A Compositional Large Language Model System for Medical Coding	ALIGN：一种用于医学编码的组合式大语言模型系统，解锁历史临床试验数据。	large language model
6	M2oE: Multimodal Collaborative Expert Peptide Model	提出M2oE多模态协同专家肽模型，提升复杂任务中功能肽预测性能	multimodal
7	Federated Continual Learning for Edge-AI: A Comprehensive Survey	首个Edge-AI联邦持续学习综述，应对边缘设备动态环境下的知识融合与保留挑战。	foundation model
8	A Collaborative Ensemble Framework for CTR Prediction	提出CETNet协同集成训练框架，利用多模型协同学习提升CTR预测性能。	foundation model
9	Promoting User Data Autonomy During the Dissolution of a Monopolistic Firm	提出基于Conscious Data Contribution框架，解决垄断企业解体时用户数据自主权问题。	foundation model
10	Quantized symbolic time series approximation	提出QABBA：一种量化的符号时间序列近似方法，提升存储效率并保持精度，应用于时间序列回归。	large language model
11	Compute Optimal Inference and Provable Amortisation Gap in Sparse Autoencoders	提出稀疏自编码器优化推理方法以提升特征解释性	large language model
12	Hardware Scaling Trends and Diminishing Returns in Large-Scale Distributed Training	大规模分布式训练中硬件扩展的收益递减分析与优化策略	large language model
13	LLMSteer: Improving Long-Context LLM Inference by Steering Attention on Reused Contexts	LLMSteer：通过引导注意力重用上下文，提升长文本LLM推理效率	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (10 篇)

#	题目	一句话要点	标签	🔗	⭐
14	Multimodal large language model for wheat breeding: a new exploration of smart breeding	提出多模态大语言模型以解决小麦育种中的知识挖掘问题	reinforcement learning RLHF large language model
15	S$^2$ALM: Sequence-Structure Pre-trained Large Language Model for Comprehensive Antibody Representation Learning	提出S$^2$ALM，融合序列与结构信息，用于全面抗体表征学习	representation learning large language model foundation model
16	Engagement-Driven Content Generation with Large Language Models	提出基于强化学习的框架，利用大型语言模型生成高社交互动内容	reinforcement learning large language model	✅
17	A Survey On Enhancing Reinforcement Learning in Complex Environments: Insights from Human and LLM Feedback	综述：利用人类和LLM反馈增强复杂环境中的强化学习	reinforcement learning large language model
18	DRL-Based Optimization for AoI and Energy Consumption in C-V2X Enabled IoV	提出基于DRL的C-V2X车联网AoI与能耗优化方法，解决资源冲突和性能矛盾问题。	reinforcement learning deep reinforcement learning DRL
19	Competence-Aware AI Agents with Metacognition for Unknown Situations and Environments (MUSE)	提出MUSE框架，赋予AI智能体元认知能力，提升未知环境适应性	reinforcement learning world model large language model
20	MERLOT: A Distilled LLM-based Mixture-of-Experts Framework for Scalable Encrypted Traffic Classification	提出MERLOT：一种基于蒸馏LLM的混合专家框架，用于可扩展的加密流量分类。	teacher-student distillation large language model
21	Almost Sure Convergence Rates and Concentration of Stochastic Approximation and Reinforcement Learning with Markovian Noise	为马尔可夫噪声下的随机逼近和强化学习算法建立几乎必然收敛速率和集中度界限	reinforcement learning
22	Effective Analog ICs Floorplanning with Relational Graph Neural Networks and Reinforcement Learning	提出基于关系图神经网络和强化学习的模拟IC自动布局规划方法	reinforcement learning
23	Conditional Distribution Learning for Graph Classification	提出条件分布学习(CDL)方法，用于半监督图分类任务。	representation learning contrastive learning

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
24	Provably Efficient Action-Manipulation Attack Against Continuous Reinforcement Learning	提出一种有效的动作操控攻击以解决连续强化学习中的安全问题	manipulation reinforcement learning PPO

🔬 支柱八：物理动画 (Physics-based Animation) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
25	UniFlow: A Foundation Model for Unified Urban Spatio-Temporal Flow Prediction	提出UniFlow以解决城市时空流预测问题	spatiotemporal foundation model	✅

⬅️ 返回 cs.LG 首页 · 🏠 返回主页