cs.LG（2024-11-26）

📊 共 20 篇论文 | 🔗 3 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (11 🔗3) 支柱二：RL算法与架构 (RL & Architecture) (5) 支柱八：物理动画 (Physics-based Animation) (2) 支柱七：动作重定向 (Motion Retargeting) (1) 支柱四：生成式动作 (Generative Motion) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (11 篇)

#	题目	一句话要点	标签	🔗	⭐
1	APT: Architectural Planning and Text-to-Blueprint Construction Using Large Language Models for Open-World Agents	APT：利用大型语言模型进行开放世界智能体的建筑规划与蓝图构建	large language model multimodal chain-of-thought
2	Pushing the Limits of Large Language Model Quantization via the Linearity Theorem	提出线性定理以优化大语言模型量化方法	large language model
3	TabulaX: Leveraging Large Language Models for Multi-Class Table Transformations	TabulaX：利用大型语言模型进行多类别表格转换，提升数据集成效率。	large language model
4	Synthetic Data Generation with LLM for Improved Depression Prediction	利用LLM生成合成数据，提升抑郁症预测模型性能	large language model chain-of-thought
5	COAP: Memory-Efficient Training with Correlation-Aware Gradient Projection	COAP：一种相关性感知梯度投影的内存高效训练方法	multimodal
6	Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens	揭示低比特量化偏好欠训练LLM：百兆token训练量化LLM的缩放法则	large language model	✅
7	Scalable iterative pruning of large language and vision models using block coordinate descent	提出基于块坐标下降的可扩展迭代剪枝算法iCBS，用于压缩大型语言和视觉模型。	large language model
8	Data-driven development of cycle prediction models for lithium metal batteries using multi modal mining	提出基于多模态数据挖掘的锂金属电池循环寿命预测模型	large language model
9	KVPR: Efficient LLM Inference with I/O-Aware KV Cache Partial Recomputation	KVPR：一种I/O感知的KV缓存部分重计算方法，用于高效LLM推理。	large language model	✅
10	Conformalised Conditional Normalising Flows for Joint Prediction Regions in time series	提出基于Conformal Prediction的条件Normalizing Flow，用于时间序列联合预测区域生成。	multimodal
11	Condense, Don't Just Prune: Enhancing Efficiency and Performance in MoE Layer Pruning	提出ConDense-MoE，通过压缩而非剪枝MoE层，提升效率与性能	large language model	✅

🔬 支柱二：RL算法与架构 (RL & Architecture) (5 篇)

#	题目	一句话要点	标签	🔗	⭐
12	AutoElicit: Using Large Language Models for Expert Prior Elicitation in Predictive Modelling	AutoElicit：利用大语言模型为预测模型提取专家先验知识	predictive model large language model
13	Joint Combinatorial Node Selection and Resource Allocations in the Lightning Network using Attention-based Reinforcement Learning	提出基于注意力机制强化学习的LN节点选择与资源分配联合优化方法	reinforcement learning deep reinforcement learning DRL
14	CRASH: Challenging Reinforcement-Learning Based Adversarial Scenarios For Safety Hardening	提出CRASH框架，利用强化学习对抗场景提升自动驾驶安全性	reinforcement learning deep reinforcement learning
15	From Machine Learning to Machine Unlearning: Complying with GDPR's Right to be Forgotten while Maintaining Business Value of Predictive Models	提出ETID框架，在满足GDPR“被遗忘权”的同时保持预测模型业务价值。	predictive model distillation
16	Accelerating Proximal Policy Optimization Learning Using Task Prediction for Solving Environments with Delayed Rewards	提出混合策略PPO与TWTL奖励塑造，加速解决延迟奖励环境下的强化学习问题	reinforcement learning PPO reward shaping

🔬 支柱八：物理动画 (Physics-based Animation) (2 篇)

#	题目	一句话要点	标签	🔗	⭐
17	Multiscale spatiotemporal heterogeneity analysis of bike-sharing system's self-loop phenomenon: Evidence from Shanghai	针对共享单车自循环现象，提出多尺度时空异质性分析方法，助力优化单车分配。	spatiotemporal multimodal
18	A Graph Neural Network deep-dive into successful counterattacks	提出性别特定的图神经网络，用于预测足球反击成功率并分析关键因素。	spatiotemporal

🔬 支柱七：动作重定向 (Motion Retargeting) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
19	MetaGraphLoc: A Graph-based Meta-learning Scheme for Indoor Localization via Sensor Fusion	MetaGraphLoc：基于图神经网络和元学习的室内定位传感器融合方案	spatial relationship

🔬 支柱四：生成式动作 (Generative Motion) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
20	Contrastive CFG: Improving CFG in Diffusion Models by Contrasting Positive and Negative Concepts	提出对比CFG方法，通过对比正负概念提升扩散模型中CFG的性能。	classifier-free guidance

⬅️ 返回 cs.LG 首页 · 🏠 返回主页