cs.LG（2024-12-13）

📊 共 17 篇论文

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (12) 支柱二：RL算法与架构 (RL & Architecture) (3) 支柱一：机器人控制 (Robot Control) (1) 支柱八：物理动画 (Physics-based Animation) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (12 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Higher Order Transformers: Enhancing Stock Movement Prediction On Multimodal Time-Series Data	提出高阶Transformer，增强多模态时间序列数据上的股票走势预测	multimodal
2	Aspen Open Jets: Unlocking LHC Data for Foundation Models in Particle Physics	AspenOpenJets：利用LHC开放数据预训练粒子物理领域Foundation模型	foundation model
3	Benchmarking large language models for materials synthesis: the case of atomic layer deposition	ALDbench：评估大语言模型在原子层沉积材料合成中的性能	large language model
4	FDM-Bench: A Comprehensive Benchmark for Evaluating Large Language Models in Additive Manufacturing Tasks	FDM-Bench：用于评估大语言模型在增材制造任务中性能的综合基准	large language model
5	Activation Sparsity Opportunities for Compressing General Large Language Models	探索激活稀疏性以压缩通用大语言模型，实现边缘设备高效部署。	large language model
6	KVDirect: Distributed Disaggregated LLM Inference	KVDirect：实现分布式解耦LLM推理，提升资源利用率与服务能力	large language model
7	METIS: Fast Quality-Aware RAG Systems with Configuration Adaptation	METIS：通过配置自适应实现快速高质量的RAG系统	large language model
8	AdvPrefix: An Objective for Nuanced LLM Jailbreaks	AdvPrefix：一种用于细粒度大语言模型越狱的目标函数	large language model
9	Detecting LLM Hallucination Through Layer-wise Information Deficiency: Analysis of Ambiguous Prompts and Unanswerable Questions	通过层级信息缺失检测LLM幻觉以应对模糊提示和无解问题	large language model
10	Text2Cypher: Bridging Natural Language and Graph Databases	Text2Cypher：构建自然语言到图数据库查询的桥梁，提升非技术用户的使用体验。	large language model
11	Llama 3 Meets MoE: Efficient Upcycling	利用Llama 3高效训练MoE模型：低成本实现性能提升	large language model
12	HashEvict: A Pre-Attention KV Cache Eviction Strategy using Locality-Sensitive Hashing	HashEvict：利用局部敏感哈希的预注意力KV缓存淘汰策略，降低LLM推理的GPU内存消耗。	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (3 篇)

#	题目	一句话要点	标签	🔗	⭐
13	Hybrid Preference Optimization for Alignment: Provably Faster Convergence Rates by Combining Offline Preferences with Online Exploration	提出混合偏好优化(HPO)，结合离线偏好与在线探索，加速RLHF对齐。	reinforcement learning RLHF large language model
14	Solving the Inverse Alignment Problem for Efficient RLHF	提出逆向对齐方法，提升RLHF中奖励模型的训练效率与对齐效果	reinforcement learning RLHF
15	Scaling Combinatorial Optimization Neural Improvement Heuristics with Online Search and Adaptation	提出LRBS搜索策略，提升DRL组合优化启发式算法的性能与泛化性	reinforcement learning deep reinforcement learning DRL

🔬 支柱一：机器人控制 (Robot Control) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
16	What constitutes a Deep Fake? The blurry line between legitimate processing and manipulation under the EU AI Act	分析欧盟AI法案对Deepfake的定义，指出其模糊性及潜在问题	manipulation

🔬 支柱八：物理动画 (Physics-based Animation) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
17	Data Integration with Fusion Searchlight: Classifying Brain States from Resting-state fMRI	提出Fusion Searchlight框架，融合多指标提升静息态fMRI脑状态分类精度	spatiotemporal

⬅️ 返回 cs.LG 首页 · 🏠 返回主页