cs.LG（2025-12-24）

📊 共 16 篇论文 | 🔗 2 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (9) 支柱二：RL算法与架构 (RL & Architecture) (6 🔗2) 支柱七：动作重定向 (Motion Retargeting) (1)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (9 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Assessing the Software Security Comprehension of Large Language Models	系统评估大型语言模型在软件安全理解方面的能力，揭示其知识边界与常见误解。	large language model
2	Interpretable Perturbation Modeling Through Biomedical Knowledge Graphs	提出基于生物医学知识图谱的可解释扰动建模框架，用于预测药物对基因表达的影响。	foundation model multimodal
3	Fuzzwise: Intelligent Initial Corpus Generation for Fuzzing	FuzzWise：利用LLM智能生成模糊测试初始语料库，提升测试效率	large language model
4	Learning to Reconfigure: Using Device Status to Select the Right Constrained Coding Scheme	针对TDMR存储，提出基于设备状态学习的自适应约束编码方案选择方法	TAMP
5	LLMTM: Benchmarking and Optimizing LLMs for Temporal Motif Analysis in Dynamic Graphs	LLMTM：基准测试并优化LLM在动态图时间motif分析中的应用	large language model
6	LLM Swiss Round: Aggregating Multi-Benchmark Performance via Competitive Swiss-System Dynamics	提出基于瑞士轮动态竞争的LLM综合评估框架，解决静态评估的局限性。	large language model
7	Deadline-Aware Online Scheduling for LLM Fine-Tuning with Spot Market Predictions	提出基于预测的在线调度方法以优化LLM微调成本	foundation model
8	Can Agentic AI Match the Performance of Human Data Scientists?	Agentic AI在数据科学中难匹人类专家：领域知识缺失是瓶颈	large language model
9	RevFFN: Memory-Efficient Full-Parameter Fine-Tuning of Mixture-of-Experts LLMs with Reversible Blocks	RevFFN：利用可逆块实现MoE LLM全参数高效微调	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (6 篇)

#	题目	一句话要点	标签	🔗	⭐
10	ReACT-Drug: Reaction-Template Guided Reinforcement Learning for de novo Drug Design	ReACT-Drug：基于反应模板引导的强化学习用于全新药物设计	reinforcement learning PPO representation learning	✅
11	dUltra: Ultra-Fast Diffusion Language Models via Reinforcement Learning	提出dUltra，通过强化学习加速扩散语言模型并行解码，提升推理效率。	reinforcement learning distillation
12	A Survey of Freshness-Aware Wireless Networking with Reinforcement Learning	综述：基于强化学习的面向信息新鲜度的无线网络研究	reinforcement learning
13	Model Merging via Multi-Teacher Knowledge Distillation	提出SAMerging，通过多教师知识蒸馏实现模型合并，提升泛化性能。	distillation	✅
14	MiST: Understanding the Role of Mid-Stage Scientific Training in Developing Chemical Reasoning Models	提出MiST：通过中阶段科学训练提升化学推理模型性能	reinforcement learning large language model
15	Shared Representation Learning for High-Dimensional Multi-Task Forecasting under Resource Contention in Cloud-Native Backends	提出用于云原生后端高维多任务预测的共享表示学习框架，解决资源竞争下的预测难题。	representation learning

🔬 支柱七：动作重定向 (Motion Retargeting) (1 篇)

#	题目	一句话要点	标签	🔗	⭐
16	Temporal Visual Semantics-Induced Human Motion Understanding with Large Language Models	提出基于大语言模型的时间视觉语义引导的人体运动分割方法	human motion large language model

⬅️ 返回 cs.LG 首页 · 🏠 返回主页