cs.LG（2026-01-06）

📊 共 15 篇论文 | 🔗 1 篇有代码

🎯 兴趣领域导航

支柱九：具身大模型 (Embodied Foundation Models) (9 🔗1) 支柱二：RL算法与架构 (RL & Architecture) (6)

🔬 支柱九：具身大模型 (Embodied Foundation Models) (9 篇)

#	题目	一句话要点	标签	🔗	⭐
1	Uni-FinLLM: A Unified Multimodal Large Language Model with Modular Task Heads for Micro-Level Stock Prediction and Macro-Level Systemic Risk Assessment	Uni-FinLLM：统一多模态大语言模型，用于微观股票预测和宏观系统性风险评估	large language model multimodal
2	Empowering Reliable Visual-Centric Instruction Following in MLLMs	提出VC-IFEval基准，提升多模态大语言模型在视觉约束下的指令跟随能力	large language model multimodal instruction following
3	From Memorization to Creativity: LLM as a Designer of Novel Neural-Architectures	提出基于LLM的闭环神经架构设计框架，实现从记忆到创造的飞跃。	large language model
4	ATLAS: Adaptive Test-Time Latent Steering with External Verifiers for Enhancing LLMs Reasoning	ATLAS：利用外部验证器进行自适应测试时潜在引导，增强LLM推理能力	large language model
5	Audit Me If You Can: Query-Efficient Active Fairness Auditing of Black-Box LLMs	提出BAFA以解决黑箱LLM的公平性审计问题	large language model
6	Joint Encoding of KV-Cache Blocks for Scalable LLM Serving	提出KV-Cache联合编码，提升LLM高并发服务吞吐量并降低内存占用	large language model	✅
7	When the Coffee Feature Activates on Coffins: An Analysis of Feature Extraction and Steering for Mechanistic Interpretability	针对Llama 3.1的稀疏自编码器特征提取与操控脆弱性分析	large language model
8	Bridging Mechanistic Interpretability and Prompt Engineering with Gradient Ascent for Interpretable Persona Control	提出基于梯度上升的可解释Prompt工程方法，实现对LLM行为Persona的精准控制	large language model
9	Prioritized Replay for RL Post-training	提出一种基于问题优先级的强化学习后训练框架，提升大型语言模型在强化学习任务中的性能。	large language model

🔬 支柱二：RL算法与架构 (RL & Architecture) (6 篇)

#	题目	一句话要点	标签	🔗	⭐
10	Adversarial Contrastive Learning for LLM Quantization Attacks	提出对抗对比学习ACL，提升LLM量化攻击的成功率	contrastive learning large language model
11	Sparse Knowledge Distillation: A Mathematical Framework for Probability-Domain Temperature Scaling and Multi-Stage Compression	提出稀疏知识蒸馏框架以优化模型压缩与温度缩放问题	distillation
12	Decentralized Autoregressive Generation	提出去中心化自回归生成方法，解决多模态语言模型训练中的专家协作问题。	flow matching multimodal
13	Causal Manifold Fairness: Enforcing Geometric Invariance in Representation Learning	提出因果流形公平性(CMF)，通过几何不变性实现表征学习中的公平性。	representation learning
14	In-Context Reinforcement Learning through Bayesian Fusion of Context and Value Prior	SPICE：通过上下文和价值先验的贝叶斯融合实现上下文强化学习	reinforcement learning
15	Stratified Hazard Sampling: Minimal-Variance Event Scheduling for CTMC/DTMC Discrete Diffusion and Flow Models	提出分层风险抽样(SHS)，最小化CTMC/DTMC离散扩散模型的事件调度方差，提升生成质量。	flow matching multimodal

⬅️ 返回 cs.LG 首页 · 🏠 返回主页