cs.LG(2025-07-06)
📊 共 15 篇论文
🎯 兴趣领域导航
🔬 支柱二:RL算法与架构 (RL & Architecture) (8 篇)
🔬 支柱九:具身大模型 (Embodied Foundation Models) (7 篇)
| # | 题目 | 一句话要点 | 标签 | 🔗 | ⭐ |
|---|---|---|---|---|---|
| 9 | Model Inversion Attacks on Llama 3: Extracting PII from Large Language Models | 针对Llama 3的逆向攻击揭示PII泄露风险 | large language model | ||
| 10 | Sampling-aware Adversarial Attacks Against Large Language Models | 提出采样感知对抗攻击,提升大语言模型有害响应攻击的成功率和效率。 | large language model | ||
| 11 | Evaluating LLMs on Real-World Forecasting Against Expert Forecasters | 评估LLM在真实世界预测中的表现,对比专家预测 | large language model | ||
| 12 | DOTResize: Reducing LLM Width via Discrete Optimal Transport-based Neuron Merging | DOTResize:通过基于离散最优传输的神经元合并减少LLM宽度 | large language model | ||
| 13 | Source Attribution in Retrieval-Augmented Generation | 针对RAG系统,提出基于Shapley值的文档溯源方法,提升可解释性并降低计算成本。 | large language model | ||
| 14 | LoRA Is Slower Than You Think | 揭示LoRA微调并非始终加速,并提出更高效的LLM微调方法 | large language model | ||
| 15 | Just Enough Shifts: Mitigating Over-Refusal in Aligned Language Models with Targeted Representation Fine-Tuning | 提出ACTOR框架,通过激活模式微调缓解对齐语言模型过度拒绝问题 | large language model |