| 1 |
Model-Dowser: Data-Free Importance Probing to Mitigate Catastrophic Forgetting in Multimodal Large Language Models |
Model-Dowser:一种数据无关的重要性探测方法,用于缓解多模态大语言模型中的灾难性遗忘 |
large language model multimodal |
|
|
| 2 |
Alignment Drift in Multimodal LLMs: A Two-Phase, Longitudinal Evaluation of Harm Across Eight Model Releases |
纵向评估多模态LLM安全性:揭示八个模型版本中的对齐漂移现象 |
large language model multimodal |
|
|
| 3 |
OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models |
OmniSIFT:面向高效Omni-LLM的模态非对称Token压缩框架 |
large language model multimodal |
|
|
| 4 |
Focus-LIME: Surgical Interpretation of Long-Context Large Language Models via Proxy-Based Neighborhood Selection |
Focus-LIME:通过代理模型邻域选择实现长文本LLM的可解释性 |
large language model |
|
|
| 5 |
Revisiting Prompt Sensitivity in Large Language Models for Text Classification: The Role of Prompt Underspecification |
研究表明,大语言模型文本分类中Prompt敏感性部分源于Prompt欠规范问题。 |
large language model |
|
|
| 6 |
Bi-directional Bias Attribution: Debiasing Large Language Models without Modifying Prompts |
提出双向偏差归因方法,无需修改提示即可消除大型语言模型中的偏见。 |
large language model |
✅ |
|
| 7 |
DeFrame: Debiasing Large Language Models Against Framing Effects |
DeFrame:通过消除框架效应来提升大型语言模型的公平性 |
large language model |
|
|
| 8 |
Inference-Time Reasoning Selectively Reduces Implicit Social Bias in Large Language Models |
推理时推理选择性减少大型语言模型中的隐性社会偏见 |
large language model |
|
|
| 9 |
Beyond Unimodal Shortcuts: MLLMs as Cross-Modal Reasoners for Grounded Named Entity Recognition |
提出Modality-aware Consistency Reasoning (MCR)以解决GMNER中MLLM的模态偏见问题 |
large language model multimodal visual grounding |
|
|
| 10 |
Evaluating the Presence of Sex Bias in Clinical Reasoning by Large Language Models |
评估大型语言模型在临床推理中存在的性别偏见 |
large language model |
|
|
| 11 |
History-Guided Iterative Visual Reasoning with Self-Correction |
提出H-GIVR框架,通过历史信息引导迭代视觉推理并进行自校正,提升多模态大语言模型的推理可靠性。 |
large language model multimodal |
|
|
| 12 |
CoT is Not the Chain of Truth: An Empirical Internal Analysis of Reasoning LLMs for Fake News Generation |
揭示推理LLM生成虚假新闻时CoT的潜在风险,即使拒绝请求也可能包含不安全叙事。 |
large language model chain-of-thought |
|
|
| 13 |
LinGO: A Linguistic Graph Optimization Framework with LLMs for Interpreting Intents of Online Uncivil Discourse |
LinGO:利用语言图优化框架与LLM提升在线不文明言论意图识别 |
large language model chain-of-thought |
|
|
| 14 |
Can Vision Replace Text in Working Memory? Evidence from Spatial n-Back in Vision-Language Models |
探究视觉信息在视觉-语言模型工作记忆中的作用:基于空间n-back任务的证据 |
large language model multimodal |
|
|
| 15 |
Contextual Drag: How Errors in the Context Affect LLM Reasoning |
揭示上下文拖拽效应:上下文错误如何影响大语言模型推理 |
large language model |
|
|
| 16 |
Exploiting contextual information to improve stance detection in informal political discourse with LLMs |
利用上下文信息,通过大语言模型提升非正式政治语境下的立场检测 |
large language model |
|
|
| 17 |
VILLAIN at AVerImaTeC: Verifying Image-Text Claims via Multi-Agent Collaboration |
VILLAIN:基于多智能体协作验证图像-文本声明的系统 |
multimodal |
✅ |
|
| 18 |
LycheeDecode: Accelerating Long-Context LLM Inference via Hybrid-Head Sparse Decoding |
LycheeDecode:通过混合头稀疏解码加速长文本LLM推理。 |
large language model |
|
|
| 19 |
How Few-shot Demonstrations Affect Prompt-based Defenses Against LLM Jailbreak Attacks |
探究Few-shot示例对Prompt防御LLM越狱攻击的影响,揭示RoP与ToP的差异。 |
large language model |
|
|
| 20 |
Enforcing Monotonic Progress in Legal Cross-Examination: Preventing Long-Horizon Stagnation in LLM-Based Inquiry |
提出Soft-FSM,通过外部状态控制解决LLM在法律交叉询问中长期停滞问题 |
large language model |
|
|
| 21 |
Horizon-LM: A RAM-Centric Architecture for LLM Training |
Horizon-LM:一种以内存为中心的LLM训练架构,突破GPU限制。 |
large language model |
|
|
| 22 |
Can LLMs capture stable human-generated sentence entropy measures? |
研究表明LLM在多大程度上能捕捉人类句子熵的稳定性,并提供人类数据规范化的实践指南。 |
large language model |
|
|
| 23 |
Fine-Grained Activation Steering: Steering Less, Achieving More |
AUSteer:通过细粒度激活控制,以更少干预实现更优大语言模型行为调控 |
large language model |
|
|
| 24 |
Beyond Rejection Sampling: Trajectory Fusion for Scaling Mathematical Reasoning |
TrajFusion:通过轨迹融合提升LLM在数学推理中的性能 |
large language model |
|
|