| 1 |
Checkup2Action: A Multimodal Clinical Check-up Report Dataset for Patient-Oriented Action Card Generation |
Introduces the Checkup2Action dataset for evaluating the ability to generate patient-oriented action cards from multimodal clinical reports |
large language model multimodal |
|
|
| 2 |
Human-Grounded Multimodal Benchmark with 900K-Scale Aggregated Student Response Distributions from Japan's National Assessment of Academic Ability |
Introduces the Gakucho benchmark for evaluating multimodal large language models on real Japanese K-12 academic assessments. |
large language model multimodal |
✅ |
|
| 3 |
Scalable Token-Level Hallucination Detection in Large Language Models |
Introduces TokenHD, which performs token-level hallucination detection in large language models without step segmentation. |
large language model |
|
|
| 4 |
From Token to Token Pair: Efficient Prompt Compression for Large Language Models in Clinical Prediction |
Introduces MedTPE, a method for efficiently compressing EHR sequences for LLMs in clinical prediction. |
large language model |
|
|
| 5 |
Question Difficulty Estimation for Large Language Models via Answer Plausibility Scoring |
Introduces Q-DAPS, which estimates question difficulty for large language models via answer plausibility scoring |
large language model |
|
|
| 6 |
Pretraining Exposure Explains Popularity Judgments in Large Language Models |
Explains popularity judgments in large language models through pretraining data exposure |
large language model |
|
|
| 7 |
Correcting Selection Bias in Sparse User Feedback for Large Language Model Quality Estimation: A Multi-Agent Hierarchical Bayesian Approach |
Proposes a multi-agent hierarchical Bayesian approach to correct selection bias in sparse user feedback |
large language model |
|
|
| 8 |
Large Language Models for Causal Relations Extraction in Social Media: A Validation Framework for Disaster Intelligence |
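Proposes an expert-knowledge-based evaluation framework to validate the ability of large language models to extract causal relations for disaster intelligence. |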
large language model |
|
|
| 9 |
Reconstruction of Personally Identifiable Information from Supervised Finetuned Models |
Proposes the COVA algorithm for reconstructing personally identifiable information (PII) from supervised fine-tuned models. |
large language model instruction following |
|
|
| 10 |
Towards Visually-Guided Movie Subtitle Translation for Indic Languages |
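Proposes a visually guided movie subtitle translation method that improves translation quality for low-resource languages such as Hindi |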
multimodal visual grounding |
|
|
| 11 |
Task-Adaptive Embedding Refinement via Test-time LLM Guidance |
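Proposes a task-adaptive embedding refinement method based on test-time LLM guidance, improving zero-shot retrieval and classification performance. |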
instruction following |
✅ |
|
| 12 |
ORBIT: Preserving Foundational Language Capabilities in GenRetrieval via Origin-Regulated Merging |
ORBIT: preserves foundational language capabilities in generative retrieval through origin-regulated merging |
large language model |
|
|
| 13 |
Stories in Space: In-Context Learning Trajectories in Conceptual Belief Space |
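Proposes a conceptual belief space for understanding, from a geometric perspective, how beliefs change during LLM in-context learning |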
large language model |
|
|
| 14 |
Training-Inference Consistent Segmented Execution for Long-Context LLMs |
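Proposes a training-inference consistent segmented execution framework that improves the efficiency and scalability of long-context LLMs |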
large language model |
|
|
| 15 |
Three Regimes of Context-Parametric Conflict: A Predictive Framework and Empirical Validation |
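Proposes a three-regime framework for context-parametric conflict that predicts and validates the knowledge-updating behavior of large language models |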
large language model |
|
|
| 16 |
Taming Extreme Tokens: Covariance-Aware GRPO with Gaussian-Kernel Advantage Reweighting |
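Proposes a covariance-aware GRPO method that stabilizes the reasoning ability of large language models through Gaussian-kernel advantage reweighting. |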
large language model |
|
|
| 17 |
The Algorithmic Caricature: Auditing LLM-Generated Political Discourse Across Crisis Events |
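Proposes the Algorithmic Caricature approach, which compares real and generated political discourse to assess the population-level realism of LLM-generated content across crisis events. |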
large language model |
|
|
| 18 |
Mitigating Context-Memory Conflicts in LLMs through Dynamic Cognitive Reconciliation Decoding |
Proposes DCRD (Dynamic Cognitive Reconciliation Decoding) to mitigate context-memory conflicts in large language models |
large language model |
|
|
| 19 |
Geometric Factual Recall in Transformers |
Reveals a geometric factual recall mechanism in Transformers that overcomes the bottleneck of linear parameter growth |
chain-of-thought |
|
|
| 20 |
Latent Causal Void: Explicit Missing-Context Reconstruction for Misinformation Detection |
Proposes Latent Causal Void (LCV), which improves misinformation detection performance by explicitly reconstructing missing context. |
large language model |
|
|
| 21 |
Metaphor Is Not All Attention Needs |
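Poetic jailbreaks do not rely on attention mechanisms alone; they stem from how stylistic irregularity changes the way LLMs process input |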
large language model |
|
|
| 22 |
Do Language Models Encode Knowledge of Linguistic Constraint Violations? |
Proposes using sparse autoencoders to detect features of linguistic constraint violations in language models |
large language model |
|
|
| 23 |
Safety-Oriented Evaluation of Language Understanding Systems for Air Traffic Control |
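Proposes a safety-oriented evaluation framework to address the reliability of language understanding systems for air traffic control |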
large language model |
|
|
| 24 |
Robust LLM Unlearning Against Relearning Attacks: The Minor Components in Representations Matter |
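Proposes the MCU method, which makes LLM unlearning more robust against relearning attacks by optimizing the minor components in representations |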
large language model |
|
|
| 25 |
StoicLLM: Preference Optimization for Philosophical Alignment in Small Language Models |
StoicLLM: a preference optimization method for philosophical alignment in small language models |
large language model |
|
|
| 26 |
Freeze Deep, Train Shallow: Interpretable Layer Allocation for Continued Pre-Training |
Proposes LayerTracer to address the layer allocation problem in continued pre-training of large language models |
large language model |
|
|