| 1 |
Harnessing Structural Context for Entity Alignment Foundation Models |
提出ContextEA以解决知识图谱实体对齐中的结构上下文不足问题 |
foundation model |
|
|
| 2 |
LLMs Can Leak Training Data But Do They Want To? A Propensity-Aware Evaluation of Memorization in LLMs |
提出PropMe框架以评估大型语言模型的记忆能力 |
large language model |
|
|
| 3 |
Evaluating Stochastic Collapse and Implicit Bias in Multimodal Large Language Models |
提出RandomBench以评估多模态大语言模型的随机性与隐含偏差问题 |
large language model multimodal |
|
|
| 4 |
Mechanistic Insights into Functional Sparsity in Multimodal LLMs via CoRe Heads |
提出CoRe头以揭示多模态大语言模型中的功能稀疏性 |
large language model multimodal |
|
|
| 5 |
To Be Multimodal or Not to Be: Query-Adaptive Audio-Visual Person Retrieval via Active Modality Detection |
提出查询自适应框架以解决多模态人检索问题 |
multimodal |
|
|
| 6 |
IR3DE: A Linear Router for Large Language Models |
提出IR3DE以解决大型语言模型的高效路由问题 |
large language model |
|
|
| 7 |
MARDoc: A Memory-Aware Refinement Agent Framework for Multimodal Long Document QA |
提出MARDoc框架以解决长文档多模态问答中的信息稀疏问题 |
multimodal |
|
|
| 8 |
The Tell-Tale Norm: $\ell_2$ Magnitude as a Signal for Reasoning Dynamics in Large Language Models |
提出l2范数作为大型语言模型推理动态的信号 |
large language model |
✅ |
|
| 9 |
Large Language Models are Perplexed by some Political Parties |
评估大型语言模型在政治公平性上的表现 |
large language model |
|
|
| 10 |
Analysis of the Neglect-Zero Effect in Large Language Models |
探讨大语言模型中的忽视零效应及其认知过程 |
large language model |
✅ |
|
| 11 |
An Embarrassingly Simple Detector for Model Extraction Attacks in Large Language Model API Traffic |
提出一种简单有效的检测器以应对大语言模型API的模型提取攻击 |
large language model |
✅ |
|
| 12 |
AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints |
提出AdaPlanBench以解决大语言模型在动态约束下的自适应规划问题 |
large language model |
|
|
| 13 |
Using Large Language Models to Support High Volume Application Review for an Undergraduate Research Program |
基于大型语言模型的工具助力本科研究项目申请评审 |
large language model |
|
|
| 14 |
Latent Reasoning with Normalizing Flows |
提出NF-CoT框架以提升潜在推理能力 |
large language model chain-of-thought |
|
|
| 15 |
Revising Context, Shifting Simulated Stance: Auditing LLM-Based Stance Simulation in Online Discussions |
提出基于反事实上下文修订的框架以审计LLM立场模拟 |
large language model multimodal |
|
|
| 16 |
IA-RAG: Interval-Algebra-Driven Temporal Reasoning for Dynamic Knowledge Retrieval |
提出IA-RAG框架以解决动态知识检索中的时间推理问题 |
large language model TAMP |
✅ |
|
| 17 |
Beyond tokens: a unified framework for latent communication in LLM-based multi-agent systems |
提出统一框架以解决LLM多智能体系统中的潜在通信问题 |
large language model chain-of-thought |
|
|
| 18 |
When New Generators Arrive: Lifelong Machine-Generated Text Attribution via Ridge Feature Transfer |
提出RidgeFT以解决长期机器生成文本归属问题 |
large language model |
|
|
| 19 |
Re-Centering Humans in LLM Personalization |
提出人类数据驱动的LLM个性化评估方法以解决现有系统局限性 |
large language model |
|
|
| 20 |
Human Adults and LLMs as Scientists: Who Benefits from Active Exploration? |
通过主动探索提升成人的因果推理能力 |
large language model |
|
|
| 21 |
Value-and-Structure Alignment for Routing-Consistent Quantization of Mixture-of-Experts Models |
提出VSRAQ以解决MoE模型量化中的路由不一致问题 |
foundation model |
|
|
| 22 |
PromptPrint: Behavioral Biometrics Through Natural Language Prompting in LLMs |
提出PromptPrint以解决用户身份识别问题 |
large language model |
|
|
| 23 |
When to Think Deeply: Inhibitory Deliberation for LLM Reasoning |
提出IDPR框架以优化LLM推理效率 |
large language model |
|
|
| 24 |
UnpredictaBench: A Benchmark for Evaluating Distributional Randomness in LLMs |
提出UnpredictaBench以评估LLMs的分布随机性 |
large language model |
|
|
| 25 |
Scaffold, Not Vocabulary? A Controlled, Two-Tier, Pre-Registered Study of a Popperian Code-Generation Skill |
提出两层次的消融研究以评估代码生成技能的有效性 |
large language model |
|
|
| 26 |
FOXGLOVE: Understanding Goal-Oriented and Anchored Writing Feedback from Experts and LLMs on Argumentative Essays |
提出FOXGLOVE以系统比较专家与LLM在写作反馈中的差异 |
large language model |
|
|
| 27 |
Automatic Labelling of Speech Translation Errors |
提出语音翻译错误自动标注方法以提升系统可信度 |
multimodal |
|
|
| 28 |
Contextualized Prompting For Stance Detection On Social Media |
提出上下文化提示以解决社交媒体立场检测问题 |
large language model |
✅ |
|
| 29 |
The Generator-Eraser Paradox: Community Guidelines for Responsible LLM-Assisted Dialect Resource Creation |
提出生成器-消除者悖论以指导负责任的方言资源创建 |
large language model |
|
|
| 30 |
ReverseEOL: Improving Training-free Text Embeddings via Text Reversal in Decoder-only LLMs |
提出ReverseEOL以提升无训练文本嵌入的表示能力 |
large language model |
|
|
| 31 |
ProSPy: A Profiling-Driven SQL-Python Agentic Framework for Enterprise Text-to-SQL |
提出ProSPy框架以解决企业级Text-to-SQL的挑战 |
large language model |
|
|
| 32 |
Can LLMs Be Constrained to the Past? Improving Knowledge Cutoff through Recall-Based Prompting |
提出基于回忆的提示策略以改善知识截止问题 |
large language model |
|
|
| 33 |
PlanBench-V: A Spatial Planning Map Benchmark for Vision-Language Models |
提出PlanBench-V以解决空间规划图解释的评估问题 |
multimodal |
|
|
| 34 |
Predictable Scaling Laws of Optimal Hyperparameters for LLM Continued Pre-training |
提出可预测的超参数缩放法则以优化大语言模型继续预训练 |
large language model |
|
|