Disentangling Answer Engine Optimization from Platform Growth: A Log-Based Natural Experiment on ChatGPT Referral Traffic

📄 arXiv: 2606.04362v1 📥 PDF

作者: Keisuke Watanabe, Kazuki Nakayashiki

分类: cs.IR, cs.CL

发布日期: 2026-06-03

备注: 9 pages, 4 figures, 1 table


💡 一句话要点

提出基于日志的实验方法以优化答案引擎流量分析

🎯 匹配领域: 支柱九:具身大模型 (Embodied Foundation Models)

关键词: 答案引擎优化 流量分析 因果推断 数字营销 内容优化

📋 核心要点

  1. 现有的答案引擎优化(AEO)研究通常未能有效区分平台增长与具体优化措施的影响,导致结果不够准确。
  2. 本研究通过对特定网站的AEO干预进行纵向分析,利用第一方分析和服务器日志,提供了更为清晰的因果关系视角。
  3. 实验结果显示,经过AEO干预的页面流量增长显著,但由于实验设计的局限性,需谨慎解读其因果效应。

📝 摘要(中文)

大型语言模型(LLM)如ChatGPT现在能够向开放网络发送可测量的推荐流量,衍生出一种类似于搜索引擎优化的实践,称为答案引擎优化(AEO)。本研究通过对一个高流量域(glasp.co)进行纵向实地研究,分析了AEO干预的效果。研究发现,平台的整体增长对推荐流量的影响显著,未经处理的页面也经历了相应的增长。尽管AEO干预显示出一定的效果,但由于实验前期数据的噪声,结果仍需谨慎解读。该研究强调了通过域内控制分离干预与平台增长的重要性,表明现有AEO成功案例可能高估了因果效应。

🔬 方法详解

问题定义:本研究旨在解决现有AEO研究中未能有效区分平台增长与具体优化措施的影响问题,导致对流量增长的因果关系理解不清晰。

核心思路:通过对单一高流量域的AEO干预进行纵向研究,利用同域未处理页面作为对照组,分析干预效果与平台增长的相对影响。

技术框架:研究采用了第一方分析工具和服务器日志,构建了一个包含干预与对照组的时间序列模型,分析流量变化。

关键创新:本研究的创新在于通过域内控制分离干预与平台增长的影响,提供了更为准确的因果分析框架,避免了传统方法的偏差。

关键设计:实验中采用了干预前后的流量对比,使用了中断时间序列模型来估计干预效果,并进行了保守的时间置换检验以验证结果的稳健性。

🖼️ 关键图片

fig_0
fig_1
fig_2

📊 实验亮点

实验结果显示,经过AEO干预的页面流量增长了1.82倍(95% CI 1.31-2.54),而未经处理页面的增长为3.5倍。尽管干预效果显著,但由于实验前期数据的噪声,需谨慎解读其因果关系。

🎯 应用场景

该研究的结果对数字营销和内容优化领域具有重要的应用价值,能够帮助企业更准确地评估AEO策略的有效性。同时,研究方法也可推广至其他平台流量分析,提升因果推断的准确性。

📄 摘要(原文)

Large language model (LLM) "answer engines" such as ChatGPT now send measurable referral traffic to the open web, and a practice analogous to search engine optimization, here called Answer Engine Optimization (AEO), has emerged. Public AEO success stories typically quote large raw growth multiples, but raw referral growth is confounded by the rapid platform-level growth of the answer engines themselves. We report a longitudinal field study on a single high-traffic domain (glasp.co) whose corpus of hundreds of thousands of YouTube question-and-answer pages received a defined bundle of AEO interventions in January 2026 (detailed in Section 4). Because the interventions were concentrated on one subset of the site, the untreated remainder of the same domain acts as a contemporaneous control that absorbs the platform tailwind. Using first-party analytics and server logs rather than probabilistic third-party estimators, we find: (1) raw growth is dominated by the platform tailwind: on monthly aggregates total ChatGPT referrals grew 5.7x while untreated pages on the same domain grew 3.5x over the same window; (2) an interrupted time-series model on the weekly treated/control ratio estimates a discrete, intervention-aligned level increase of 1.82x (95% CI 1.31-2.54, HAC p=0.001), robust across engagement-filtered traffic (2.27x) and alternative specifications; (3) however, a conservative placebo-in-time permutation test yields p=0.16, so the effect is suggestive, not conclusive, given a short and noisy pre-period; and (4) Google organic clicks to treated pages did not fall beyond the ambient site-wide trend and indexation was preserved, consistent with the SEO-protection rule. The methodological message, separating treatment from platform tailwind with an on-domain control, matters more than any single multiple, and implies that headline AEO multiples substantially overstate causal effect.