Automatic detection of abnormal clinical EEG: comparison of a finetuned foundation model with two deep learning models

作者: Aurore Bussalb, François Le Gac, Guillaume Jubien, Mohamed Rahmouni, Ruggero G. Bettinardi, Pedro Marinho R. de Oliveira, Phillipe Derambure, Nicolas Gaspard, Jacques Jonas, Louis Maillard, Laurent Vercueil, Hervé Vespignani, Philippe Laval, Laurent Koessler, Ulysse Gimenez

分类: q-bio.NC, cs.LG, eess.SP

发布日期: 2025-05-13

备注: 20 pages, 7 figures

💡 一句话要点

利用微调的预训练模型BioSerenity-E1实现脑电图异常自动检测

🎯 匹配领域: 支柱九：具身大模型 (Embodied Foundation Models)

关键词: 脑电图分析 异常检测 预训练模型 迁移学习 深度学习

📋 核心要点

脑电图判读耗时且需要专业知识，现有AI辅助工具仍有提升空间，尤其是在小样本和泛化性方面。
论文提出利用预训练的BioSerenity-E1模型，通过微调适应脑电图异常检测任务，降低了对大规模标注数据的依赖。
实验结果表明，微调后的BioSerenity-E1在多个数据集上均优于其他深度学习模型，展现了良好的性能和泛化能力。

📝 摘要（中文）

脑电图(EEG)是医生诊断多种神经系统疾病的常用手段。由于需要解读的脑电图数量庞大且涉及专业知识，因此正在开发基于人工智能的工具来辅助视觉分析。本文比较了两个深度学习模型(CNN-LSTM和基于Transformer的模型)与最近提出的基础模型BioSerenity-E1，用于将整个脑电图记录分类为正常或异常。这三个模型在2500个脑电图记录上进行了训练或微调，并在两个私有数据集和一个公共数据集上评估了它们的性能：一个由单个专家注释的大型多中心数据集(数据集A，n = 4,480个记录)，一个由三个专家注释的小型多中心数据集(数据集B，n = 198个记录)和Temple University Abnormal (TUAB) EEG语料库评估数据集(n = 276个记录)。在数据集A上，这三个模型都达到了至少86%的平衡准确率，其中微调后的BioSerenity-E1达到了最高的平衡准确率(89.19% [88.36-90.41])。微调后的BioSerenity-E1在数据集B上也取得了最佳性能，平衡准确率为94.63% [92.32-98.12]。然后在TUAB评估数据集上验证了这些模型，该数据集的相应训练集在训练期间未使用，它们达到了至少76%的准确率。具体而言，微调后的BioSerenity-E1优于其他两个模型，达到了82.25% [78.27-87.48]的准确率。结果表明，利用预训练模型进行自动脑电图分类的有效性：能够以更少的资源和更广泛的适用性实现对脑电图数据的稳健而有效的解释。

🔬 方法详解

问题定义：论文旨在解决脑电图(EEG)异常的自动检测问题。现有方法，如传统的深度学习模型，通常需要大量的标注数据进行训练，且在不同数据集上的泛化能力有限。此外，脑电图判读需要专业的医学知识，标注成本高昂。

核心思路：论文的核心思路是利用预训练的基础模型BioSerenity-E1，通过微调的方式将其适应于脑电图异常检测任务。预训练模型已经在大量数据上学习了通用的特征表示，因此可以通过少量目标任务的数据进行微调，从而提高模型的性能和泛化能力。

技术框架：整体框架包括三个主要步骤：1) 选择预训练模型BioSerenity-E1；2) 使用脑电图数据集对BioSerenity-E1进行微调，使其适应正常/异常脑电图分类任务；3) 在独立的测试数据集上评估微调后模型的性能。同时，论文还对比了CNN-LSTM和Transformer两种深度学习模型，作为基线模型。

关键创新：论文的关键创新在于将预训练的基础模型应用于脑电图异常检测任务。与从头开始训练的深度学习模型相比，预训练模型可以利用已有的知识，从而在数据量有限的情况下获得更好的性能。此外，BioSerenity-E1本身也是一个创新点，但论文中没有详细介绍其预训练过程和架构。

关键设计：论文中没有详细描述BioSerenity-E1的具体架构和预训练方式，但提到了使用平衡准确率作为评估指标，以应对数据不平衡问题。微调过程的具体参数设置（如学习率、batch size等）也未详细说明。损失函数未知。

📊 实验亮点

实验结果表明，微调后的BioSerenity-E1在数据集A上达到了89.19%的平衡准确率，在数据集B上达到了94.63%的平衡准确率，均优于CNN-LSTM和Transformer模型。在独立的TUAB数据集上，BioSerenity-E1的准确率达到了82.25%，也优于其他模型，展现了良好的泛化能力。

🎯 应用场景

该研究成果可应用于临床脑电图的辅助诊断，提高诊断效率和准确性，减轻医生的工作负担。尤其是在医疗资源匮乏的地区，该技术可以帮助医生快速筛查异常脑电图，从而更好地为患者提供服务。未来，该技术有望集成到智能医疗设备中，实现远程脑电图监测和诊断。

📄 摘要（原文）

Electroencephalography (EEG) is commonly used by physicians for the diagnosis of numerous neurological disorders. Due to the large volume of EEGs requiring interpretation and the specific expertise involved, artificial intelligence-based tools are being developed to assist in their visual analysis. In this paper, we compare two deep learning models (CNN-LSTM and Transformer-based) with BioSerenity-E1, a recently proposed foundation model, in the task of classifying entire EEG recordings as normal or abnormal. The three models were trained or finetuned on 2,500 EEG recordings and their performances were evaluated on two private and one public datasets: a large multicenter dataset annotated by a single specialist (dataset A composed of n = 4,480 recordings), a small multicenter dataset annotated by three specialists (dataset B, n = 198), and the Temple University Abnormal (TUAB) EEG corpus evaluation dataset (n = 276). On dataset A, the three models achieved at least 86% balanced accuracy, with BioSerenity-E1 finetuned achieving the highest balanced accuracy (89.19% [88.36-90.41]). BioSerenity-E1 finetuned also achieved the best performance on dataset B, with 94.63% [92.32-98.12] balanced accuracy. The models were then validated on TUAB evaluation dataset, whose corresponding training set was not used during training, where they achieved at least 76% accuracy. Specifically, BioSerenity-E1 finetuned outperformed the other two models, reaching an accuracy of 82.25% [78.27-87.48]. Our results highlight the usefulness of leveraging pre-trained models for automatic EEG classification: enabling robust and efficient interpretation of EEG data with fewer resources and broader applicability.

Automatic detection of abnormal clinical EEG: comparison of a finetuned foundation model with two deep learning models

💡 一句话要点

📋 核心要点

📝 摘要（中文）

🔬 方法详解

📊 实验亮点

🎯 应用场景

📄 摘要（原文）

⭐ 我的收藏

📁 新建收藏夹

⚙️ 管理收藏夹

🔍 搜索论文

🔐 登录 / 注册

👤 用户管理