A Quantitative Comparison Between Human and Artificial Intelligence in the Detection of Focal Cortical Dysplasia.
Journal
Investigative radiology
ISSN: 1536-0210
Titre abrégé: Invest Radiol
Pays: United States
ID NLM: 0045377
Informations de publication
Date de publication:
23 Oct 2024
23 Oct 2024
Historique:
medline:
22
10
2024
pubmed:
22
10
2024
entrez:
22
10
2024
Statut:
aheadofprint
Résumé
Artificial intelligence (AI) is thought to improve lesion detection. However, a lack of knowledge about human performance prevents a comparative evaluation of AI and an accurate assessment of its impact on clinical decision-making. The objective of this work is to quantitatively evaluate the ability of humans to detect focal cortical dysplasia (FCD), compare it to state-of-the-art AI, and determine how it may aid diagnostics. We prospectively recorded the performance of readers in detecting FCDs using single points and 3-dimensional bounding boxes. We acquired predictions of 3 AI models for the same dataset and compared these to readers. Finally, we analyzed pairwise combinations of readers and models. Twenty-eight readers, including 20 nonexpert and 5 expert physicians, reviewed 180 cases: 146 subjects with FCD (median age: 25, interquartile range: 18) and 34 healthy control subjects (median age: 43, interquartile range: 19). Nonexpert readers detected 47% (95% confidence interval [CI]: 46, 49) of FCDs, whereas experts detected 68% (95% CI: 65, 71). The 3 AI models detected 32%, 51%, and 72% of FCDs, respectively. The latter, however, also predicted more than 13 false-positive clusters per subject on average. Human performance was improved in the presence of a transmantle sign (P < 0.001) and cortical thickening (P < 0.001). In contrast, AI models were sensitive to abnormal gyration (P < 0.01) or gray-white matter blurring (P < 0.01). Compared with single experts, expert-expert pairs detected 13% (95% CI: 9, 18) more FCDs (P < 0.001). All AI models increased expert detection rates by up to 19% (95% CI: 15, 24) (P < 0.001). Nonexpert+AI pairs could still outperform single experts by up to 13% (95% CI: 10, 17). This study pioneers the comparative evaluation of humans and AI for FCD lesion detection. It shows that AI and human predictions differ, especially for certain MRI features of FCD, and, thus, how AI may complement the diagnostic workup.
Identifiants
pubmed: 39437019
doi: 10.1097/RLI.0000000000001125
pii: 00004424-990000000-00260
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Informations de copyright
Copyright © 2024 Wolters Kluwer Health, Inc. All rights reserved.
Déclaration de conflit d'intérêts
Conflicts of interest and sources of funding: A.R. has received fees as a speaker from UCB Pharma and travel support from the Elisabeth und Helmut Uhl Stiftung. U.A. has received fees as a speaker for Siemens Healthineers and as clinical consultant for Bayer. A.R. lectures for Guerbet and Bayer, and is part of the Advisory Board for GE, Bracco, and Guerbet. R.S. has received personal fees as speaker or for serving on advisory boards from Angelini, Arvelle, Bial, Desitin, Eisai, Jazz Pharmaceuticals Germany GmbH, Janssen-Cilag GmbH, LivaNova, LivAssured B.V., Novartis, Precisis GmbH, Rapport Therapeutics, Tabuk Pharmaceuticals, UCB Pharma, UNEEG, and Zogenix. T.R. has received fees as a speaker from Eisai. None of the previously mentioned activities were related to the content of this manuscript. The remaining authors have nothing to declare. No external funding was received for this work.
Références
Lång K, Josefsson V, Larsson A-M, et al. Artificial intelligence-supported screen reading versus standard double reading in the Mammography Screening With Artificial Intelligence Trial (MASAI): a clinical safety analysis of a randomised, controlled, non-inferiority, single-blinded, screening accuracy study. Lancet Oncol. 2023;24:936–944.
McKinney SM, Sieniek M, Godbole V, et al. International evaluation of an AI system for breast cancer screening. Nature. 2020;577:89–94.
Blumcke I, Spreafico R, Haaker G, et al. Histopathological findings in brain tissue obtained during epilepsy surgery. N Engl J Med. 2017;377:1648–1656.
Tassi L, Colombo N, Garbelli R, et al. Focal cortical dysplasia: neuropathological subtypes, EEG, neuroimaging and surgical outcome. Brain. 2002;125(Pt 8):1719–1732.
Téllez-Zenteno JF, Hernández Ronquillo L, Moien-Afshari F, et al. Surgical outcomes in lesional and non-lesional epilepsy: a systematic review and meta-analysis. Epilepsy Res. 2010;89:310–318.
Timoney N, Rutka JT. Recent advances in epilepsy surgery and achieving best outcomes using high-frequency oscillations, diffusion tensor imaging, magnetoencephalography, intraoperative neuromonitoring, focal cortical dysplasia, and bottom of sulcus dysplasia. Neurosurgery. 2017;64(CN_suppl_1):1–10.
Nielsen AS, Kinkel RP, Tinelli E, et al. Focal cortical lesion detection in multiple sclerosis: 3 Tesla DIR versus 7 Tesla FLASH-T2. J Magn Reson Imaging. 2012;35:537–542.
Wehner T, Weckesser P, Schulz S, et al. Factors influencing the detection of treatable epileptogenic lesions on MRI. A randomized prospective study. Neurol Res Pract. 2021;3:41.
David B, Kröll-Seger J, Schuch F, et al. External validation of automated focal cortical dysplasia detection using morphometric analysis. Epilepsia. 2021;62:1005–1021.
Spitzer H, Ripart M, Whitaker K, et al. Interpretable surface-based detection of focal cortical dysplasias: a multi-centre epilepsy lesion detection study. Brain. 2022;145:3859–3871.
Gill RS, Lee H-M, Caldairou B, et al. Multicenter validation of a deep learning detection algorithm for focal cortical dysplasia. Neurology. 2021;97:e1571–e1582.
Maier-Hein L, Reinke A, Godau P, et al. Metrics reloaded: recommendations for image analysis validation. Nat Methods. 2024;21:195–212.
Alaverdyan Z, Jung J, Bouet R, et al. Regularized Siamese neural network for unsupervised outlier detection on brain multiparametric magnetic resonance imaging: application to epilepsy lesion screening. Med Image Anal. 2020;60:101618.
Schuch F, Walger L, Schmitz M, et al. An open presurgery MRI dataset of people with epilepsy and focal cortical dysplasia type II. Sci Data. 2023;10:475.
Sherif T, Kassis N, Rousseau M-É, et al. BrainBrowser: distributed, Web-based neurological data visualization. Front Neuroinform. 2015;8:89.
Huppertz H-J, Grimm C, Fauser S, et al. Enhanced visualization of blurred gray-white matter junctions in focal cortical dysplasia by voxel-based 3D MRI analysis. Epilepsy Res. 2005;67:35–50.
Guindon B, Zhang Y. Application of the dice coefficient to accuracy assessment of object-based image classification. Can J Remote Sens. 2017;43:48–61.
Pakdaman Naeini M, Cooper G, Hauskrecht M. Obtaining well calibrated probabilities using Bayesian binning. In: Proceedings of the AAAI Conference on Artificial Intelligence. 2015;29. Available at: https://doi.org/10.1609/aaai.v29i1.9602. Accessed September 26, 2024.
StataCorp. Stata/SE. College Station, TX: StataCorp LLC; 2023.
Wagstyl K, Whitaker K, Raznahan A, et al. Atlas of lesion locations and postsurgical seizure freedom in focal cortical dysplasia: a MELD study. Epilepsia. 2022;63:61–74.
Najm I, Lal D, Alonso Vanegas M, et al. The ILAE consensus classification of focal cortical dysplasia: an update proposed by an ad hoc task force of the ILAE diagnostic methods commission. Epilepsia. 2022;63:1899–1919.
Lee SY, Ha S, Jeon MG, et al. Localization-adjusted diagnostic performance and assistance effect of a computer-aided detection system for pneumothorax and consolidation. NPJ Digit Med. 2022;5:107.