Hi-BEHRT: Hierarchical Transformer-Based Model for Accurate Prediction of Clinical Events Using Multimodal Longitudinal Electronic Health Records.
Journal
IEEE journal of biomedical and health informatics
ISSN: 2168-2208
Titre abrégé: IEEE J Biomed Health Inform
Pays: United States
ID NLM: 101604520
Informations de publication
Date de publication:
02 2023
02 2023
Historique:
medline:
10
4
2023
pubmed:
26
11
2022
entrez:
25
11
2022
Statut:
ppublish
Résumé
Electronic health records (EHR) represent a holistic overview of patients' trajectories. Their increasing availability has fueled new hopes to leverage them and develop accurate risk prediction models for a wide range of diseases. Given the complex interrelationships of medical records and patient outcomes, deep learning models have shown clear merits in achieving this goal. However, a key limitation of current study remains their capacity in processing long sequences, and long sequence modelling and its application in the context of healthcare and EHR remains unexplored. Capturing the whole history of medical encounters is expected to lead to more accurate predictions, but the inclusion of records collected for decades and from multiple resources can inevitably exceed the receptive field of the most existing deep learning architectures. This can result in missing crucial, long-term dependencies. To address this gap, we present Hi-BEHRT, a hierarchical Transformer-based model that can significantly expand the receptive field of Transformers and extract associations from much longer sequences. Using a multimodal large-scale linked longitudinal EHR, the Hi-BEHRT exceeds the state-of-the-art deep learning models 1% to 5% for area under the receiver operating characteristic (AUROC) curve and 1% to 8% for area under the precision recall (AUPRC) curve on average, and 2% to 8% (AUROC) and 2% to 11% (AUPRC) for patients with long medical history for 5-year heart failure, diabetes, chronic kidney disease, and stroke risk prediction. Additionally, because pretraining for hierarchical Transformer is not well-established, we provide an effective end-to-end contrastive pre-training strategy for Hi-BEHRT using EHR, improving its transferability on predicting clinical events with relatively small training dataset.
Identifiants
pubmed: 36427286
doi: 10.1109/JBHI.2022.3224727
pmc: PMC7615082
mid: EMS177288
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
1106-1117Subventions
Organisme : British Heart Foundation
ID : FS/PHD/21/29110
Pays : United Kingdom
Organisme : British Heart Foundation
ID : PG/18/65/33872
Pays : United Kingdom
Organisme : Department of Health
Pays : United Kingdom
Références
World Health Stat Q. 1988;41(1):32-6
pubmed: 3376487
BMJ. 1990 Apr 28;300(6732):1092
pubmed: 2344534
Sci Rep. 2020 Apr 28;10(1):7155
pubmed: 32346050
IEEE J Biomed Health Inform. 2017 Jan;21(1):22-30
pubmed: 27913366
Int J Epidemiol. 2015 Jun;44(3):827-36
pubmed: 26050254
J Am Med Inform Assoc. 2017 Jan;24(1):198-208
pubmed: 27189013
Lancet. 2018 Feb 10;391(10120):572-580
pubmed: 29174292
Lancet Digit Health. 2019 May 20;1(2):e63-e77
pubmed: 31650125
J Biomed Inform. 2020 Dec;112:103606
pubmed: 33127447
NPJ Digit Med. 2021 May 20;4(1):86
pubmed: 34017034
PLoS Med. 2018 Mar 6;15(3):e1002513
pubmed: 29509757
Sci Rep. 2021 Oct 19;11(1):20685
pubmed: 34667200
Annu Rev Biomed Data Sci. 2018 Jul;1:53-68
pubmed: 31218278
PLoS One. 2014 Oct 01;9(10):e106455
pubmed: 25271417