The Real-World Experiences of Persons With Multiple Sclerosis During the First COVID-19 Lockdown: Application of Natural Language Processing.

COVID-19 clinical informatics health data linguistic inquiry medical informatics multiple sclerosis natural language processing nervous system disease nervous system disorder patient data sentiment analysis textual data topic modeling

Journal

JMIR medical informatics
ISSN: 2291-9694
Titre abrégé: JMIR Med Inform
Pays: Canada
ID NLM: 101645109

Informations de publication

Date de publication:
10 Nov 2022
Historique:
received: 17 03 2022
accepted: 02 10 2022
pubmed: 18 10 2022
medline: 18 10 2022
entrez: 17 10 2022
Statut: epublish

Résumé

The increasing availability of "real-world" data in the form of written text holds promise for deepening our understanding of societal and health-related challenges. Textual data constitute a rich source of information, allowing the capture of lived experiences through a broad range of different sources of information (eg, content and emotional tone). Interviews are the "gold standard" for gaining qualitative insights into individual experiences and perspectives. However, conducting interviews on a large scale is not always feasible, and standardized quantitative assessment suitable for large-scale application may miss important information. Surveys that include open-text assessments can combine the advantages of both methods and are well suited for the application of natural language processing (NLP) methods. While innovations in NLP have made large-scale text analysis more accessible, the analysis of real-world textual data is still complex and requires several consecutive steps. We developed and subsequently examined the utility and scientific value of an NLP pipeline for extracting real-world experiences from textual data to provide guidance for applied researchers. We applied the NLP pipeline to large-scale textual data collected by the Swiss Multiple Sclerosis (MS) registry. Such textual data constitute an ideal use case for the study of real-world text data. Specifically, we examined 639 text reports on the experienced impact of the first COVID-19 lockdown from the perspectives of persons with MS. The pipeline has been implemented in Python and complemented by analyses of the "Linguistic Inquiry and Word Count" software. It consists of the following 5 interconnected analysis steps: (1) text preprocessing; (2) sentiment analysis; (3) descriptive text analysis; (4) unsupervised learning-topic modeling; and (5) results interpretation and validation. A topic modeling analysis identified the following 4 distinct groups based on the topics participants were mainly concerned with: "contacts/communication;" "social environment;" "work;" and "errands/daily routines." Notably, the sentiment analysis revealed that the "contacts/communication" group was characterized by a pronounced negative emotional tone underlying the text reports. This observed heterogeneity in emotional tonality underlying the reported experiences of the first COVID-19-related lockdown is likely to reflect differences in emotional burden, individual circumstances, and ways of coping with the pandemic, which is in line with previous research on this matter. This study illustrates the timely and efficient applicability of an NLP pipeline and thereby serves as a precedent for applied researchers. Our study thereby contributes to both the dissemination of NLP techniques in applied health sciences and the identification of previously unknown experiences and burdens of persons with MS during the pandemic, which may be relevant for future treatment.

Sections du résumé

BACKGROUND BACKGROUND
The increasing availability of "real-world" data in the form of written text holds promise for deepening our understanding of societal and health-related challenges. Textual data constitute a rich source of information, allowing the capture of lived experiences through a broad range of different sources of information (eg, content and emotional tone). Interviews are the "gold standard" for gaining qualitative insights into individual experiences and perspectives. However, conducting interviews on a large scale is not always feasible, and standardized quantitative assessment suitable for large-scale application may miss important information. Surveys that include open-text assessments can combine the advantages of both methods and are well suited for the application of natural language processing (NLP) methods. While innovations in NLP have made large-scale text analysis more accessible, the analysis of real-world textual data is still complex and requires several consecutive steps.
OBJECTIVE OBJECTIVE
We developed and subsequently examined the utility and scientific value of an NLP pipeline for extracting real-world experiences from textual data to provide guidance for applied researchers.
METHODS METHODS
We applied the NLP pipeline to large-scale textual data collected by the Swiss Multiple Sclerosis (MS) registry. Such textual data constitute an ideal use case for the study of real-world text data. Specifically, we examined 639 text reports on the experienced impact of the first COVID-19 lockdown from the perspectives of persons with MS. The pipeline has been implemented in Python and complemented by analyses of the "Linguistic Inquiry and Word Count" software. It consists of the following 5 interconnected analysis steps: (1) text preprocessing; (2) sentiment analysis; (3) descriptive text analysis; (4) unsupervised learning-topic modeling; and (5) results interpretation and validation.
RESULTS RESULTS
A topic modeling analysis identified the following 4 distinct groups based on the topics participants were mainly concerned with: "contacts/communication;" "social environment;" "work;" and "errands/daily routines." Notably, the sentiment analysis revealed that the "contacts/communication" group was characterized by a pronounced negative emotional tone underlying the text reports. This observed heterogeneity in emotional tonality underlying the reported experiences of the first COVID-19-related lockdown is likely to reflect differences in emotional burden, individual circumstances, and ways of coping with the pandemic, which is in line with previous research on this matter.
CONCLUSIONS CONCLUSIONS
This study illustrates the timely and efficient applicability of an NLP pipeline and thereby serves as a precedent for applied researchers. Our study thereby contributes to both the dissemination of NLP techniques in applied health sciences and the identification of previously unknown experiences and burdens of persons with MS during the pandemic, which may be relevant for future treatment.

Identifiants

pubmed: 36252126
pii: v10i11e37945
doi: 10.2196/37945
pmc: PMC9651007
doi:

Types de publication

Journal Article

Langues

eng

Pagination

e37945

Informations de copyright

©Deborah Chiavi, Christina Haag, Andrew Chan, Christian Philipp Kamm, Chloé Sieber, Mina Stanikić, Stephanie Rodgers, Caroline Pot, Jürg Kesselring, Anke Salmen, Irene Rapold, Pasquale Calabrese, Zina-Mary Manjaly, Claudio Gobbi, Chiara Zecca, Sebastian Walther, Katharina Stegmayer, Robert Hoepner, Milo Puhan, Viktor von Wyl. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 10.11.2022.

Références

JMIR Med Inform. 2020 Dec 15;8(12):e22982
pubmed: 33320104
JMIR Med Inform. 2019 Apr 27;7(2):e12239
pubmed: 31066697
J Affect Disord. 2014 Mar;156:236-9
pubmed: 24480380
Source Code Biol Med. 2011 Apr 07;6:7
pubmed: 21473782
Neurol Ther. 2021 Jun;10(1):279-291
pubmed: 33855692
Mult Scler Relat Disord. 2020 Jul;42:102148
pubmed: 32371376
Nat Commun. 2020 Sep 10;11(1):4525
pubmed: 32913209
J Med Internet Res. 2021 May 4;23(5):e15708
pubmed: 33944788
J Health Serv Res Policy. 2021 Jul;26(3):189-197
pubmed: 33337256
J Am Med Inform Assoc. 2019 Apr 1;26(4):364-379
pubmed: 30726935
IEEE Signal Process Mag. 2010 Nov 1;27(6):55-65
pubmed: 25104898
J Med Internet Res. 2018 Nov 12;20(11):e11141
pubmed: 30425030
J Med Internet Res. 2021 Nov 9;23(11):e28946
pubmed: 34751659
Int J Med Inform. 2019 May;125:37-46
pubmed: 30914179
Front Psychiatry. 2021 Feb 22;11:588275
pubmed: 33692703
J Med Internet Res. 2020 Sep 15;22(9):e19133
pubmed: 32866108
Swiss Med Wkly. 2018 May 16;148:w14623
pubmed: 29767828
Mult Scler Relat Disord. 2020 Nov;46:102512
pubmed: 32977074
Front Neurol. 2019 Sep 04;10:953
pubmed: 31555205
JMIR Med Inform. 2020 Mar 31;8(3):e17984
pubmed: 32229465
Mult Scler Relat Disord. 2020 Nov;46:102580
pubmed: 33296977
BMC Neurol. 2018 Aug 13;18(1):111
pubmed: 30103695
Eur J Neurol. 2021 Oct;28(10):3403-3410
pubmed: 33896086
BMC Med Inform Decis Mak. 2020 May 27;20(1):97
pubmed: 32460734
Psychother Psychosom Med Psychol. 2006 Feb;56(2):42-8
pubmed: 16453241
Int J Environ Res Public Health. 2021 Apr 14;18(8):
pubmed: 33919974

Auteurs

Deborah Chiavi (D)

Institute for Implementation Science in Health Care, University of Zurich, Zurich, Switzerland.

Christina Haag (C)

Institute for Implementation Science in Health Care, University of Zurich, Zurich, Switzerland.
Epidemiology, Biostatistics and Prevention Institute, University of Zurich, Zurich, Switzerland.

Andrew Chan (A)

Department of Neurology, Inselspital, Bern University Hospital and University of Bern, Bern, Switzerland.

Christian Philipp Kamm (CP)

Department of Neurology, Inselspital, Bern University Hospital and University of Bern, Bern, Switzerland.
Neurocenter, Lucerne Cantonal Hospital, Lucerne, Switzerland.

Chloé Sieber (C)

Institute for Implementation Science in Health Care, University of Zurich, Zurich, Switzerland.
Epidemiology, Biostatistics and Prevention Institute, University of Zurich, Zurich, Switzerland.

Mina Stanikić (M)

Institute for Implementation Science in Health Care, University of Zurich, Zurich, Switzerland.
Epidemiology, Biostatistics and Prevention Institute, University of Zurich, Zurich, Switzerland.

Stephanie Rodgers (S)

Institute for Implementation Science in Health Care, University of Zurich, Zurich, Switzerland.

Caroline Pot (C)

Service of Neurology, Department of Clinical Neurosciences, Lausanne University Hospital and University of Lausanne, Lausanne, Switzerland.

Jürg Kesselring (J)

Department of Neurology and Neurorehabilitation, Rehabilitation Centre Kliniken Valens, Valens, Switzerland.

Anke Salmen (A)

Department of Neurology, Inselspital, Bern University Hospital and University of Bern, Bern, Switzerland.

Irene Rapold (I)

Institute for Implementation Science in Health Care, University of Zurich, Zurich, Switzerland.

Pasquale Calabrese (P)

Division of Molecular and Cognitive Neuroscience, University of Basel, Basel, Switzerland.

Zina-Mary Manjaly (ZM)

Department of Neurology, Schulthess Klinik, Zurich, Switzerland.
Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland.

Claudio Gobbi (C)

Multiple Sclerosis Center, Department of Neurology, Neurocenter of Southern Switzerland, Ente Ospedaliero Cantonale, Lugano, Switzerland.
Faculty of Biomedical Sciences, Università della Svizzera Italiana (USI), Lugano, Switzerland.

Chiara Zecca (C)

Multiple Sclerosis Center, Department of Neurology, Neurocenter of Southern Switzerland, Ente Ospedaliero Cantonale, Lugano, Switzerland.
Faculty of Biomedical Sciences, Università della Svizzera Italiana (USI), Lugano, Switzerland.

Sebastian Walther (S)

Translational Research Center, University Hospital of Psychiatry and Psychotherapy, University of Bern, Bern, Switzerland.

Katharina Stegmayer (K)

Translational Research Center, University Hospital of Psychiatry and Psychotherapy, University of Bern, Bern, Switzerland.

Robert Hoepner (R)

Department of Neurology, Inselspital, Bern University Hospital and University of Bern, Bern, Switzerland.

Milo Puhan (M)

Institute for Implementation Science in Health Care, University of Zurich, Zurich, Switzerland.

Viktor von Wyl (V)

Institute for Implementation Science in Health Care, University of Zurich, Zurich, Switzerland.
Epidemiology, Biostatistics and Prevention Institute, University of Zurich, Zurich, Switzerland.

Classifications MeSH