The Real-World Experiences of Persons With Multiple Sclerosis During the First COVID-19 Lockdown: Application of Natural Language Processing.
COVID-19
clinical informatics
health data
linguistic inquiry
medical informatics
multiple sclerosis
natural language processing
nervous system disease
nervous system disorder
patient data
sentiment analysis
textual data
topic modeling
Journal
JMIR medical informatics
ISSN: 2291-9694
Titre abrégé: JMIR Med Inform
Pays: Canada
ID NLM: 101645109
Informations de publication
Date de publication:
10 Nov 2022
10 Nov 2022
Historique:
received:
17
03
2022
accepted:
02
10
2022
pubmed:
18
10
2022
medline:
18
10
2022
entrez:
17
10
2022
Statut:
epublish
Résumé
The increasing availability of "real-world" data in the form of written text holds promise for deepening our understanding of societal and health-related challenges. Textual data constitute a rich source of information, allowing the capture of lived experiences through a broad range of different sources of information (eg, content and emotional tone). Interviews are the "gold standard" for gaining qualitative insights into individual experiences and perspectives. However, conducting interviews on a large scale is not always feasible, and standardized quantitative assessment suitable for large-scale application may miss important information. Surveys that include open-text assessments can combine the advantages of both methods and are well suited for the application of natural language processing (NLP) methods. While innovations in NLP have made large-scale text analysis more accessible, the analysis of real-world textual data is still complex and requires several consecutive steps. We developed and subsequently examined the utility and scientific value of an NLP pipeline for extracting real-world experiences from textual data to provide guidance for applied researchers. We applied the NLP pipeline to large-scale textual data collected by the Swiss Multiple Sclerosis (MS) registry. Such textual data constitute an ideal use case for the study of real-world text data. Specifically, we examined 639 text reports on the experienced impact of the first COVID-19 lockdown from the perspectives of persons with MS. The pipeline has been implemented in Python and complemented by analyses of the "Linguistic Inquiry and Word Count" software. It consists of the following 5 interconnected analysis steps: (1) text preprocessing; (2) sentiment analysis; (3) descriptive text analysis; (4) unsupervised learning-topic modeling; and (5) results interpretation and validation. A topic modeling analysis identified the following 4 distinct groups based on the topics participants were mainly concerned with: "contacts/communication;" "social environment;" "work;" and "errands/daily routines." Notably, the sentiment analysis revealed that the "contacts/communication" group was characterized by a pronounced negative emotional tone underlying the text reports. This observed heterogeneity in emotional tonality underlying the reported experiences of the first COVID-19-related lockdown is likely to reflect differences in emotional burden, individual circumstances, and ways of coping with the pandemic, which is in line with previous research on this matter. This study illustrates the timely and efficient applicability of an NLP pipeline and thereby serves as a precedent for applied researchers. Our study thereby contributes to both the dissemination of NLP techniques in applied health sciences and the identification of previously unknown experiences and burdens of persons with MS during the pandemic, which may be relevant for future treatment.
Sections du résumé
BACKGROUND
BACKGROUND
The increasing availability of "real-world" data in the form of written text holds promise for deepening our understanding of societal and health-related challenges. Textual data constitute a rich source of information, allowing the capture of lived experiences through a broad range of different sources of information (eg, content and emotional tone). Interviews are the "gold standard" for gaining qualitative insights into individual experiences and perspectives. However, conducting interviews on a large scale is not always feasible, and standardized quantitative assessment suitable for large-scale application may miss important information. Surveys that include open-text assessments can combine the advantages of both methods and are well suited for the application of natural language processing (NLP) methods. While innovations in NLP have made large-scale text analysis more accessible, the analysis of real-world textual data is still complex and requires several consecutive steps.
OBJECTIVE
OBJECTIVE
We developed and subsequently examined the utility and scientific value of an NLP pipeline for extracting real-world experiences from textual data to provide guidance for applied researchers.
METHODS
METHODS
We applied the NLP pipeline to large-scale textual data collected by the Swiss Multiple Sclerosis (MS) registry. Such textual data constitute an ideal use case for the study of real-world text data. Specifically, we examined 639 text reports on the experienced impact of the first COVID-19 lockdown from the perspectives of persons with MS. The pipeline has been implemented in Python and complemented by analyses of the "Linguistic Inquiry and Word Count" software. It consists of the following 5 interconnected analysis steps: (1) text preprocessing; (2) sentiment analysis; (3) descriptive text analysis; (4) unsupervised learning-topic modeling; and (5) results interpretation and validation.
RESULTS
RESULTS
A topic modeling analysis identified the following 4 distinct groups based on the topics participants were mainly concerned with: "contacts/communication;" "social environment;" "work;" and "errands/daily routines." Notably, the sentiment analysis revealed that the "contacts/communication" group was characterized by a pronounced negative emotional tone underlying the text reports. This observed heterogeneity in emotional tonality underlying the reported experiences of the first COVID-19-related lockdown is likely to reflect differences in emotional burden, individual circumstances, and ways of coping with the pandemic, which is in line with previous research on this matter.
CONCLUSIONS
CONCLUSIONS
This study illustrates the timely and efficient applicability of an NLP pipeline and thereby serves as a precedent for applied researchers. Our study thereby contributes to both the dissemination of NLP techniques in applied health sciences and the identification of previously unknown experiences and burdens of persons with MS during the pandemic, which may be relevant for future treatment.
Identifiants
pubmed: 36252126
pii: v10i11e37945
doi: 10.2196/37945
pmc: PMC9651007
doi:
Types de publication
Journal Article
Langues
eng
Pagination
e37945Informations de copyright
©Deborah Chiavi, Christina Haag, Andrew Chan, Christian Philipp Kamm, Chloé Sieber, Mina Stanikić, Stephanie Rodgers, Caroline Pot, Jürg Kesselring, Anke Salmen, Irene Rapold, Pasquale Calabrese, Zina-Mary Manjaly, Claudio Gobbi, Chiara Zecca, Sebastian Walther, Katharina Stegmayer, Robert Hoepner, Milo Puhan, Viktor von Wyl. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 10.11.2022.
Références
JMIR Med Inform. 2020 Dec 15;8(12):e22982
pubmed: 33320104
JMIR Med Inform. 2019 Apr 27;7(2):e12239
pubmed: 31066697
J Affect Disord. 2014 Mar;156:236-9
pubmed: 24480380
Source Code Biol Med. 2011 Apr 07;6:7
pubmed: 21473782
Neurol Ther. 2021 Jun;10(1):279-291
pubmed: 33855692
Mult Scler Relat Disord. 2020 Jul;42:102148
pubmed: 32371376
Nat Commun. 2020 Sep 10;11(1):4525
pubmed: 32913209
J Med Internet Res. 2021 May 4;23(5):e15708
pubmed: 33944788
J Health Serv Res Policy. 2021 Jul;26(3):189-197
pubmed: 33337256
J Am Med Inform Assoc. 2019 Apr 1;26(4):364-379
pubmed: 30726935
IEEE Signal Process Mag. 2010 Nov 1;27(6):55-65
pubmed: 25104898
J Med Internet Res. 2018 Nov 12;20(11):e11141
pubmed: 30425030
J Med Internet Res. 2021 Nov 9;23(11):e28946
pubmed: 34751659
Int J Med Inform. 2019 May;125:37-46
pubmed: 30914179
Front Psychiatry. 2021 Feb 22;11:588275
pubmed: 33692703
J Med Internet Res. 2020 Sep 15;22(9):e19133
pubmed: 32866108
Swiss Med Wkly. 2018 May 16;148:w14623
pubmed: 29767828
Mult Scler Relat Disord. 2020 Nov;46:102512
pubmed: 32977074
Front Neurol. 2019 Sep 04;10:953
pubmed: 31555205
JMIR Med Inform. 2020 Mar 31;8(3):e17984
pubmed: 32229465
Mult Scler Relat Disord. 2020 Nov;46:102580
pubmed: 33296977
BMC Neurol. 2018 Aug 13;18(1):111
pubmed: 30103695
Eur J Neurol. 2021 Oct;28(10):3403-3410
pubmed: 33896086
BMC Med Inform Decis Mak. 2020 May 27;20(1):97
pubmed: 32460734
Psychother Psychosom Med Psychol. 2006 Feb;56(2):42-8
pubmed: 16453241
Int J Environ Res Public Health. 2021 Apr 14;18(8):
pubmed: 33919974