Overrating Classifier Performance in ROC Analysis in the Absence of a Test Set: Evidence from Simulation and Italian CARATkids Validation.


Journal

Methods of information in medicine
ISSN: 2511-705X
Titre abrégé: Methods Inf Med
Pays: Germany
ID NLM: 0210453

Informations de publication

Date de publication:
12 2019
Historique:
pubmed: 21 11 2019
medline: 13 6 2020
entrez: 21 11 2019
Statut: ppublish

Résumé

The use of receiver operating characteristic curves, or "ROC analysis," has become quite common in biomedical research to support decisions. However, sensitivity, specificity, and misclassification rates are still often estimated using the training sample, overlooking the risk of overrating the test performance. A simulation study was performed to highlight the inferential implications of splitting (or not) the dataset into training and test set. The normality assumption was made for the classifier given the disease status, and the Youden's criterion considered for the detection of the optimal cutoff. Then, an ROC analysis with sample split was applied to assess the discriminant validity of the Italian version of the Control of Allergic Rhinitis and Asthma Test (CARATkids) questionnaire for children with asthma and rhinitis, for which recent studies may have reported liberal performance estimates. The simulation study showed that both single split and cross-validation (CV) provided unbiased estimators of sensitivity, specificity, and misclassification rate, therefore allowing computation of confidence intervals. For the Italian CARATkids questionnaire, the misclassification rate estimated by fivefold CV was 0.22, with 95% confidence interval 0.14 to 0.30, indicating an acceptable discriminant validity. Splitting into training and test set avoids overrating the test performance in ROC analysis. Validated through this method, the Italian CARATkids is valid for assessing disease control in children with asthma and rhinitis.

Sections du résumé

BACKGROUND
The use of receiver operating characteristic curves, or "ROC analysis," has become quite common in biomedical research to support decisions. However, sensitivity, specificity, and misclassification rates are still often estimated using the training sample, overlooking the risk of overrating the test performance.
METHODS
A simulation study was performed to highlight the inferential implications of splitting (or not) the dataset into training and test set. The normality assumption was made for the classifier given the disease status, and the Youden's criterion considered for the detection of the optimal cutoff. Then, an ROC analysis with sample split was applied to assess the discriminant validity of the Italian version of the Control of Allergic Rhinitis and Asthma Test (CARATkids) questionnaire for children with asthma and rhinitis, for which recent studies may have reported liberal performance estimates.
RESULTS
The simulation study showed that both single split and cross-validation (CV) provided unbiased estimators of sensitivity, specificity, and misclassification rate, therefore allowing computation of confidence intervals. For the Italian CARATkids questionnaire, the misclassification rate estimated by fivefold CV was 0.22, with 95% confidence interval 0.14 to 0.30, indicating an acceptable discriminant validity.
CONCLUSIONS
Splitting into training and test set avoids overrating the test performance in ROC analysis. Validated through this method, the Italian CARATkids is valid for assessing disease control in children with asthma and rhinitis.

Identifiants

pubmed: 31746447
doi: 10.1055/s-0039-1693732
doi:

Types de publication

Journal Article Validation Study

Langues

eng

Sous-ensembles de citation

IM

Pagination

e27-e42

Informations de copyright

Georg Thieme Verlag KG Stuttgart · New York.

Déclaration de conflit d'intérêts

None declared.

Auteurs

Giovanna Cilluffo (G)

Institute for Biomedical Research and Innovation, National Research Council of Italy, Palermo, Italy.
Department of Economical, Business and Statistical Science, University of Palermo, Palermo, Italy.

Salvatore Fasola (S)

Institute for Biomedical Research and Innovation, National Research Council of Italy, Palermo, Italy.
Department of Economical, Business and Statistical Science, University of Palermo, Palermo, Italy.

Giuliana Ferrante (G)

Department of Health Promotion Sciences, Maternal and Infant Care, Internal Medicine and Medical Specialities, University of Palermo, Italy.

Laura Montalbano (L)

Institute for Biomedical Research and Innovation, National Research Council of Italy, Palermo, Italy.

Ilaria Baiardini (I)

Department of Biomedical Sciences, Humanitas University, Milan, Italy.

Luciana Indinnimeo (L)

Department of Pediatrics and NPI, University of Roma Sapienza, Rome, Italy.

Giovanni Viegi (G)

Institute for Biomedical Research and Innovation, National Research Council of Italy, Palermo, Italy.
Institute of Clinical Physiology, Pulmonary Environmental Epidemiology Unit, National Research Council of Italy, Pisa, Italy.

Joao A Fonseca (JA)

Department of Immunoallergy, CUF Porto Hospital and Institute, Porto, Portugal.

Stefania La Grutta (S)

Institute for Biomedical Research and Innovation, National Research Council of Italy, Palermo, Italy.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH