Error-rate estimation in discriminant analysis of non-linear longitudinal data: A comparison of resampling methods

de la Cruz, Rolando; FUENTES, CLAUDIO; Meza, Cristian; Núñez-Antón, Vicente

Abstract

Consider longitudinal observations across different subjects such that the underlying distribution is determined by a non-linear mixed-effects model. In this context, we look at the misclassification error rate for allocating future subjects using cross-validation, bootstrap algorithms (parametric bootstrap, leave-one-out, .632 and .632+), and bootstrap cross-validation (which combines the first two approaches), and conduct a numerical study to compare the performance of the different methods. The simulation and comparisons in this study are motivated by real observations from a pregnancy study in which one of the main objectives is to predict normal versus abnormal pregnancy outcomes based on information gathered at early stages. Since in this type of studies it is not uncommon to have insufficient data to simultaneously solve the classification problem and estimate the misclassification error rate, we put special attention to situations when only a small sample size is available. We discuss how the misclassification error rate estimates may be affected by the sample size in terms of variability and bias, and examine conditions under which the misclassification error rate estimates perform reasonably well.

Más información

Título de la Revista: Statistical Methods in Medical Research
Volumen: 27
Número: 4
Fecha de publicación: 2018
Página de inicio: 1153
Página final: 1167
Idioma: English
DOI:

10.1177/0962280216656246

Notas: WOS