Error-rate estimation in discriminant analysis of non-linear longitudinal data: A comparison of resampling methods
Abstract
Consider longitudinal observations across different subjects such that the underlying distribution is determined by a non-linear mixed-effects model. In this context, we look at the misclassification error rate for allocating future subjects using cross-validation, bootstrap algorithms (parametric bootstrap, leave-one-out, .632 and .632+), and bootstrap cross-validation (which combines the first two approaches), and conduct a numerical study to compare the performance of the different methods. The simulation and comparisons in this study are motivated by real observations from a pregnancy study in which one of the main objectives is to predict normal versus abnormal pregnancy outcomes based on information gathered at early stages. Since in this type of studies it is not uncommon to have insufficient data to simultaneously solve the classification problem and estimate the misclassification error rate, we put special attention to situations when only a small sample size is available. We discuss how the misclassification error rate estimates may be affected by the sample size in terms of variability and bias, and examine conditions under which the misclassification error rate estimates perform reasonably well.
Más información
Título de la Revista: | Statistical Methods in Medical Research |
Volumen: | 27 |
Número: | 4 |
Fecha de publicación: | 2018 |
Página de inicio: | 1153 |
Página final: | 1167 |
Idioma: | English |
DOI: |
10.1177/0962280216656246 |
Notas: | WOS |