Supervised Machine Learning Algorithms for Fitness-Based Cardiometabolic Risk Classification in Adolescents

Yáñez-Sepúlveda, R; Olivares R.; Olivares P.; Zavala-Crichton J.P.; Hinojosa-Torres C.; Giakoni-Ramirez F.; de Souza-Lima J.; Monsalves-Álvarez M.; Tuesta, M.; Páez-Herrera, J; Olivares-Arancibia J.; Reyes-Amigo T.; Cortés-Roco, G; Hurtado-Almonacid J.; Guzmán-Muñoz, E; et. al.

Keywords: health, adolescent, predictive modeling, physical fitness, Gradient boosting

Abstract

Background: Cardiometabolic risk in adolescents represents a growing public health concern that is closely linked to modifiable factors such as physical fitness. Traditional statistical approaches often fail to capture complex, nonlinear relationships among anthropometric and fitness-related variables. Objective: To develop and evaluate supervised machine learning algorithms, including artificial neural networks and ensemble methods, for classifying cardiometabolic risk levels among Chilean adolescents based on standardized physical fitness assessments. Methods: A cross-sectional analysis was conducted using a large representative sample of school-aged adolescents. Field-based physical fitness tests, such as cardiorespiratory fitness (in terms of estimated maximal oxygen consumption [VO2max]), muscular strength (push-ups), and explosive power (horizontal jump) testing, were used as input variables. A cardiometabolic risk index was derived using international criteria. Various supervised machine learning models were trained and compared regarding accuracy, F1 score, recall, and area under the receiver operating characteristic curve (AUC-ROC). Results: Among all the models tested, the gradient boosting classifier achieved the best overall performance, with an accuracy of 77.0%, an F1 score of 67.3%, and the highest AUC-ROC (0.601). These results indicate a strong balance between sensitivity and specificity in classifying adolescents at cardiometabolic risk. Horizontal jumps and push-ups emerged as the most influential predictive variables. Conclusions: Gradient boosting proved to be the most effective model for predicting cardiometabolic risk based on physical fitness data. This approach offers a practical, data-driven tool for early risk detection in adolescent populations and may support scalable screening efforts in educational and clinical settings. © 2025 by the authors.

Más información

Título según WOS: Supervised Machine Learning Algorithms for Fitness-Based Cardiometabolic Risk Classification in Adolescents
Título según SCOPUS: Supervised Machine Learning Algorithms for Fitness-Based Cardiometabolic Risk Classification in Adolescents
Título de la Revista: Sports
Volumen: 13
Número: 8
Editorial: Multidisciplinary Digital Publishing Institute (MDPI)
Fecha de publicación: 2025
Idioma: English
DOI:

10.3390/sports13080273

Notas: ISI, SCOPUS