Quality matters: Artificial intelligence-based assessment of feedback quality predicts technical skill improvement
Abstract
Introduction: Consistently delivering high-quality feedback is critical, yet challenging, in surgical education. We hypothesized that feedback quality, as assessed by artificial intelligence (AI, Teach1) based-tools, would correlate with improvements in technical skill acquisition during simulated paracentesis training. Methods: Seventy-five fourth-year medical students meeting inclusion criteria (baseline Objective Structured Assessment of Technical Skills scores <= 20) performed paracentesis on simulators, received instructor feedback via a digital platform, and completed a second attempt. Teach1, developed using OpenAI Vector Embeddings and Support Vector Machine classification (area under the receiver operating characteristic curve, 0.86-0.98), evaluated 688 feedback comments on the basis of 5 established quality criteria: direct observation, skill specificity, reinforcement of strengths, targeted improvement suggestions, and actionable plans. Changes in Objective Structured Assessment of Technical Skills (Delta Objective Structured Assessment of Technical Skills) scores were analyzed in relation to feedback quality percentages. Results: Objective Structured Assessment of Technical Skills scores improved from 15.76 (+/- 3.15) to 21.68 (+/- 2.30) postfeedback. Feedback quality percentages positively correlated with performance improvement (Delta Objective Structured Assessment of Technical Skills) (r = 0.309, P < .001), with each 10% increase in feedback quality percentages corresponding to a 1.16-point improvement in Objective Structured Assessment of Technical Skills scores (beta = 11.61, P < .001). Reinforcement of strengths and suggesting targeted improvements showed the strongest correlations with skill gains. Artificial intelligence assessments demonstrated substantial agreement with expert evaluations (kappa = 0.78). Conclusion: Artificial intelligence-driven assessment of feedback quality significantly correlates with technical skill improvement in a simulated procedural setting. By enhancing feedback quality, Teach1 taps into one of the strongest drivers of skill development in education, offering a scalable method to standardize feedback quality across surgical training and potentially improve patient outcomes. (c) 2025 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
Más información
Título según WOS: | ID WOS:001564377000001 Not found in local WOS DB |
Título de la Revista: | SURGERY |
Volumen: | 187 |
Editorial: | MOSBY-ELSEVIER |
Fecha de publicación: | 2025 |
DOI: |
10.1016/j.surg.2025.109625 |
Notas: | ISI |