Identification of SNPs and Genes Associated with Quantitative Oenological Traits in Saccharomyces cerevisiae Using Regression and Machine Learning Models

Riquelme-Zamora, Catalina; Concha-Toro, Camila; Perez-Munoz, Hector; I. Kessi-Perez, Eduardo; Villalobos-Cid, Manuel

Abstract

Saccharomyces cerevisiae is a model organism with key industrial relevance in wine fermentation and other biotechnological processes. Identifying single-nucleotide polymorphisms (SNPs) linked to quantitative traits is essential for understanding the genetic basis of strain performance. Most previous studies have frequently focused on lineage-based comparisons, limiting the resolution of genotype-phenotype associations. To deal with this constraint, we applied regression and machine-learning models to characterise SNPs and genes associated with continuous oenological traits in wine strains. Four phenotypes were evaluated: proliferation efficiency, proliferation rate, lag phase, and area under the growth curve. Genome-wide SNP data were encoded and used to train predictive models based on Ridge regression, Support Vector Machines, K-nearest Neighbours, and Random Forest. Model performance was evaluated through cross-validation, and key SNPs were prioritised using feature-importance metrics. Functional enrichment of the associated genes was performed using curated biological databases to provide a mechanistic context. This integrative approach identified genomic regions consistently linked to fermentation traits and revealed candidate genes involved in stress adaptation and metabolic regulation. The results refine genotype-phenotype associations in wine yeasts and establish a reproducible framework for future experimental validation of functionally relevant variants.

Más información

Título según WOS: ID WOS:001667812500001 Not found in local WOS DB
Título de la Revista: JOURNAL OF THE AMERICAN SOCIETY OF BREWING CHEMISTS
Editorial: TAYLOR & FRANCIS LTD
Fecha de publicación: 2025
DOI:

10.1080/03610470.2025.2595368

Notas: ISI