Study of principal components on classification of problematic wine fermentations

Urtubia U.A.; Perez Correa J.R.

Keywords: cluster, fermentation, component, components, classification, sets, measurement, wine, ability, data, detection, mining, analysis, clustering, techniques, k-means, principal, procedure, Reliable

Abstract

Data mining techniques have already shown useful to classify wine fermentations as problematic. Then, these techniques are a good option for winemakers who currently lack the tools to identify early signs of undesirable fermentation behavior and, therefore, are unable to take possible mitigating actions. In this study we assessed how much the performance of a clustering K-means fermentation classification procedure is affected by the number of principal components (PCs), when principal component analysis (PCA) is previously applied to reduce the dimensionality of the available data. It was observed that three PCs were enough to preserve the overall information of a dataset containing reliable measurements only. In this case, a 40% detection ability of problematic fermentations was achieved. In turn, using a more complete dataset, but containing unreliable measurements, the number of PCs yielded different classifications. Here, 33%f the problematic fermentations were detected. © 2009 Springer Berlin Heidelberg.

Más información

Título de la Revista: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volumen: 5633
Editorial: Society of Laparoendoscopic Surgeons
Fecha de publicación: 2009
Página de inicio: 38
Página final: 43
URL: http://www.scopus.com/inward/record.url?eid=2-s2.0-76249111699&partnerID=q2rCbXpz