Element selection and concentration analysis for classifying South America wine samples according to the country of origin

Soares, Felipe; Anzanello, Michel J.; Fogliatto, Flavio S.; Marcelo, Marcelo C. A.; Ferrao, Marco F.; Manfroi, Vitor; Pozebon, Dirce

Abstract

This paper proposes a feature selection approach for classifying wine samples according to place of origin. The method relies on the Kruskal-Wallis non-parametric test to remove non-significant features, and on Linear Discriminant Analysis to derive a feature importance index. Features ranked by that index are added iteratively, and classification performance is assessed after each insertion. The number of selected features is the one that maximizes accuracy in a repeated 10-fold cross-validation. To improve categorization accuracy, different classification techniques are tested. When applied to a wine dataset comprising 53 samples from four South American countries (Argentina, Brazil, Chile, and Uruguay) and the concentrations of 45 chemical elements determined by ICP-OES and ICP-MS, the proposed framework yielded on average 99.9% accurate classifications in the testing set and retained on average 6.73 of the 45 original elements. The retained chemical elements were then qualitatively assessed.
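
The filter-then-rank-then-add procedure described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation. The function names (`kruskal_h`, `filter_features`, `forward_select`), the toy data, and the use of a fixed critical value in place of a p-value are all assumptions made for the example. The LDA-based importance ranking and the repeated 10-fold cross-validation accuracy are not reproduced here: the ranking is passed in as an ordered list and the accuracy estimate as a caller-supplied `score` function.

```python
from itertools import chain


def kruskal_h(groups):
    """Kruskal-Wallis H statistic for a list of samples (one list per class).

    Ties receive their mean rank; the tie correction factor is omitted,
    so H is slightly conservative when many values are tied.
    """
    pooled = sorted(chain.from_iterable(groups))
    n = len(pooled)
    rank_of = {}
    i = 0
    while i < n:
        j = i
        while j < n and pooled[j] == pooled[i]:
            j += 1
        rank_of[pooled[i]] = (i + 1 + j) / 2  # mean of 1-based ranks i+1..j
        i = j
    total = sum(sum(rank_of[x] for x in g) ** 2 / len(g) for g in groups)
    return 12.0 / (n * (n + 1)) * total - 3 * (n + 1)


def filter_features(columns, labels, critical_h):
    """Keep the features whose H statistic exceeds critical_h (assumed threshold)."""
    kept = []
    for name, values in columns.items():
        by_class = {}
        for value, label in zip(values, labels):
            by_class.setdefault(label, []).append(value)
        if kruskal_h(list(by_class.values())) > critical_h:
            kept.append(name)
    return kept


def forward_select(ranked, score):
    """Add features in ranked order and keep the prefix with the highest score.

    `score` is a hypothetical callable standing in for a cross-validated
    accuracy estimate of a classifier trained on the given feature subset.
    """
    best_k, best_score = 0, float("-inf")
    for k in range(1, len(ranked) + 1):
        current = score(ranked[:k])
        if current > best_score:
            best_k, best_score = k, current
    return ranked[:best_k], best_score


# Toy data (invented for illustration): two features, three classes.
labels = ["X"] * 3 + ["Y"] * 3 + ["Z"] * 3
columns = {
    "a": [1, 2, 3, 4, 5, 6, 7, 8, 9],  # separates the classes well
    "b": [5, 1, 9, 2, 8, 3, 7, 4, 6],  # does not
}
# 5.991 is the chi-squared 0.95 quantile for 2 degrees of freedom (3 classes).
kept = filter_features(columns, labels, critical_h=5.991)
```

With four countries, as in the dataset described above, the corresponding chi-squared critical value at the 0.05 level would be 7.815 (3 degrees of freedom). In practice the `score` callable would wrap a classifier and a repeated cross-validation routine from a statistics library.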

More information

WOS ID: WOS:000437079900005
Journal title: COMPUTERS AND ELECTRONICS IN AGRICULTURE
Volume: 150
Publisher: ELSEVIER SCI LTD
Publication date: 2018
Start page: 33
End page: 40
DOI: 10.1016/j.compag.2018.03.027

Notes: ISI