Revisiting Machine Learning from Crowds a Mixture Model for Grouping Annotations
Abstract
Today, supervised learning is widely used for pattern recognition, computer vision and other tasks. In this setting, data need to be explicitly annotated. Unfortunately, obtaining accurate labels can be difficult, expensive and time-consuming. As a result, many machine learning projects rely on labelling processes that involve crowds, i.e. multiple subjective and inexpert annotators. Handling this noise in a principled way is an important challenge for machine learning, called learning from crowds. In this paper, we present a model that learns patterns of label noise by grouping annotations. In contrast to previous art, we do not model specific labeling patterns for each annotator but explain the data using a fixed-size mixture model. This approach allows to handle a sparse distribution of labels among annotators and obtain a model with less parameters that can scale better to large-scale scenarios. Experiments on real and simulated data illustrate the advantages of our approach.
Más información
Título según WOS: | Revisiting Machine Learning from Crowds a Mixture Model for Grouping Annotations |
Título según SCOPUS: | Revisiting Machine Learning from Crowds a Mixture Model for Grouping Annotations |
Título de la Revista: | Lecture Notes in Computer Science |
Volumen: | 11896 |
Editorial: | Springer, Cham |
Fecha de publicación: | 2019 |
Página de inicio: | 493 |
Página final: | 503 |
DOI: |
10.1007/978-3-030-33904-3_46 |
Notas: | ISI, SCOPUS |