Determining Word-Emotion Associations from Tweets by Multi-Label Classification
Abstract
The automatic detection of emotions in Twitter posts is a challenging task due to the informal nature of the language used in this platform. In this paper, we propose a methodology for expanding the NRC word-emotion association lexicon for the language used in Twitter. We perform this expansion using multi-label classification of words and compare different word-level features extracted from unlabelled tweets such as unigrams, Brown clusters, POS tags, and word2vec embeddings. The results show that the expanded lexicon achieves major improvements over the original lexicon when classifying tweets into emotional categories. In contrast to previous work, our methodology does not depend on tweets annotated with emotional hashtags, thus enabling the identification of emotional words from any domain-specific collection using unlabelled tweets.
Más información
Título según WOS: | ID WOS:000404432100080 Not found in local WOS DB |
Título de la Revista: | 2016 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2016) |
Editorial: | IEEE |
Fecha de publicación: | 2016 |
Página de inicio: | 536 |
Página final: | 539 |
DOI: |
10.1109/WI.2016.90 |
Notas: | ISI |