Determining Word-Emotion Associations from Tweets by Multi-Label Classification

Bravo-Marquez, Felipe; Frank, Eibe; Mohammad, Saif M.; Pfahringer, Bernhard; IEEE

Abstract

The automatic detection of emotions in Twitter posts is a challenging task due to the informal nature of the language used on this platform. In this paper, we propose a methodology for expanding the NRC word-emotion association lexicon for the language used in Twitter. We perform this expansion using multi-label classification of words and compare different word-level features extracted from unlabelled tweets, such as unigrams, Brown clusters, POS tags, and word2vec embeddings. The results show that the expanded lexicon achieves major improvements over the original lexicon when classifying tweets into emotional categories. In contrast to previous work, our methodology does not depend on tweets annotated with emotional hashtags, thus enabling the identification of emotional words from any domain-specific collection using unlabelled tweets.

More information

Title according to WOS: not found in local WOS database (ID WOS:000404432100080)
Journal title: 2016 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2016)
Publisher: IEEE
Publication date: 2016
Start page: 536
End page: 539
DOI: 10.1109/WI.2016.90

Notes: ISI