Neural Abstractive Unsupervised Summarization of Online News Discussions

Tampe, Ignacio

Abstract

Summarization has usually relied on gold-standard summaries to train extractive or abstractive models. Social media poses a hurdle for summarization techniques because it requires a multi-document, multi-author approach. We address this challenging task by introducing a novel method that generates abstractive summaries of online news discussions. Our method extends a BERT-based architecture with an attention encoding that is fed the comments' likes during the training stage. To train our model, we define a task that consists of reconstructing high-impact comments, where impact is measured by popularity (likes). Accordingly, our model learns to summarize online discussions based on their most relevant comments. Our approach provides a summary that represents the most relevant aspects of a news item that users comment on, incorporating the social context as a source of information for summarizing texts in online social networks. Our model is evaluated using ROUGE scores between the generated summary and each comment in the thread. Under this evaluation, our model, including the social attention encoding, significantly outperforms both extractive and abstractive summarization methods.
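The evaluation described in the abstract, scoring the generated summary against each comment in the thread, can be illustrated with a minimal sketch. The abstract does not state which ROUGE variant is used or how per-comment scores are aggregated, so the unigram ROUGE-1 F1, the whitespace tokenization, and the simple mean below are assumptions, and the function names `rouge1_f1` and `mean_rouge1_over_thread` are hypothetical.

```python
from collections import Counter
from typing import Sequence


def rouge1_f1(candidate: str, reference: str) -> float:
    """Unigram-overlap F1 between two texts (simplified ROUGE-1).

    Assumption: lowercased whitespace tokens, no stemming or stopword removal.
    """
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    # Clipped overlap: each token counts at most as often as it appears in both texts.
    overlap = sum((cand & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


def mean_rouge1_over_thread(summary: str, comments: Sequence[str]) -> float:
    """Average the summary's ROUGE-1 F1 against every comment in a thread.

    Assumption: a plain mean; the paper may aggregate differently.
    """
    if not comments:
        return 0.0
    return sum(rouge1_f1(summary, c) for c in comments) / len(comments)
```

A call such as `mean_rouge1_over_thread(summary, thread_comments)` yields one score per thread, which could then be compared across summarization methods.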

More information

Title according to SCOPUS: Neural Abstractive Unsupervised Summarization of Online News Discussions
Journal title: Lecture Notes in Networks and Systems
Volume: 295
Publisher: Springer Science and Business Media Deutschland GmbH
Publication date: 2022
Start/end date: 2 September 2021 through 3 September 2021
First page: 822
Last page: 841
Language: English
DOI: 10.1007/978-3-030-82196-8_60

Notes: SCOPUS