Efficiently Finding Cyclical Patterns on Twitter Considering the Inherent Spatio-temporal Attributes of Data

Gutierrez-Soto, Claudio; Galdames, Patricio; Navea, Daniel

Keywords: Cyclical Patterns, Spatio-temporal Data, Social Networks, Algorithms

Abstract

Social networks such as Twitter provide thousands of terabytes per day, which can be exploited to find relevant information. This relevant information is used to promote marketing strategies, analyze current political issues, and track market trends, to name a few examples. One instance of relevant information is finding cyclic behavior patterns (i.e., patterns that frequently repeat themselves over time) in the population. Because trending topics on Twitter change rapidly, efficient algorithms are required, especially when considering location and time (i.e., the specific location and time) during broadcasts. This article presents an efficient algorithm based on association rules to find cyclical patterns on Twitter, considering the inherent spatio-temporal attributes of data. Using a Hash Table enhances the efficiency of this algorithm, called HashCycle. Notably, HashCycle does not use minimum support and can detect patterns in a single run over a sequence. The processing times of HashCycle were compared to the Apriori (which is a well-known and widely used on diverse platforms) and Projection-based Partial Periodic Patterns (PPA) algorithms (which is one of the most efficient algorithms in terms of processing times). Empirical results from two spatio-temporal databases (a synthetic data set and one based on Twitter) show that HashCycle has more efficient processing times than two state-of-the-art algorithms: Apriori and PPA.

Más información

Título de la Revista: JUCS - Journal of Universal Computer Science
Volumen: 29
Número: 11
Editorial: 10.3897/jucs.112523
Fecha de publicación: 2023
Página de inicio: 1404
Página final: 1421
Idioma: Ingles
URL: https://doi.org/10.3897/jucs.112523
Notas: Web of Science Core Collection: Science Citation Index Expanded Additional Web of Science Indexes: Current Contents Engineering, Computing & Technology | Essential Science Indicators