CT4RDD: Classification trees for research on digital divide

Coria, S.R.; Mondragon-Becerra, R.; Pérez-Meza, M.; Ramírez-Vásquez, S.K.; Martínez-Peláez, R.; Barragán-López, D.; Ávila-Barrón, O.R.

Keywords: Digital divide analysis, Digital divide modeling, Digital divide measurement, Census data mining, C4.5 algorithm, J4.8 algorithm

Abstract

This paper presents CT4RDD (classification trees for research on digital divide), a novel methodology for the quantitative analysis and modeling of the digital divide within a single country. It draws on Quinlan’s well-known C4.5 algorithm for automatically producing classification trees, as implemented in Witten & Frank’s WEKA software toolkit. The methodology is developed and evaluated on data from the 2010 Mexican Population and Housing Census, which include a number of variables whose interactions capture aspects of the phenomenon; in particular, interactions between the presence of Internet service in households and features such as educational and economic levels, gender, age, housing characteristics, and the proportion of indigenous population. The percentage of households with Internet service in each municipality is discretized into a nominal target attribute for producing classification trees. Results suggest that the methodology can produce quantitative profiles that describe similarities and differences among classes of municipalities with different percentages of households with Internet service. The discovered profiles can give scholars, government officials, and enterprise managers insight for research, planning, and decision making.
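
The two technical steps named in the abstract, discretizing a percentage into a nominal target and choosing splits in the style of C4.5, can be illustrated with a minimal sketch. It is not the authors' implementation and does not use WEKA. The equal-width binning, the number of bins, the function names, and the toy municipality values are all assumptions made for illustration; the abstract does not state which discretization scheme or attributes were used. The only algorithmic fact relied on is that C4.5 ranks candidate splits by gain ratio (information gain divided by split information).

```python
import math
from collections import Counter


def discretize(pct, n_bins=4):
    """Map a percentage in [0, 100] to a nominal label using equal-width bins.

    Equal-width binning is an assumption; the abstract does not name a scheme.
    """
    width = 100.0 / n_bins
    idx = min(int(pct // width), n_bins - 1)  # keep 100.0 in the top bin
    return f"bin{idx}"


def entropy(labels):
    """Shannon entropy (in bits) of a list of nominal labels."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total)
                for c in Counter(labels).values())


def gain_ratio(values, labels, threshold):
    """C4.5-style gain ratio for the binary split `value <= threshold`."""
    left = [lab for v, lab in zip(values, labels) if v <= threshold]
    right = [lab for v, lab in zip(values, labels) if v > threshold]
    if not left or not right:
        return 0.0  # a split that separates nothing carries no information
    n = len(labels)
    weights = (len(left) / n, len(right) / n)
    info_gain = (entropy(labels)
                 - weights[0] * entropy(left)
                 - weights[1] * entropy(right))
    split_info = -sum(w * math.log2(w) for w in weights)
    return info_gain / split_info


# Hypothetical toy data: one value per municipality (not census figures).
internet_pct = [2.0, 5.0, 30.0, 45.0, 60.0, 80.0]
schooling_years = [4.0, 5.0, 7.0, 8.0, 10.0, 12.0]

# Nominal target attribute derived from the percentages.
classes = [discretize(p) for p in internet_pct]

# Score one candidate split on a hypothetical predictor.
score = gain_ratio(schooling_years, classes, threshold=5.0)
```

A full tree learner would repeat the scoring over every attribute and candidate threshold, pick the best split, and recurse on each branch; C4.5 additionally prunes the resulting tree.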

More information

Journal title: EXPERT SYSTEMS WITH APPLICATIONS
Volume: 40
Issue: 14
Publisher: PERGAMON-ELSEVIER SCIENCE LTD
Publication date: 2013
First page: 5779
Last page: 5786
Language: English
URL: https://www.sciencedirect.com/science/article/abs/pii/S095741741300239X
Notes: WOS Core Collection