CT4RDD: Classification trees for research on digital divide
Keywords: Digital divide analysis, Digital divide modeling, Digital divide measurement, Census data mining, C4.5 algorithm, J4.8 algorithm
Abstract
This paper presents CT4RDD (classification trees for research on digital divide), a novel methodology for the quantitative analysis and modeling of the digital divide phenomenon with an approach of single country. It is inspired on the reputed Quinlan’s C4.5 algorithm to automatically produce classification trees, as implemented in Witten & Frank’s WEKA software toolkit. The methodology is created and evaluated on data from the 2010 Mexican Population and Housing Census that include a number of variables whose interactions involve aspects of the phenomenon; particularly, interactions among Internet service presence in households and a number of features regarding educational and economical levels, genders, ages, housing characteristics, ratios of indigenous population, etc. Discretization is used to represent percentages of presence of Internet in households of municipalities as a nominal target attribute to produce classification trees. Results suggest that the methodology can produce quantitative profiles that describe similarities and differences among a series of municipality classes that present different percentages of presence of Internet in households. The discovered profiles provide scholars, government officials and enterprise managers with valuable insight for research, planning and decision making.
Más información
Título de la Revista: | EXPERT SYSTEMS WITH APPLICATIONS |
Volumen: | 40 |
Número: | 14 |
Editorial: | PERGAMON-ELSEVIER SCIENCE LTD |
Fecha de publicación: | 2013 |
Página de inicio: | 5779 |
Página final: | 5786 |
Idioma: | English |
URL: | https://www.sciencedirect.com/science/article/abs/pii/S095741741300239X |
Notas: | WOS Core Collection |