Evolutionary Selection of Interesting Class Association Rules Using Genetic Relation Algorithm

Abstract

During the last years, several association rule-based classification methods have been proposed, these algorithms may quickly generate accurate rules. However, the generated rules are often very large in terms of the number of rules and usually complex and hardly understandable for users. Among all the rules generated by the algorithms, only some of them are likely to be of any interest to the domain expert analyzing the data. Most of the rules are either redundant, irrelevant or obvious. In this paper, a new method for selecting the interesting class association rules is proposed by an evolutionary method named genetic relation algorithm. The algorithm evaluates the relevance and interestingness of the discovered association rules by the relationships between the rules in each generation using a specific measure of distance among them giving a reduced set of rules as the result in the final generation. This small rule set has the following properties: (i) accurate as it has at least the same classification accuracy as the complete association rule set, (ii) interesting because of the diversity of rules and (iii) comprehensible because it is more understandable for the users as the number of attributes involved in the rules is also small. The efficiency of the proposed method is compared with other conventional methods including genetic network programming-based mining using ten databases and the experimental results show that it outperforms others keeping a good balance between the classification accuracy and the comprehensibility of the rules. (C) 2011 Institute of Electrical Engineers of Japan. Published by John Wiley Sons, Inc.

Más información

Título según WOS: ID WOS:000294121200005 Not found in local WOS DB
Título de la Revista: IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING
Volumen: 6
Número: 5
Editorial: Wiley
Fecha de publicación: 2011
Página de inicio: 431
Página final: 440
DOI:

10.1002/tee.20679

Notas: ISI