Feature Selection via Pareto Multi-objective Genetic Algorithms

Feature selection, an important combinatorial optimization problem in data mining, aims to find a reduced subset of features of high quality in a dataset. Different categories of importance measures can be used to estimate the quality of a feature subset. Since each measure provides a distinct persp...

Full description

Saved in:
Bibliographic Details
Published in:Applied artificial intelligence Vol. 31; no. 9-10; pp. 764 - 791
Main Authors: Spolaôr, Newton, Lorena, Ana Carolina, Diana Lee, Huei
Format: Journal Article
Language:English
Published: Philadelphia Taylor & Francis 26.11.2017
Taylor & Francis Ltd
Taylor & Francis Group
Subjects:
ISSN:0883-9514, 1087-6545
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Feature selection, an important combinatorial optimization problem in data mining, aims to find a reduced subset of features of high quality in a dataset. Different categories of importance measures can be used to estimate the quality of a feature subset. Since each measure provides a distinct perspective of data and of which are their important features, in this article we investigate the simultaneous optimization of importance measures from different categories using multi-objective genetic algorithms grounded in the Pareto theory. An extensive experimental evaluation of the proposed method is presented, including an analysis of the performance of predictive models built using the selected subsets of features. The results show the competitiveness of the method in comparison with six feature selection algorithms. As an additional contribution, we conducted a pioneer, rigorous, and replicable systematic review on related work. As a result, a summary of 93 related papers strengthens features of our method.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0883-9514
1087-6545
DOI:10.1080/08839514.2018.1444334