A novel ensemble method for high-dimensional genomic data classification

Espichán-Linares, A.; Villanueva, E.

doi:https://doi.org/10.1109/BIBM.2018.8621386

A novel ensemble method for high-dimensional genomic data classification

Date

2019

Authors

Espichán-Linares, A.

Villanueva, E.

Publisher

Institute of Electrical and Electronics Engineers

URI

http://hdl.handle.net/20.500.14657/205765

DOI

https://doi.org/10.1109/BIBM.2018.8621386

Abstract

Classifier ensembles have shown to be an attractive approach for dealing with the curse of dimensionality problems in genomic data. The common idea of this approach is to integrate diverse and accurate base predictors in order to obtain a classification system better than its members. Many methods pursue it by introducing perturbations in some aspect of the learning process (examples, features, base learners, etc.). However, many of the existing methodologies do so in a completely random way, without having control of the perturbation process, which can generate unhelpful base predictors that can affect the final performance or the need to use some pruning strategy. In this paper we introduce tEnsemble, a new and simple approach that seeks an adequate balance between diversity and accuracy. This is done by using a previously optimized template feature set, which serves to guide the perturbation process on the feature space in a controlled manner. Experiments carried out on 39 gene expression public data sets showed that this methodology has the potential to produce effective classifier ensemble systems, showing a frequent superiority in relation to Random Forest, a well-established methodology in the area.

Keywords

Curse of dimensionality, Computer science, Random forest, Classifier (UML), Artificial intelligence, Machine learning, Ensemble learning, Data mining, Feature vector, Dimensionality reduction, Pattern recognition (psychology)

Collections

Artículos (DFI)

Full item page

A novel ensemble method for high-dimensional genomic data classification

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

URI

DOI

Acceso al texto completo solo para la Comunidad PUCP

Abstract

Description

Keywords

Citation

Collections

Endorsement

Review

Supplemented By

Referenced By