Gene selection and classification of microarray gene expression data based on a new adaptive L1-norm elastic net penalty

Aiedh Mrisi Alharthi, Muhammad Hisyam Lee*, Zakariya Yahya Algamal

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

14 Citations (Scopus)

Abstract

The removal of irrelevant and insignificant genes has always been a major step in microarray data analysis. The application of gene selection methods in biological datasets has greatly increased, supporting expert systems in cancer diagnostic capability with high classification accuracy. Penalized logistic regression (PLR) using the elastic net (EN) has been widely used in high-dimensional cancer classification in recent years to estimate the gene coefficients and perform gene selection simultaneously. However, the EN estimator does not satisfy the oracle properties. This paper proposes the PLR using the adaptive elastic net (AEN), abbreviated as PLRAEN, to address the inconsistency. Our method employs the ratio (BWR) as an initial weight inside the L1-norm of the EN model. Several experiments were performed on a simulation study for a different number of predictor variables, sample sizes, and correlation coefficients and also on three public gene expression datasets to evaluate the effectiveness. Experimental results demonstrate that the proposed method consistently outperforms two other contemporary penalized methods regarding classification accuracy and the number of selected genes. Therefore, we conclude that PLRAEN is a better method to implement gene selection in the high-dimensional cancer classification field.

Original languageEnglish
Article number100622
JournalInformatics in Medicine Unlocked
Volume24
DOIs
Publication statusPublished - Jan 2021
Externally publishedYes

Keywords

  • Adapted elastic net
  • Cancer diagnosis
  • Gene selection
  • Penalized logistic regression

Fingerprint

Dive into the research topics of 'Gene selection and classification of microarray gene expression data based on a new adaptive L1-norm elastic net penalty'. Together they form a unique fingerprint.

Cite this