A novel molecular descriptor selection method in QSAR classification model based on weighted penalized logistic regression

Zakariya Yahya Algamal, Muhammad Hisyam Lee*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

34 Citations (Scopus)

Abstract

Molecular descriptor selection is a pivotal tool for quantitative structure–activity relationship modeling. This paper proposes a novel molecular descriptor selection method on the basis of taking into account the information of the group type that the descriptor belongs to. This descriptor selection method is on the basis of combining penalized logistic regression with 2-sample t test. The proposed method can perform filtering and weighting simultaneously. Specifically, 2-sample t test is employed as filter method by removing the descriptor which is not show statistically significant difference. On the other hand, a weighted penalized logistic regression is used by assigning a weight depending on the 2-sample t test value inside the descriptor type block. The proposed method is experimentally tested and compared with state-of-the-art selection methods. The results show that our proposed method is simpler and faster with efficient classification performance.

Original languageEnglish
Article numbere2915
JournalJournal of Chemometrics
Volume31
Issue number10
DOIs
Publication statusPublished - Oct 2017
Externally publishedYes

Keywords

  • QSAR classification
  • SCAD
  • adaptive lasso
  • descriptor selection
  • penalized logistic regression

Fingerprint

Dive into the research topics of 'A novel molecular descriptor selection method in QSAR classification model based on weighted penalized logistic regression'. Together they form a unique fingerprint.

Cite this