A new adaptive L1-norm for optimal descriptor selection of high-dimensional QSAR classification model for anti-hepatitis C virus activity of thiourea derivatives

Z. Y. Algamal, M. H. Lee*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

30 Citations (Scopus)

Abstract

A high-dimensional quantitative structure–activity relationship (QSAR) classification model typically contains a large number of irrelevant and redundant descriptors. In this paper, a new design of descriptor selection for the QSAR classification model estimation method is proposed by adding a new weight inside L1-norm. The experimental results of classifying the anti-hepatitis C virus activity of thiourea derivatives demonstrate that the proposed descriptor selection method in the QSAR classification model performs effectively and competitively compared with other existing penalized methods in terms of classification performance on both the training and the testing datasets. Moreover, it is noteworthy that the results obtained in terms of stability test and applicability domain provide a robust QSAR classification model. It is evident from the results that the developed QSAR classification model could conceivably be employed for further high-dimensional QSAR classification studies.

Original languageEnglish
Pages (from-to)75-90
Number of pages16
JournalSAR and QSAR in Environmental Research
Volume28
Issue number1
DOIs
Publication statusPublished - 2 Jan 2017
Externally publishedYes

Keywords

  • QSAR
  • classification
  • lasso
  • penalized logistic regression
  • penalized method

Fingerprint

Dive into the research topics of 'A new adaptive L1-norm for optimal descriptor selection of high-dimensional QSAR classification model for anti-hepatitis C virus activity of thiourea derivatives'. Together they form a unique fingerprint.

Cite this