High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm

Z. Y. Algamal, M. H. Lee*, A. M. Al-Fakih, M. Aziz

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

24 Citations (Scopus)

Abstract

In high-dimensional quantitative structure–activity relationship (QSAR) modelling, penalization methods have been a popular choice to simultaneously address molecular descriptor selection and QSAR model estimation. In this study, a penalized linear regression model with L1/2-norm is proposed. Furthermore, the local linear approximation algorithm is utilized to avoid the non-convexity of the proposed method. The potential applicability of the proposed method is tested on several benchmark data sets. Compared with other commonly used penalized methods, the proposed method can not only obtain the best predictive ability, but also provide an easily interpretable QSAR model. In addition, it is noteworthy that the results obtained in terms of applicability domain and Y-randomization test provide an efficient and a robust QSAR model. It is evident from the results that the proposed method may possibly be a promising penalized method in the field of computational chemistry research, especially when the number of molecular descriptors exceeds the number of compounds.

Original languageEnglish
Pages (from-to)703-719
Number of pages17
JournalSAR and QSAR in Environmental Research
Volume27
Issue number9
DOIs
Publication statusPublished - 1 Sept 2016
Externally publishedYes

Keywords

  • L-norm
  • QSAR
  • bridge penalty
  • imidazo[4,5-b]pyridine derivatives
  • penalized method
  • procollagen C-proteinase

Fingerprint

Dive into the research topics of 'High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm'. Together they form a unique fingerprint.

Cite this