TY - JOUR
T1 - QSAR model for predicting neuraminidase inhibitors of influenza A viruses (H1N1) based on adaptive grasshopper optimization algorithm
AU - Algamal, Z. Y.
AU - Qasim, M. K.
AU - Lee, M. H.
AU - Ali, H. T.M.
N1 - Publisher Copyright:
© 2020 Informa UK Limited, trading as Taylor & Francis Group.
PY - 2020/11/1
Y1 - 2020/11/1
N2 - High-dimensionality is one of the major problems which affect the quality of the quantitative structure–activity relationship (QSAR) modelling. Obtaining a reliable QSAR model with few descriptors is an essential procedure in chemometrics. The binary grasshopper optimization algorithm (BGOA) is a new meta-heuristic optimization algorithm, which has been used successfully to perform feature selection. In this paper, four new transfer functions were adapted to improve the exploration and exploitation capability of the BGOA in QSAR modelling of influenza A viruses (H1N1). The QSAR model with these new quadratic transfer functions was internally and externally validated based on MSEtrain, Y-randomization test, MSEtest, and the applicability domain (AD). The validation results indicate that the model is robust and not due to chance correlation. In addition, the results indicate that the descriptor selection and prediction performance of the QSAR model for training dataset outperform the other S-shaped and V-shaped transfer functions. QSAR model using quadratic transfer function shows the lowest MSEtrain. For the test dataset, proposed QSAR model shows lower value of MSEtest compared with the other methods, indicating its higher predictive ability. In conclusion, the results reveal that the proposed QSAR model is an efficient approach for modelling high-dimensional QSAR models and it is useful for the estimation of IC50 values of neuraminidase inhibitors that have not been experimentally tested.
AB - High-dimensionality is one of the major problems which affect the quality of the quantitative structure–activity relationship (QSAR) modelling. Obtaining a reliable QSAR model with few descriptors is an essential procedure in chemometrics. The binary grasshopper optimization algorithm (BGOA) is a new meta-heuristic optimization algorithm, which has been used successfully to perform feature selection. In this paper, four new transfer functions were adapted to improve the exploration and exploitation capability of the BGOA in QSAR modelling of influenza A viruses (H1N1). The QSAR model with these new quadratic transfer functions was internally and externally validated based on MSEtrain, Y-randomization test, MSEtest, and the applicability domain (AD). The validation results indicate that the model is robust and not due to chance correlation. In addition, the results indicate that the descriptor selection and prediction performance of the QSAR model for training dataset outperform the other S-shaped and V-shaped transfer functions. QSAR model using quadratic transfer function shows the lowest MSEtrain. For the test dataset, proposed QSAR model shows lower value of MSEtest compared with the other methods, indicating its higher predictive ability. In conclusion, the results reveal that the proposed QSAR model is an efficient approach for modelling high-dimensional QSAR models and it is useful for the estimation of IC50 values of neuraminidase inhibitors that have not been experimentally tested.
KW - Grasshopper optimization algorithm
KW - QSAR
KW - descriptor selection
KW - evolutionary algorithm
KW - neuraminidase inhibitor
UR - http://www.scopus.com/inward/record.url?scp=85091042753&partnerID=8YFLogxK
U2 - 10.1080/1062936X.2020.1818616
DO - 10.1080/1062936X.2020.1818616
M3 - Article
C2 - 32938208
AN - SCOPUS:85091042753
SN - 1062-936X
VL - 31
SP - 803
EP - 814
JO - SAR and QSAR in Environmental Research
JF - SAR and QSAR in Environmental Research
IS - 11
ER -