Skip to main navigation Skip to search Skip to main content

Ensemble based machine learning for optimizing toxicity in cancer drug discovery

  • Heri Kuswanto*
  • , Erlin Sukmaputri
  • , Hayato Ohwada
  • *Corresponding author for this work
  • Institut Teknologi Sepuluh Nopember
  • Tokyo University of Science

Research output: Contribution to journalArticlepeer-review

Abstract

Cancer is a disease caused by abnormal growth due to the cells of the body’stissues that turn into cancer cells. Radio therapy is one of the cancer treatments thathas a side effect of killing normal cells around cancer cells. Radio protector is madeto reduce normal cell death and increase cancer cell death. This research identifies the compounds corresponding to the toxicity with normal cell death rate below and above20%. The data used in this study is the level of toxicity to classify compounds for radio protector consisting of 84 compounds with 217 predictors (features). Two ensemble based machine learning approaches are applied to overcoming the problem of high dimensionality of the data, namely Logistic Regression Ensembles (LORENS) and Ensemble Support Vector Machine (AdaBoost-SVM). The AdaBoost-SVM is applied to the important features selected by Mean Decreasing Gini (MDG) index. The results showed thatthe AdaBoost-SVM outperforms LORENS significantly. The accuracy is 0.7889 obtainedby examining 5% of most important features.

Original languageEnglish
Pages (from-to)697-703
Number of pages7
JournalICIC Express Letters
Volume13
Issue number8
DOIs
Publication statusPublished - Aug 2019

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • AdaBoost-SVM
  • Cancer
  • Compound
  • High dimensionality
  • LORENS
  • Toxicity

Fingerprint

Dive into the research topics of 'Ensemble based machine learning for optimizing toxicity in cancer drug discovery'. Together they form a unique fingerprint.

Cite this