Logistic Regression Ensemble for Predicting Customer Defection with Very Large Sample Size

Heri Kuswanto*, Ayu Asfihani, Yogi Sarumaha, Hayato Ohwada

*Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

17 Citations (Scopus)

Abstract

Predicting customer defection is an important subject for companies producing cloud based software. The studied company sell three products (High, Medium and Low Price), in which the consumer has choice to defect or retain the product after certain period of time. The fact that the company collected very large dataset leads to inapplicability of standard statistical models due to the curse of dimensionality. Parametric statistical models will tend to produce very big standard error which may lead to inaccurate prediction results. This research examines a machine learning approach developed for high dimensional data namely logistic regression ensemble (LORENS). Using computational approaches, LORENS has prediction ability as good as standard logistic regression model i.e. between 66% to 77% prediction accuracy. In this case, LORENS is preferable as it is more reliable and free of assumptions.

Original languageEnglish
Pages (from-to)86-93
Number of pages8
JournalProcedia Computer Science
Volume72
DOIs
Publication statusPublished - 2015
Event3rd Information Systems International Conference, 2015 - Shenzhen, China
Duration: 16 Apr 201518 Apr 2015

Keywords

  • classification
  • ensemble
  • high dimensional data
  • logistic regression
  • machine learning

Fingerprint

Dive into the research topics of 'Logistic Regression Ensemble for Predicting Customer Defection with Very Large Sample Size'. Together they form a unique fingerprint.

Cite this