Abstract
Predicting customer defection is an important subject for companies producing cloud based software. The studied company sell three products (High, Medium and Low Price), in which the consumer has choice to defect or retain the product after certain period of time. The fact that the company collected very large dataset leads to inapplicability of standard statistical models due to the curse of dimensionality. Parametric statistical models will tend to produce very big standard error which may lead to inaccurate prediction results. This research examines a machine learning approach developed for high dimensional data namely logistic regression ensemble (LORENS). Using computational approaches, LORENS has prediction ability as good as standard logistic regression model i.e. between 66% to 77% prediction accuracy. In this case, LORENS is preferable as it is more reliable and free of assumptions.
Original language | English |
---|---|
Pages (from-to) | 86-93 |
Number of pages | 8 |
Journal | Procedia Computer Science |
Volume | 72 |
DOIs | |
Publication status | Published - 2015 |
Event | 3rd Information Systems International Conference, 2015 - Shenzhen, China Duration: 16 Apr 2015 → 18 Apr 2015 |
Keywords
- classification
- ensemble
- high dimensional data
- logistic regression
- machine learning