TY - JOUR
T1 - Evidence accumulation clustering with possibilitic fuzzy C-means base clustering approach to disease diagnosis
AU - Iliyasu, Abdullah M.
AU - Fatichah, Chastine
AU - Abuhasel, Khaled A.
N1 - Publisher Copyright:
© 2016 Taylor and Francis Group, LLC.
PY - 2017
Y1 - 2017
N2 - Traditionally, supervised machine learning methods are the first choice for tasks involving classification of data. This study provides a non-conventional hybrid alternative technique (pEAC) that blends the Possibilistic Fuzzy C-Means (PFCM) as base cluster generating algorithm into the ‘standard’ Evidence Accumulation Clustering (EAC) clustering method. The PFCM coalesces the separate properties of the Possibilistic C-Means (PCM) and Fuzzy C-Means (FCM) algorithms into a sophisticated clustering algorithm. Notwithstanding the tremendous capabilities offered by this hybrid technique, in terms of structure, it resembles the hEAC and fEAC ensemble clustering techniques that are realised by integrating the K-Means and FCM clustering algorithms into the EAC technique. To validate the new technique’s effectiveness, its performance on both synthetic and real medical datasets was evaluated alongside individual runs of well-known clustering methods, other unsupervised ensemble clustering techniques and some supervised machine learning methods. Our results show that the proposed pEAC technique outperformed the individual runs of the clustering methods and other unsupervised ensemble techniques in terms accuracy for the diagnosis of hepatitis, cardiovascular, breast cancer, and diabetes ailments that were used in the experiments. Remarkably, compared alongside selected supervised machine learning classification models, our proposed pEAC ensemble technique exhibits better diagnosing accuracy for the two breast cancer datasets that were used, which suggests that even at the cost of none labelling of data, the proposed technique offers efficient medical data classification.
AB - Traditionally, supervised machine learning methods are the first choice for tasks involving classification of data. This study provides a non-conventional hybrid alternative technique (pEAC) that blends the Possibilistic Fuzzy C-Means (PFCM) as base cluster generating algorithm into the ‘standard’ Evidence Accumulation Clustering (EAC) clustering method. The PFCM coalesces the separate properties of the Possibilistic C-Means (PCM) and Fuzzy C-Means (FCM) algorithms into a sophisticated clustering algorithm. Notwithstanding the tremendous capabilities offered by this hybrid technique, in terms of structure, it resembles the hEAC and fEAC ensemble clustering techniques that are realised by integrating the K-Means and FCM clustering algorithms into the EAC technique. To validate the new technique’s effectiveness, its performance on both synthetic and real medical datasets was evaluated alongside individual runs of well-known clustering methods, other unsupervised ensemble clustering techniques and some supervised machine learning methods. Our results show that the proposed pEAC technique outperformed the individual runs of the clustering methods and other unsupervised ensemble techniques in terms accuracy for the diagnosis of hepatitis, cardiovascular, breast cancer, and diabetes ailments that were used in the experiments. Remarkably, compared alongside selected supervised machine learning classification models, our proposed pEAC ensemble technique exhibits better diagnosing accuracy for the two breast cancer datasets that were used, which suggests that even at the cost of none labelling of data, the proposed technique offers efficient medical data classification.
KW - Disease diagnosis
KW - Evidence accumulation clustering
KW - Fuzzy C-means
KW - Health informatics
KW - Hybrid intelligent systems
KW - K-means
KW - Medical data classification
KW - Possibilitic fuzzy C-means
UR - http://www.scopus.com/inward/record.url?scp=85018372154&partnerID=8YFLogxK
U2 - 10.7305/automatika.2016.10.1427
DO - 10.7305/automatika.2016.10.1427
M3 - Article
AN - SCOPUS:85018372154
SN - 0005-1144
VL - 57
SP - 822
EP - 835
JO - Automatika
JF - Automatika
IS - 3
ER -