Extensions to the k-AMH algorithm for numerical clustering
Journal of ICT, 17, No. 4 (October) 2018, pp: 587–599

How to cite this paper:
Seman, A., & Sapawi, A. M. (2018). Extensions to the k-AMH algorithm for numerical clustering. Journal of Information and Communication Technology, 17(4), 587-599.

EXTENSIONS TO THE K-AMH ALGORITHM FOR NUMERICAL CLUSTERING

Ali Seman & Azizian Mohd Sapawi
Faculty of Computer and Mathematical Sciences
Universiti Teknologi MARA, Malaysia
aliseman@; azizian@

ABSTRACT

The k-AMH algorithm has been proven efficient in clustering categorical datasets. It can also be used to cluster numerical values with minimal modification to the original algorithm. In this paper, we present two algorithms that extend the k-AMH algorithm to the clustering of numerical values. The original k-AMH algorithm for categorical values uses a simple matching dissimilarity measure, whereas for numerical values Euclidean distance is used. The first extension to the k-AMH algorithm, denoted k-AMH Numeric I, enables it to cluster numerical values in a fashion similar to k-AMH for categorical data. The second extension, k-AMH Numeric II, adopts the cost function of the fuzzy k-Means algorithm together with Euclidean distance, and has demonstrated performance similar to that of k-AMH Numeric I. The clustering performance of the two algorithms was evaluated on six real-world datasets against a benchmark algorithm, the fuzzy k-Means algorithm. The results obtained indicate that the two algorithms are as efficient as the fuzzy k-Means algorithm when clustering numerical values.
Further, on an ANOVA test, k-AMH Numeric I obtained the highest accuracy score of for the six datasets combined with p-value less than , indicating a 95% confidence level. The experimental results prove that the k-AMH Numeric I and k-AMH Numeric II algorithms can be effectively .

Received: 3 April 2018 Accepted: 19 August 2018 Published: 1 October 2018
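The abstract contrasts a simple matching dissimilarity for categorical values with Euclidean distance for numerical values, and refers to the cost function of the fuzzy k-Means algorithm. The sketch below illustrates those three ingredients in their standard textbook form. It is not the k-AMH Numeric I or II algorithm, whose details are not given in this section; the function names and the fuzziness exponent m = 2 are assumptions made for illustration only.

```python
import math

def simple_matching(a, b):
    """Number of attribute positions where two categorical records differ."""
    return sum(1 for x, y in zip(a, b) if x != y)

def euclidean(a, b):
    """Euclidean distance between two numerical records."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def fuzzy_memberships(point, centers, m=2.0):
    """Standard fuzzy k-Means membership of one point in each cluster.

    m is the fuzziness exponent (assumed to be 2 here, must be > 1).
    """
    dists = [euclidean(point, c) for c in centers]
    # A point lying exactly on a center belongs fully to that cluster.
    if any(d == 0.0 for d in dists):
        hits = [1.0 if d == 0.0 else 0.0 for d in dists]
        return [h / sum(hits) for h in hits]
    power = 2.0 / (m - 1.0)
    return [1.0 / sum((d_j / d_k) ** power for d_k in dists) for d_j in dists]

def fuzzy_cost(points, centers, m=2.0):
    """Standard fuzzy k-Means cost: sum of u^m * squared Euclidean distance."""
    total = 0.0
    for p in points:
        u = fuzzy_memberships(p, centers, m)
        total += sum((u_j ** m) * euclidean(p, c) ** 2
                     for u_j, c in zip(u, centers))
    return total

# Categorical records are compared by mismatches, numerical ones by distance.
mismatches = simple_matching(("red", "small", "round"), ("red", "large", "round"))
distance = euclidean((0.0, 0.0), (3.0, 4.0))
memberships = fuzzy_memberships((1.0, 0.0), [(0.0, 0.0), (4.0, 0.0)])
cost = fuzzy_cost([(1.0, 0.0)], [(0.0, 0.0), (4.0, 0.0)])
```

The memberships of a point always sum to 1, so a point closer to one center receives a proportionally larger share of membership in that cluster.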