Unveiling insights from unstructured wealth: a comparative analysis of clustering techniques on blockchain cryptocurrency data

In the fourth industrial revolution era of today, individuals encounter an immense volume of information daily. The digital world is rich in data like IoT, social media, healthcare, business, cryptocurrencies, cybersecurity, etc. The situation can become problematic as these vast amounts of data req...

Full description

Saved in:
Bibliographic Details
Published in:Advances in Computing and Engineering Vol. 4; no. 1; pp. 1 - 43
Main Authors: Haraty, Ramzi A., Sobeh, Salma
Format: Journal Article
Language:English
Published: Academy Publishing Center 28.01.2024
Subjects:
ISSN:2735-5977, 2735-5985
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In the fourth industrial revolution era of today, individuals encounter an immense volume of information daily. The digital world is rich in data like IoT, social media, healthcare, business, cryptocurrencies, cybersecurity, etc. The situation can become problematic as these vast amounts of data require significant storage capacity, which leads to challenges in executing tasks such as analytical operations, processing operations, and retrieval operations that are time-consuming and arduous. To effectively analyze and utilize this data, artificial intelligence, particularly machine learning, and deep learning, can provide a practical solution. Clustering, an unsupervised learning technique, aims to identify a specific number of clusters to effectively categorize the data through data grouping. Hence, clustering is related to many fields and is used in various applications that deal with large datasets. This survey examines seven widely recognized clustering techniques, namely k-means, G-means, DBSCAN, Agglomerative hierarchical clustering, Two-stage density (DBSCAN and k-means) algorithm, Two-levels (DBSCAN and hierarchical) clustering algorithm, and Two-stage MeanShift and k-means clustering algorithm and compares them with a real dataset - The Blockchain dataset, including prominent cryptocurrencies like Binance, Bitcoin, Doge, and Ethereum, under several metrics such as silhouette coefficient, Calinski-Harabasz, Davies-Bouldin Index, time complexity, and entropy. Received: 20 July 2023 Accepted: 28 November 2023 Published: 28 January 2024
ISSN:2735-5977
2735-5985
DOI:10.21622/ACE.2024.04.1.698