Hybrid lion and exponential PSO-based metaheuristic clustering approach for efficient dynamic data stream management

In dynamic data stream environment, the problem related to the exploration of big data within the real time scenario cannot be addressed through the tracking of each individual historic data even though it is highly memory expensive. Thus, a data stream clustering method is essential for exploring a...

Full description

Saved in:
Bibliographic Details
Published in:Scientific reports Vol. 15; no. 1; pp. 22343 - 30
Main Authors: Ananthi, M., Valarmathi, K., Ramathilagam, A., Praveen, R.
Format: Journal Article
Language:English
Published: London Nature Publishing Group UK 01.07.2025
Nature Portfolio
Subjects:
ISSN:2045-2322, 2045-2322
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In dynamic data stream environment, the problem related to the exploration of big data within the real time scenario cannot be addressed through the tracking of each individual historic data even though it is highly memory expensive. Thus, a data stream clustering method is essential for exploring and storing the potential amount of information from the historical data determined in a single pass. The dynamic algorithms developed for clustering need to satisfy the two requirements of concept drift and concept evolution. These dynamic algorithms need to handle the change in the association between the object attributes that are existing within each individual clusters. In this paper, A Hybrid Lion and Exponential PSO-based Metaheuristic Clustering Approach (HLEPSOMCA) is proposed for satisfying the requirements of concept drift and concept evolution during efficient dynamic data stream management. This Metaheuristic Clustering Approach is proposed with the properties of good scalability and minimized number of parameters with respect to the number of clusters and high dimensional data determined from the dataset. It adopted different methods of stochastic optimization and deterministic clustering techniques for centring the clusters in an optimal manner. It further adopted density clustering strategies for determining micro clusters, such that Lion and Exponential PSO can be adopted in the initialization phase for maximizing the performance. The experimental results of this HLEPSOMCA approach with respect to KDD-99 dataset confirmed that the purity achieved by the proposed HLEPSOMCA scheme is improved on an average by 13.24%, better than the bassline approaches used for comparison.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:2045-2322
2045-2322
DOI:10.1038/s41598-025-07404-9