Adaptive Markov Clustering Algorithm for Optimal Event Detection From Heterogeneous News Documents

Several Event Detection (ED) applications utilize various wrapper Feature Selection (FS) techniques based on wrapping the Markov Clustering Algorithm (MCL) with the Binary Bat Algorithm (BBA) or Adaptive Bat Algorithm (ABBA). These approaches have shown promising results in identifying relevant feat...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:IEEE access Ročník 12; s. 72079 - 72097
Hlavní autor: Al-Dyani, Wafa Zubair
Médium: Journal Article
Jazyk:angličtina
Vydáno: Piscataway IEEE 2024
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Témata:
ISSN:2169-3536, 2169-3536
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:Several Event Detection (ED) applications utilize various wrapper Feature Selection (FS) techniques based on wrapping the Markov Clustering Algorithm (MCL) with the Binary Bat Algorithm (BBA) or Adaptive Bat Algorithm (ABBA). These approaches have shown promising results in identifying relevant feature subset for MCL, leading to more precise event cluster from heterogeneous news articles. However, such wrapped FS methods involve coupling two methods (FS and ED) within ED model, with their performance influencing each other. While ABBA improved upon BBA's limitations, MCL's rapid convergence can hinder detection effectiveness. This fast convergence can lead to local optima and the detection of meaningless clusters. Additionally, MCL's identification ability diminishes as the feature space grows. To address these issues, this paper develops two novel adaptive techniques to control MCL's inflation (inf) and pruning (p) parameters, thereby managing its convergence behavior. Consequently, a new variant called Adaptive MCL (AMCL) is introduced and combined with ABBA. The effectiveness of the ABBA-AMCL method is evaluated using 10 benchmark datasets and two substantial Facebook news datasets. Various performance measures are employed to compare ABBA-AMCL against established methods. The empirical results demonstrate that ABBA-AMCL excels at extracting high-quality, real-world event clusters from various news text sources.
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:2169-3536
2169-3536
DOI:10.1109/ACCESS.2024.3403506