Taxonomy grooming algorithm ‐ An autodidactic domain specific dimensionality reduction approach for fast clustering of social media text data

Social media being the most eminent source toward the growth of big data is important for information retrieval‐based applications to improve the efficiency in proportional to the volume it must deal with. One way to achieve better performance is to upgrade the processing capacity and the alternativ...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Concurrency and computation Jg. 34; H. 11
Hauptverfasser: Renjith, Shini, Sreekumar, A., Jathavedan, M.
Format: Journal Article
Sprache:Englisch
Veröffentlicht: Hoboken, USA John Wiley & Sons, Inc 15.05.2022
Wiley Subscription Services, Inc
Schlagworte:
ISSN:1532-0626, 1532-0634
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Social media being the most eminent source toward the growth of big data is important for information retrieval‐based applications to improve the efficiency in proportional to the volume it must deal with. One way to achieve better performance is to upgrade the processing capacity and the alternative option is to improve the processing methodology. The latter can be achieved using smarter processing techniques and/or better algorithms. Reducing the data volume that needs to be processed is a good strategy and it can be achieved by extracting only the relevant information via user segmentation by adopting an appropriate clustering technique. However, while dealing with text content, clustering algorithms do suffer due to the very high dimensions to be dealt with. Since the domain‐specific aspects are getting lost while applying traditional dimensionality reduction approaches, it is important to device an alternate strategy. This work proposes a taxonomy grooming algorithm (TGA), an autodidactic domain‐specific dimensionality reduction approach, for fast clustering of social media text data. Our experiment results are very promising and the dimensionality reduction using TGA resulted in better results in comparison with the traditional dimensionality reduction approaches.
Bibliographie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1532-0626
1532-0634
DOI:10.1002/cpe.6837