Fraudulent account detection in social media using hybrid deep transformer model and hyperparameter optimization

The high rate of social media development has triggered a high rate of fake accounts, which are a great risk to the privacy of users and the integrity of the platform. These malicious accounts are hard to detect because user activity data is highly imbalanced, dimensional, and sequential. The emerge...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Scientific reports Jg. 15; H. 1; S. 38447 - 23
Hauptverfasser: Shukla, Prashant Kumar, Veerasamy, Bala Dhandayuthapani, Alduaiji, Noha, Addula, Santosh Reddy, Pandey, Ankur, Shukla, Piyush Kumar
Format: Journal Article
Sprache:Englisch
Veröffentlicht: London Nature Publishing Group UK 03.11.2025
Nature Publishing Group
Nature Portfolio
Schlagworte:
ISSN:2045-2322, 2045-2322
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The high rate of social media development has triggered a high rate of fake accounts, which are a great risk to the privacy of users and the integrity of the platform. These malicious accounts are hard to detect because user activity data is highly imbalanced, dimensional, and sequential. The emergence of fake profiles on social media endangers the privacy and trust of social media users. It is difficult to detect such accounts because of high-dimensional, highly sequential, and imbalanced user behavior data. Current techniques tend to miss out on the complicated activity patterns or even overfit, which is why a strong, scalable, and precise model of social media fraud detection is required. This study suggests a new deep learning architecture that entails a Temporal Convolutional Network (TCN) with Generative Adversarial Network (GAN)-based data augmentation to generate minority classes, and Autoencoder-based feature extraction to reduce dimensionality. The Seagull Optimization Algorithm (SOA), which is a metaheuristic algorithm, is used to optimize hyperparameters by balancing efficiency and speed of convergence in global search. The framework is tested on benchmark datasets (Cresci-2017 and TwiBot-22) and compared to the state-of-the-art models. It has been shown in experiments that the suggested TCN-GAN-SOA framework performs better, with ROC-AUC scores of 0.96 on Cresci-2017 and 0.95 on TwiBot-22, and a higher precision-recall value and better F1-scores. In addition, computational efficiency can be verified by the runtime analysis; case studies prove the framework’s strength when handling various situations of fraudulent behaviors. The given solution offers a scalable, reliable, and accurate methodology of detecting social media fraud based on the combination of sophisticated sequence modeling, realistic data augmentation, and hyperparameter optimization.
Bibliographie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:2045-2322
2045-2322
DOI:10.1038/s41598-025-24326-8