Fraudulent account detection in social media using hybrid deep transformer model and hyperparameter optimization

The high rate of social media development has triggered a high rate of fake accounts, which are a great risk to the privacy of users and the integrity of the platform. These malicious accounts are hard to detect because user activity data is highly imbalanced, dimensional, and sequential. The emerge...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Scientific reports Ročník 15; číslo 1; s. 38447 - 23
Hlavní autoři: Shukla, Prashant Kumar, Veerasamy, Bala Dhandayuthapani, Alduaiji, Noha, Addula, Santosh Reddy, Pandey, Ankur, Shukla, Piyush Kumar
Médium: Journal Article
Jazyk:angličtina
Vydáno: London Nature Publishing Group UK 03.11.2025
Nature Publishing Group
Nature Portfolio
Témata:
ISSN:2045-2322, 2045-2322
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:The high rate of social media development has triggered a high rate of fake accounts, which are a great risk to the privacy of users and the integrity of the platform. These malicious accounts are hard to detect because user activity data is highly imbalanced, dimensional, and sequential. The emergence of fake profiles on social media endangers the privacy and trust of social media users. It is difficult to detect such accounts because of high-dimensional, highly sequential, and imbalanced user behavior data. Current techniques tend to miss out on the complicated activity patterns or even overfit, which is why a strong, scalable, and precise model of social media fraud detection is required. This study suggests a new deep learning architecture that entails a Temporal Convolutional Network (TCN) with Generative Adversarial Network (GAN)-based data augmentation to generate minority classes, and Autoencoder-based feature extraction to reduce dimensionality. The Seagull Optimization Algorithm (SOA), which is a metaheuristic algorithm, is used to optimize hyperparameters by balancing efficiency and speed of convergence in global search. The framework is tested on benchmark datasets (Cresci-2017 and TwiBot-22) and compared to the state-of-the-art models. It has been shown in experiments that the suggested TCN-GAN-SOA framework performs better, with ROC-AUC scores of 0.96 on Cresci-2017 and 0.95 on TwiBot-22, and a higher precision-recall value and better F1-scores. In addition, computational efficiency can be verified by the runtime analysis; case studies prove the framework’s strength when handling various situations of fraudulent behaviors. The given solution offers a scalable, reliable, and accurate methodology of detecting social media fraud based on the combination of sophisticated sequence modeling, realistic data augmentation, and hyperparameter optimization.
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:2045-2322
2045-2322
DOI:10.1038/s41598-025-24326-8