Data-Driven Insights into Social Media Behavior Using Predictive Modeling

This study proposes a statistical machine learning approach to predict social media usage across various demographic categories in India. The dataset comprises twenty-six features, including demographic attributes (age, gender, education, location), social media engagement metrics (number of followe...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Procedia computer science Ročník 252; s. 480 - 489
Hlavní autoři: Selvakumar, V., Reddy, Nadipi Keerthana, Tulasi, R. Sree Vardhini, Kumar, Kunchala Rohit
Médium: Journal Article
Jazyk:angličtina
Vydáno: Elsevier B.V 2025
Témata:
ISSN:1877-0509, 1877-0509
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:This study proposes a statistical machine learning approach to predict social media usage across various demographic categories in India. The dataset comprises twenty-six features, including demographic attributes (age, gender, education, location), social media engagement metrics (number of followers, posts, time spent on platforms), and device-related information. It reflects real-world social media behavior on platforms such as WhatsApp, Facebook, and Instagram, capturing distinct patterns of weekday and weekend usage. Key variables such as time spent on each platform, the number of Instagram posts and followers, and overall social media usage were analyzed in detail. It is identified that significant predictors of user status categories through feature engineering, including Sabbatical, Self-Employed, Student, and Working Professional. Multiple regression models—Linear Regression, K-Nearest Neighbors, Decision Tree Regression, Random Forest Regression, Gradient Boosting, Naïve Bayes, and Support Vector Regression—were employed to assess their performance in predicting user status. Comparative analysis revealed that the Gradient Boosting algorithm outperformed other models with the highest accuracy. The machine learning workflow encompassed data pre-processing, feature engineering, model training, and evaluation, all implemented using Python. This study significantly advances the field by elucidating the key predictors of social media engagement and providing a thorough evaluation of the importance of features alongside a comparative analysis of predictive models.
ISSN:1877-0509
1877-0509
DOI:10.1016/j.procs.2025.01.007