Product collaborative filtering based recommendation systems for large-scale E-commerce

•E-commerce demands multi-choice products, challenging businesses.•Recommender systems reshape E-commerce with personalized experiences.•Scalability is a pressing issue for recommendation systems.•Parallel techniques tackle scalability challenges in E-commerce.•Apache Spark accelerates training time...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of information management data insights Jg. 5; H. 1; S. 100322
Hauptverfasser: Trinh, Trang, Nguyen, Van-Ho, Nguyen, Nghia, Nguyen, Duy-Nghia
Format: Journal Article
Sprache:Englisch
Veröffentlicht: Elsevier Ltd 01.06.2025
Elsevier
Schlagworte:
ISSN:2667-0968, 2667-0968
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•E-commerce demands multi-choice products, challenging businesses.•Recommender systems reshape E-commerce with personalized experiences.•Scalability is a pressing issue for recommendation systems.•Parallel techniques tackle scalability challenges in E-commerce.•Apache Spark accelerates training time for large-scale E-commerce. The rapid growth in e-commerce and the increasing diversity of customer preferences necessitates the development of an effective recommender system for a business offering a wide range of products. This paper introduces a product-based collaborative filtering approach utilizing Apache Spark, a powerful parallel processing framework to address the scalability issues of recommender systems in the cloud computing environment. Using Spark's distributed computing ability, our model attains a surprising 7.6 times speedup on the training time compared to traditional single-machine methods while preserving accuracy with a Root Mean Square Error (RMSE) 0.9. These results demonstrate the effectiveness of parallel and distributed techniques in developing efficient and accurate recommender systems for large-scale e-commerce applications. Future work will focus on applying multi-model to enhance the accuracy of prediction and configuration to optimize the cost of cluster operations.
ISSN:2667-0968
2667-0968
DOI:10.1016/j.jjimei.2025.100322