An Efficient Distributed Programming Model for Mining Useful Patterns in Big Datasets

Bibliographic Details
Published in: Technical review - IETE Volume 30; Issue 1; pp. 53 - 63
Main authors: Karim, Md. Rezaul, Ahmed, Chowdhury Farhan, Jeong, Byeong-Soo, Choi, Ho-Jin
Format: Journal Article
Language: English
Published: New Delhi Taylor & Francis 01.01.2013
Taylor & Francis Ltd
ISSN: 0256-4602, 0974-5971
Description
Summary: Mining combined association rules with correlation and market basket analysis can simultaneously discover customers' purchase rules along with frequently correlated, associated-correlated, and independent patterns, which are extremely useful for making everyday business decisions. However, because of the main-memory bottleneck of a single computing system, existing approaches fail to handle big datasets. Moreover, most of them cannot avoid the screening overhead of null transactions; hence, performance degrades drastically. In this paper, considering these limitations, we propose a distributed programming model for mining business-oriented transactional datasets using an improved MapReduce framework on Hadoop, which not only overcomes the limits of single-processor, main-memory-based computing but is also highly scalable as database size increases. Experimental results show that the technique proposed and developed in this paper is feasible for mining big transactional datasets in terms of time and scalability.
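The general idea in the summary (counting itemsets with a map step and a reduce step while skipping null transactions) can be illustrated with a minimal sketch. This is not the paper's implementation: it is plain single-process Python standing in for Hadoop, the function names `map_phase`, `reduce_phase`, and `mine` are hypothetical, and treating a transaction with fewer than two distinct items as "null" is an assumption made here for illustration only.

```python
from collections import Counter
from itertools import combinations


def map_phase(transaction, max_size=2):
    """Emit (itemset, 1) pairs for one transaction.

    Assumption: a transaction with fewer than two distinct items is
    treated as a null transaction and emits nothing.
    """
    items = sorted(set(transaction))
    if len(items) < 2:
        return
    for k in range(1, max_size + 1):
        for itemset in combinations(items, k):
            yield itemset, 1


def reduce_phase(pairs):
    """Sum the counts emitted for each itemset key."""
    counts = Counter()
    for itemset, n in pairs:
        counts[itemset] += n
    return counts


def mine(transactions, min_support=2, max_size=2):
    """Run map then reduce, keeping itemsets that meet min_support."""
    pairs = (p for t in transactions for p in map_phase(t, max_size))
    counts = reduce_phase(pairs)
    return {s: c for s, c in counts.items() if c >= min_support}


# Hypothetical toy data, not taken from the paper.
transactions = [
    ["bread", "milk"],
    ["bread", "milk", "eggs"],
    ["milk"],
    [],
    ["bread", "eggs"],
]
result = mine(transactions)
```

In a real Hadoop job the map and reduce functions would run on separate nodes over partitions of the dataset; the sketch only shows how dropping assumed null transactions in the map step reduces the number of pairs the reduce step has to aggregate.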
DOI: 10.4103/0256-4602.107340