An Efficient Distributed Programming Model for Mining Useful Patterns in Big Datasets
| Published in: | Technical review - IETE Vol. 30; no. 1; pp. 53 - 63 |
|---|---|
| Main authors: | , , , |
| Format: | Journal Article |
| Language: | English |
| Published: | New Delhi; Taylor & Francis; 01.01.2013; Taylor & Francis Ltd |
| Subjects: | |
| ISSN: | 0256-4602, 0974-5971 |
| Summary: | Mining combined association rules with correlation and market basket analysis can discover customers' purchase rules together with frequently correlated, associated-correlated, and independent patterns simultaneously, all of which are extremely useful for everyday business decisions. However, because of the main-memory bottleneck of a single computing system, existing approaches fail to handle big datasets. Moreover, most of them cannot overcome the screening and overhead of null transactions; hence, performance degrades drastically. In this paper, considering these limitations, we propose a distributed programming model for mining business-oriented transactional datasets using an improved MapReduce framework on Hadoop, which not only overcomes the limits of single-processor, main-memory-based computing but is also highly scalable as the database size increases. Experimental results show that the technique proposed and developed in this paper is feasible for mining big transactional datasets in terms of time and scalability. |
|---|---|
| DOI: | 10.4103/0256-4602.107340 |