An efficient vertical-Apriori Mapreduce algorithm for frequent item-set mining

Bibliographic details
Published in: 2015 IEEE 10th Conference on Industrial Electronics and Applications (ICIEA), pp. 108-112
Main authors: Sun, Dawei; Lee, Vincent Cs; Burstein, Frada; Haghighi, Pari Delir
Format: Conference paper
Language: English
Published: IEEE, 1 June 2015
Description
Summary: Algorithms such as OPUS and Apriori-based MapReduce have been proposed in the literature to improve the efficiency of mining frequent item-sets from transactional datasets for pattern recognition applications. Most of these algorithms, however, have been evaluated offline on relatively small datasets. When confronted with larger datasets, which are inevitable for today's organisations, most if not all of these algorithms do not perform efficiently enough to meet the needs of real-time, big-data-driven decision making. We therefore attempt to solve these efficiency problems by proposing the VAMR (Vertical-Apriori Map-reduce) algorithm. VAMR is based on a data attribute identifier, which is exploited as a capability metric for mining frequent item-sets from a large dataset on a single node (for example, in a single-site enterprise) that has no distributed or parallel computing environment. Our evaluations using synthetic datasets and data from a public repository suggest that the VAMR algorithm can offer superior efficiency in mining frequent item-sets from large transaction datasets.
DOI: 10.1109/ICIEA.2015.7334093