Data squashing as preprocessing in association rule mining

Bibliographic Details
Published in: 2022 IEEE Symposium Series on Computational Intelligence (SSCI), pp. 1720-1725
Main Authors: Fister, Iztok; Novak, Damijan; Verber, Domen
Format: Conference Proceeding
Language: English
Published: IEEE 04.12.2022
Description
Summary: Data squashing is a well-known preprocessing method in machine learning that constructs a smaller dataset from the original one while yielding approximately the same data analysis results. The paper proposes a new data squashing method for association rule mining based on cosine similarity and Euclidean distance. The method was applied to three datasets from the UCI Machine Learning Repository. The results showed that the proposed data squashing method is effective, scalable, and easy to use, and therefore has considerable potential for use in practice.
DOI: 10.1109/SSCI51031.2022.10022240
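
Illustration: The summary names the two similarity measures but gives no algorithmic detail, so the following is only a minimal sketch of one way similarity-based squashing could work, not the authors' method. It assumes numeric rows, a greedy single pass, and that a row is absorbed by an existing representative when both measures agree. The function name `squash` and the thresholds `min_cosine` and `max_distance` are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two numeric vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def squash(rows, min_cosine=0.99, max_distance=0.5):
    """Greedy squashing sketch (assumed procedure, hypothetical thresholds).

    A row is merged into the first representative that is both
    cosine-similar and close in Euclidean distance; otherwise it
    becomes a new representative. Weights count the merged rows.
    """
    representatives = []
    weights = []
    for row in rows:
        for i, rep in enumerate(representatives):
            similar = cosine_similarity(row, rep) >= min_cosine
            close = math.dist(row, rep) <= max_distance
            if similar and close:
                weights[i] += 1
                break
        else:
            # No existing representative matched, so keep this row.
            representatives.append(row)
            weights.append(1)
    return representatives, weights


rows = [(1.0, 2.0), (1.1, 2.1), (5.0, 0.5), (1.0, 2.05), (5.1, 0.4)]
reps, weights = squash(rows)
print(reps)     # [(1.0, 2.0), (5.0, 0.5)]
print(weights)  # [3, 2]
```

Keeping a weight per representative is a design choice of this sketch: it lets a downstream miner scale support counts back to the size of the original dataset.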