A Scalable Similarity Join Algorithm Based on MapReduce and LSH

Similarity joins are recognized to be among the most useful data processing and analysis operations. A similarity join is used to retrieve all data pairs whose distances are smaller than a predefined threshold λ . In this paper, we introduce the MRS-join algorithm to perform similarity joins on larg...

Full description

Saved in:
Bibliographic Details
Published in:International journal of parallel programming Vol. 50; no. 3-4; pp. 360 - 380
Main Authors: Rivault, Sébastien, Bamha, Mostafa, Limet, Sébastien, Robert, Sophie
Format: Journal Article
Language:English
Published: New York Springer US 01.08.2022
Springer Nature B.V
Springer Verlag
Subjects:
ISSN:0885-7458, 1573-7640
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Be the first to leave a comment!
You must be logged in first