A Genetic Programming Approach to Record Deduplication

Several systems that rely on consistent data to offer high-quality services, such as digital libraries and e-commerce brokers, may be affected by the existence of duplicates, quasi replicas, or near-duplicate entries in their repositories. Because of that, there have been significant investments fro...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on knowledge and data engineering Vol. 24; no. 3; pp. 399 - 412
Main Authors: de Carvalho, M. G., Laender, A. H. F., Goncalves, M. A., da Silva, A. S.
Format: Journal Article
Language:English
Published: IEEE 01.03.2012
Subjects:
ISSN:1041-4347
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Be the first to leave a comment!
You must be logged in first