A Genetic Programming Approach to Record Deduplication
Several systems that rely on consistent data to offer high-quality services, such as digital libraries and e-commerce brokers, may be affected by the existence of duplicates, quasi replicas, or near-duplicate entries in their repositories. Because of that, there have been significant investments fro...
Saved in:
| Published in: | IEEE transactions on knowledge and data engineering Vol. 24; no. 3; pp. 399 - 412 |
|---|---|
| Main Authors: | , , , |
| Format: | Journal Article |
| Language: | English |
| Published: |
IEEE
01.03.2012
|
| Subjects: | |
| ISSN: | 1041-4347 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Be the first to leave a comment!