Implementation of Indexing Techniques to Prevent Data Leakage and Duplication in Internet

Research in this area aims to create a new and efficient method for detecting near-duplicates in online content. Web pages that a search engine has scoured are first parsed to remove HTML elements and java scripts. After this phase, remove common keywords or stop words from the crawled pages. The af...

Full description

Saved in:
Bibliographic Details
Published in:2022 International Conference on Advances in Computing, Communication and Applied Informatics (ACCAI) pp. 1 - 9
Main Authors: Nalini, M. K., K, Dhinakaran, D, Elantamilan, Gnanavel, R., Vinod, D.
Format: Conference Proceeding
Language:English
Published: IEEE 28.01.2022
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Be the first to leave a comment!
You must be logged in first