xCrawl: a high-recall crawling method for Web mining

Web mining systems exploit the redundancy of data published on the Web to automatically extract information from existing Web documents. The first step in the Information Extraction process is thus to locate as many Web pages as possible that contain relevant information within a limited period of t...

Bibliographic Details
Published in:	Knowledge and information systems Vol. 25; No. 2; pp. 303-326
Main Authors:	Shchekotykhin, Kostyantyn; Jannach, Dietmar; Friedrich, Gerhard
Format:	Journal Article
Language:	English
Published:	London Springer-Verlag 1 November 2010 Springer Springer Nature B.V
Subjects:	Algorithms | Analysis | Applied sciences | Automation | Computer Science | Computer science; control theory; systems | Computer systems and distributed systems. User interface | Data mining | Data Mining and Knowledge Discovery | Data processing. List processing. Character string processing | Database Management | Descriptions | Digital cameras | Exact sciences and technology | Extraction | Hierarchies | Information retrieval | Information Storage and Retrieval | Information systems | Information Systems and Communication Service | Information Systems Applications (incl. Internet) | Information systems. Data bases | IT in Business | Memory organisation. Data processing | Mining | Recall | Regular Paper | Search engines | Searches | Software | Studies | URLs | Websites | Web crawling | Information extraction | Web mining | Data analysis | Extraction process | Redundancy | Electronic document | Information browsing | World wide web | Automatic generation | Internet | Web site
ISSN:	0219-1377, 0219-3116
Online Access:	Full text