Automating the extraction of data from HTML tables with unknown structure

Data on the Web in HTML tables is mostly structured, but we usually do not know the structure in advance. Thus, we cannot directly query for data of interest. We propose a solution to this problem based on document-independent extraction ontologies. Our solution entails elements of table understandi...

Full description

Saved in:
Bibliographic Details
Published in:Data & knowledge engineering Vol. 54; no. 1; pp. 3 - 28
Main Authors: Embley, David W., Tao, Cui, Liddle, Stephen W.
Format: Journal Article
Language:English
Published: Elsevier B.V 01.07.2005
Subjects:
ISSN:0169-023X, 1872-6933
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Be the first to leave a comment!
You must be logged in first