Bibliographic Details
Title: [Untitled]
Other Authors: The Pennsylvania State University CiteSeerX Archives
Source: http://www.vldb.org/conf/1995/P263.PDF.
Collection: CiteSeerX
Subjects: file classification
Description: Semi-structured documents (e.g. journal articles, electronic mail, television programs, mail order catalogs, ...) are often not explicitly typed; the only available type information is the implicit structure. An explicit type, however, is needed in order to apply object-oriented technology, like type-specific methods. In this paper, we present an experimental vector space classifier for determining the type of semi-structured documents. Our goal was to design a high-performance classifier in terms of accuracy (recall and precision), speed, and extensibility.
Publication Type: text
Language: English
Relation: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.97.6471; http://www.vldb.org/conf/1995/P263.PDF
Availability: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.97.6471
http://www.vldb.org/conf/1995/P263.PDF
Rights: Metadata may be used without restrictions as long as the oai identifier remains attached to it.
Document Code: edsbas.BC36F7CA
Database: BASE