An automatic load/extract scheme for XML documents through object-relational repositories

Extensible markup language (XML), a simplified version of standard generalized markup language (SGML), is designed to enable electronic text interchange in the Internet. XML documents have a rigorously described structure that may be analyzed by computers and easily understood by humans. Most curren...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Journal of systems and software Jg. 64; H. 3; S. 207 - 218
Hauptverfasser: Tseng, Frank S.C., Hwung, Wen-Jong
Format: Journal Article
Sprache:Englisch
Veröffentlicht: Elsevier Inc 15.12.2002
Schlagworte:
ISSN:0164-1212, 1873-1228
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Extensible markup language (XML), a simplified version of standard generalized markup language (SGML), is designed to enable electronic text interchange in the Internet. XML documents have a rigorously described structure that may be analyzed by computers and easily understood by humans. Most current approaches store XML documents in file systems or in relational database systems. However, the nature and the design of file system or relational database schema may cause limitations on fitting with XML document structure. In this paper, we present an automatic load/extract scheme to store and retrieve XML documents through object-relational databases. We propose an architecture, called XML meta-generator (XMG), which, after reading a specific document type definition (DTD), automatically generates the corresponding object-relational database schema (OR-Schema), a DI-Decomposer and a DI-Reconstructor, which are explained as follows: 1. OR-Schema––an object-relational database schema in UniSQL/X format for a specific DTD. 2. DI-Decomposer––a module decomposes XML document instances (DIs) according to the specific DTD format and stores the elements into the corresponding object-relational database. 3. DI-Reconstructor––a module retrieves elements from the object-relational database and reconstructs it to recover the original DI. These modules make XML documents be automatically decomposed into and reconstructed from object-relational databases in a seamless manner. Moreover, documents stored in the object-relational databases can be managed and inquired more easily than it could be in file systems or relational databases. Useful applications on various documents can also be easily built on top of the target database, such as digital libraries, data warehouses, and data or text mining systems.
ISSN:0164-1212
1873-1228
DOI:10.1016/S0164-1212(02)00044-4