For Full-Text PDF, please login, if you are a member of IEICE,|
or go to Pay Per View on menu list, if you are a nonmember of IEICE.
Strategy for XML Integration Using Similarity in Structure and Content
Youn Hee KIM Byung Gon KIM Jaeho LEE Hae Chull LIM
IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2004/06/01
Print ISSN: 0916-8508
Type of Manuscript: Special Section PAPER (Special Section on Papers Selected from 2003 International Technical Conference on Circuits/Systems, Computers and Communications (ITC-CSCC 2003))
XML, integration, similarity, storage model,
Full Text: PDF>>
Most of the existing studies on storing and searching XML documents effectively manipulate each XML document independently. Therefore, techniques for storing XML documents together that have similar meaning or structure are required for efficiency. Also, as a unified access method for various XML storage systems that have different storage forms, studies to integrate the DTD or XML schema of each storage system into one are required, because many XML documents do not have a particular DTD or XML schema, or XML documents can be written in various ways. Therefore, studies on the integration techniques for XML instances are needed. The XML integration technique can be used effectively in the case of constructing a data warehouse for heterogeneous XML storage systems. The proposed integration techniques remove the space duplicated for the same elements in XML documents. The proposed techniques significantly reduce the search time for general queries on the XML documents because it stores the related parts in XML documents close.