Peer-reviewed article

Query generation for retrieving data from distributed semistructured documents using a metadata interface

2008; Elsevier BV; Volume: 35; Issue: 4; Language: English

10.1016/j.cl.2008.09.002

ISSN

1873-6866

Authors

Gui-Ja Choe, Young-Kwang Nam, Joseph A. Goguen, Guilian Wang

Topic(s)

Web Data Mining and Analysis

Abstract

We describe a method for generating queries that retrieve data from distributed, heterogeneous semistructured documents, and its implementation in the metadata interface DDXMI (distributed document XML metadata interchange). From a user query over the global schema, the proposed system generates local queries appropriate to each local schema. It relies on mappings between the global schema and the local schemas (extracted from the local documents if not given), on path substitution, and on node identification, which resolves the heterogeneity among nodes with the same label that often exist in semistructured data. The system uses Quilt as its XML query language. We report an experiment over three local semistructured documents ('thesis', 'reports', and 'journal') with 'article' as the global schema. The prototype was developed on Windows using Java and JavaCC.
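The path-substitution idea in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the mapping table, the element paths, and the function name `generate_local_queries` are hypothetical and are not taken from DDXMI, and the output is a list of plain paths rather than Quilt syntax. Keying the table by full path instead of by label is one simple way to keep same-label nodes apart (for example, a `name` under `author` versus a `name` under `publisher`); the paper's actual node-identification scheme may differ.

```python
# Hypothetical mapping table: for each local document, global-schema path -> local path.
# Paths are invented for illustration; only the document names come from the abstract.
MAPPINGS = {
    "thesis": {
        "/article/title": "/thesis/title",
        "/article/author/name": "/thesis/writer",
    },
    "reports": {
        "/article/title": "/reports/report/heading",
        "/article/author/name": "/reports/report/author/fullname",
    },
    "journal": {
        # No author mapping here, so queries that need the author skip this source.
        "/article/title": "/journal/paper/title",
    },
}


def generate_local_queries(global_paths, mappings):
    """Substitute each global path with its local counterpart, per local document.

    A local document is included only if every requested global path has a
    mapping for it; otherwise it cannot answer the query and is left out.
    """
    local_queries = {}
    for doc, mapping in mappings.items():
        if all(path in mapping for path in global_paths):
            local_queries[doc] = [mapping[path] for path in global_paths]
    return local_queries


if __name__ == "__main__":
    wanted = ["/article/title", "/article/author/name"]
    for doc, paths in generate_local_queries(wanted, MAPPINGS).items():
        print(doc, paths)
```

In this sketch a query over title and author name is rewritten for 'thesis' and 'reports' only, because the hypothetical 'journal' mapping has no counterpart for the author path.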

Reference(s)