Book chapter, peer-reviewed

Information Retrieval from Distributed Semistructured Documents Using Metadata Interface

2006; Springer Science+Business Media; Language: English

10.1007/11730262_8

ISSN

1611-3349

Authors

Gui-Ja Choe, Young-Kwang Nam, Joseph A. Goguen, Guilian Wang

Topic(s)

Service-Oriented Architecture and Web Services

Abstract

We describe a method for retrieving information from distributed, heterogeneous semistructured documents and its implementation in the metadata interface DDXMI (Distributed Document XML Metadata Interface). Given a user query over the global schema, the system generates local queries suited to each local schema and shows the results of those queries. Three components generate the local queries: mappings between the global schema and the local schemas (extracted from the local documents if not given), path substitution, and node identification, which resolves the heterogeneity among nodes that share a label, a common occurrence in semistructured data. The system uses Quilt as its XML query language. We report an experiment over three local semistructured documents (‘thesis’, ‘reports’, and ‘journal’) with an ‘article’ global schema. The prototype was developed on Windows with Java and JavaCC.
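
To make the path-substitution step concrete, the sketch below rewrites the paths of a global query into local paths using a per-document mapping table. It is only an illustration under stated assumptions: the mapping entries, the path strings, and the function names `substitute_paths` and `generate_local_queries` are hypothetical and do not come from the chapter. It also works on plain path strings rather than Quilt syntax and leaves out node identification.

```python
# Hypothetical mappings from global-schema paths to local-schema paths.
# None of these paths come from the chapter; they only illustrate the idea.
MAPPINGS = {
    "thesis": {
        "/article/title": "/thesis/title",
        "/article/author": "/thesis/author/name",
    },
    "reports": {
        "/article/title": "/reports/report/heading",
        "/article/author": "/reports/report/writer",
    },
}


def substitute_paths(global_paths, local_mapping):
    """Rewrite each global path to its local counterpart.

    Returns None when any path has no local mapping, meaning this
    local document cannot answer the query.
    """
    local_paths = []
    for path in global_paths:
        if path not in local_mapping:
            return None
        local_paths.append(local_mapping[path])
    return local_paths


def generate_local_queries(global_paths, mappings):
    """Build one rewritten path list per local document that covers the query."""
    queries = {}
    for doc_name, local_mapping in mappings.items():
        rewritten = substitute_paths(global_paths, local_mapping)
        if rewritten is not None:
            queries[doc_name] = rewritten
    return queries
```

Calling `generate_local_queries(["/article/title", "/article/author"], MAPPINGS)` yields one rewritten path list for each of the two hypothetical local documents. A query that uses a path missing from every mapping yields an empty dictionary.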
