Abstract
We propose a framework for integrating data from multiple relational sources into an XML document that both conforms to a given DTD and satisfies predefined XML constraints. The framework is based on a specification language, AIG, that extends a DTD by (1) associating element types with semantic attributes (inherited and synthesized, inspired by the corresponding notions from Attribute Grammars), (2) computing these attributes via parameterized SQL queries over multiple data sources, and (3) incorporating XML keys and inclusion constraints. The novelty of AIG consists in semantic attributes and their dependency relations for controlling context-dependent, DTD-directed construction of XML documents, as well as for checking XML constraints in parallel with document-generation. We also present cost-based optimization techniques for efficiently evaluating AIGs, including algorithms for merging queries and for scheduling queries on multiple data sources. This provides a new grammar-based approach for data integration under both syntactic and semantic constraints.
Original language | English (US) |
---|---|
Title of host publication | Proceedings of the ACM SIGMOD International Conference on Management of Data |
Editors | A.Y. Halevy, Z.G. Ives, A.H. Doan |
Pages | 277-288 |
Number of pages | 12 |
State | Published - 2003 |
Event | 2003 ACM SIGMOD International Conference on Management of Data - San Diego, CA, United States Duration: Jun 9 2003 → Jun 12 2003 |
Other
Other | 2003 ACM SIGMOD International Conference on Management of Data |
---|---|
Country/Territory | United States |
City | San Diego, CA |
Period | 6/9/03 → 6/12/03 |
ASJC Scopus subject areas
- General Computer Science