IMAX: Incremental maintenance of schema-based XML statistics

Maya Ramanath, Lingzhi Zhang, Juliana Freire, Jayant R. Haritsa

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Current approaches for estimating the cardinality of XML queries are applicable to a static scenario wherein the underlying XML data does not change subsequent to the collection of statistics on the repository. However, in practice, many XML-based applications are dynamic and involve frequent updates to the data. In this paper, we investigate efficient strategies for incrementally maintaining statistical summaries as and when updates are applied to the data. Specifically, we propose algorithms that handle both the addition of new documents as well as random insertions in the existing document trees. We also show, through a detailed performance evaluation, that our incremental techniques are significantly faster than the naive recomputation approach; and that estimation accuracy can be maintained even with a fixed memory budget.

Original languageEnglish (US)
Title of host publicationProceedings - 21st International Conference on Data Engineering, ICDE 2005
Pages273-284
Number of pages12
DOIs
StatePublished - 2005
Event21st International Conference on Data Engineering, ICDE 2005 - Tokyo, Japan
Duration: Apr 5 2005Apr 8 2005

Publication series

NameProceedings - International Conference on Data Engineering
ISSN (Print)1084-4627

Other

Other21st International Conference on Data Engineering, ICDE 2005
CountryJapan
CityTokyo
Period4/5/054/8/05

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Information Systems

Fingerprint Dive into the research topics of 'IMAX: Incremental maintenance of schema-based XML statistics'. Together they form a unique fingerprint.

  • Cite this

    Ramanath, M., Zhang, L., Freire, J., & Haritsa, J. R. (2005). IMAX: Incremental maintenance of schema-based XML statistics. In Proceedings - 21st International Conference on Data Engineering, ICDE 2005 (pp. 273-284). (Proceedings - International Conference on Data Engineering). https://doi.org/10.1109/ICDE.2005.75