A flexible infrastructure for gathering XML statistics and estimating query cardinality

Juliana Freire, Maya Ramanath, Lingzhi Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The XML statistics collection process and result estimation of user cardinalities queries was discussed. The estimated cardinalities which included query optimizations and cost-based storage design were needed in variety of tasks and were used to give users a early feedback about the expected outcome of their queries. The result estimator, StatiX, uses specialized data structures and estimation algorithms. It uses histograms to capture uniformly the structural and value skew presented in documents, and also leverages schema information to produce high-quality and concise statistical summaries.

Original languageEnglish (US)
Title of host publicationProceedings - 20th International Conference on Data Engineering - ICDE 2004
Pages857
Number of pages1
Volume20
DOIs
StatePublished - 2004
EventProceedings - 20th International Conference on Data Engineering - ICDE 2004 - Boston, MA., United States
Duration: Mar 30 2004Apr 2 2004

Other

OtherProceedings - 20th International Conference on Data Engineering - ICDE 2004
Country/TerritoryUnited States
CityBoston, MA.
Period3/30/044/2/04

ASJC Scopus subject areas

  • Software
  • General Engineering
  • Engineering (miscellaneous)

Fingerprint

Dive into the research topics of 'A flexible infrastructure for gathering XML statistics and estimating query cardinality'. Together they form a unique fingerprint.

Cite this