The history of histograms (abridged)

Y Ioannidis - Proceedings 2003 VLDB Conference, 2003 - Elsevier
Publisher Summary: The history of histograms is long and rich, full of detailed information in
every step. It includes the course of histograms in different scientific fields, the successes …

Learning deterministic regular expressions for the inference of schemas from XML data

GJ Bex, W Gelade, F Neven… - ACM Transactions on the …, 2010 - dl.acm.org
Inferring an appropriate DTD or XML Schema Definition (XSD) for a given collection of XML
documents essentially reduces to learning deterministic regular expressions from sets of …

Inferring XML schema definitions from XML data

GJ Bex, F Neven, S Vansummeren - … conference on Very large data bases, 2007 - Citeseer
Although the presence of a schema enables many optimizations for operations on XML
documents, recent studies have shown that many XML documents in practice either do not …

Dynamic XML documents with distribution and replication

S Abiteboul, A Bonifati, G Cobena… - Proceedings of the …, 2003 - dl.acm.org
The advent of XML as a universal exchange format, and of Web services as a basis for
distributed computing, has fostered the apparition of a new class of documents: dynamic …

Approximate XML query answers

N Polyzotis, M Garofalakis, Y Ioannidis - Proceedings of the 2004 ACM …, 2004 - dl.acm.org
The rapid adoption of XML as the standard for data representation and exchange
foreshadows a massive increase in the amounts of XML data collected, maintained, and …

MARS: A system for publishing XML from mixed and redundant storage

A Deutsch, V Tannen - Proceedings 2003 VLDB Conference, 2003 - Elsevier
Publisher Summary: This chapter presents a system called mixed and redundant storage
(MARS) for publishing XML data from mixed proprietary storage, while supporting …

Structural XML query processing

R Bača, M Krátký, I Holubová, M Nečaský… - ACM Computing …, 2017 - dl.acm.org
Since the boom in new proposals on techniques for efficient querying of XML data is now
over and the research world has shifted its attention toward new types of data formats, we …

Statistical learning techniques for costing XML queries

N Zhang, PJ Haas, V Josifovski, GM Lohman… - Proceedings of the 31st …, 2005 - vldb.org
Developing cost models for query optimization is significantly harder for XML queries than
for traditional relational queries. The reason is that XML query operators are much more …

Structure and value synopses for XML data graphs

N Polyzotis, M Garofalakis - VLDB'02: Proceedings of the 28th International …, 2002 - Elsevier
Publisher Summary: This chapter proposes a novel XSKETCH graph synopsis model for
eXtensible Markup Language (XML) data graphs with raw data values. All existing …

Crossing the Structure Chasm.

AY Halevy, O Etzioni, AH Doan, ZG Ives… - …, 2003 - projectsweb.cs.washington.edu
Online information comes in two flavors: unstructured corpora of text on the one hand, and
structured data managed by databases and knowledge bases on the other. These two …