Automata for XML—a survey

T Schwentick - Journal of Computer and System Sciences, 2007 - Elsevier
Automata play an important role for the theoretical foundations of XML data management,
but also in tools for various XML processing tasks. This survey article aims to give an …

XPath leashed

M Benedikt, C Koch - ACM Computing Surveys (CSUR), 2009 - dl.acm.org
This survey gives an overview of formal results on the XML query language XPath. We
identify several important fragments of XPath, focusing on subsets of XPath 1.0. We then …

[图书][B] Graph structure and monadic second-order logic: a language-theoretic approach

B Courcelle, J Engelfriet - 2012 - books.google.com
The study of graph structure has advanced in recent years with great strides: finite graphs
can be described algebraically, enabling them to be constructed out of more basic elements …

Taxonomy of XML schema languages using formal language theory

M Murata, D Lee, M Mani, K Kawaguchi - ACM Transactions on Internet …, 2005 - dl.acm.org
On the basis of regular tree grammars, we present a formal framework for XML schema
languages. This framework helps to describe, compare, and implement such schema …

Finite state machines for strings over infinite alphabets

F Neven, T Schwentick, V Vianu - ACM Transactions on Computational …, 2004 - dl.acm.org
Motivated by formal models recently proposed in the context of XML, we study automata and
logics on strings over infinite alphabets. These are conservative extensions of classical …

[图书][B] Compiler design

R Wilhelm, D Maurer - 1995 - Springer
Compilers for high-level programming languages are large and complex software systems.
They have, however, several distinct properties by which they favorably differ from most …

Containment and equivalence for an XPath fragment

G Miklau, D Suciu - Proceedings of the twenty-first ACM SIGMOD …, 2002 - dl.acm.org
XPath is a simple language for navigating an XML document and selecting a set of element
nodes. XPath expressions are used to query XML data, describe key constraints, express …

Monadic datalog and the expressive power of languages for web information extraction

G Gottlob, C Koch - Journal of the ACM (JACM), 2004 - dl.acm.org
Research on information extraction from Web pages (wrapping) has seen much activity
recently (particularly systems implementations), but little work has been done on formally …

Document spanners: A formal approach to information extraction

R Fagin, B Kimelfeld, F Reiss… - Journal of the ACM (JACM …, 2015 - dl.acm.org
An intrinsic part of information extraction is the creation and manipulation of relations
extracted from text. In this article, we develop a foundational framework where the central …

DTD inference for views of XML data

Y Papakonstantinou, V Vianu - … of the nineteenth ACM SIGMOD-SIGACT …, 2000 - dl.acm.org
We study the inference of Data Type Definitions (DTDs) for views of XML data, using an
abstraction that focuses on document content structure. The views are defined by a query …