[图书][B] Probabilistic databases

D Suciu, D Olteanu, C Ré, C Koch - 2022 - books.google.com
Probabilistic databases are databases where the value of some attributes or the presence of
some records are uncertain and known only with some probability. Applications in many …

The MADlib analytics library or MAD skills, the SQL

J Hellerstein, C Ré, F Schoppmann, DZ Wang… - arXiv preprint arXiv …, 2012 - arxiv.org
MADlib is a free, open source library of in-database analytic methods. It provides an
evolving suite of SQL-based algorithms for machine learning, data mining and statistics that …

Towards a unified architecture for in-RDBMS analytics

X Feng, A Kumar, B Recht, C Ré - Proceedings of the 2012 ACM …, 2012 - dl.acm.org
The increasing use of statistical data analysis in enterprise applications has created an arms
race among database vendors to offer ever more sophisticated in-database analytics. One …

Efficient query answering in probabilistic RDF graphs

X Lian, L Chen - Proceedings of the 2011 ACM SIGMOD International …, 2011 - dl.acm.org
In this paper, we tackle the problem of efficiently answering queries on probabilistic RDF
data graphs. Specifically, we model RDF data by probabilistic graphs, and an RDF query is …

In-RDBMS hardware acceleration of advanced analytics

D Mahajan, JK Kim, J Sacks, A Ardalan… - arXiv preprint arXiv …, 2018 - arxiv.org
The data revolution is fueled by advances in machine learning, databases, and hardware
design. Programmable accelerators are making their way into each of these areas …

Hybrid in-database inference for declarative information extraction

DZ Wang, MJ Franklin, M Garofalakis… - Proceedings of the …, 2011 - dl.acm.org
In the database community, work on information extraction (IE) has centered on two themes:
how to effectively manage IE tasks, and how to manage the uncertainties that arise in the IE …

Efficient and effective similarity search over probabilistic data based on earth mover's distance

J Xu, Z Zhang, AKH Tung, G Yu - The VLDB Journal, 2012 - Springer
Advances in geographical tracking, multimedia processing, information extraction, and
sensor networks have created a deluge of probabilistic data. While similarity search is an …

[PDF][PDF] Automatic knowledge base construction using probabilistic extraction, deductive reasoning, and human feedback

DZ Wang, Y Chen, S Goldberg… - Proceedings of the Joint …, 2012 - aclanthology.org
We envision an automatic knowledge base construction system consisting of three
interrelated components. MADDEN is a knowledge extraction system applying statistical text …

Asynchronous complex analytics in a distributed dataflow architecture

JE Gonzalez, P Bailis, MI Jordan, MJ Franklin… - arXiv preprint arXiv …, 2015 - arxiv.org
Scalable distributed dataflow systems have recently experienced widespread adoption, with
commodity dataflow engines such as Hadoop and Spark, and even commodity SQL engines …

Optimizing statistical information extraction programs over evolving text

F Chen, X Feng, C Ré, M Wang - 2012 IEEE 28th International …, 2012 - ieeexplore.ieee.org
Statistical information extraction (IE) programs are increasingly used to build real-world IE
systems such as Alibaba, CiteSeer, Kylin, and YAGO. Current statistical IE approaches …