The big data system, components, tools, and technologies: a survey

TR Rao, P Mitra, R Bhatt, A Goswami - Knowledge and Information …, 2019 - Springer
The traditional databases are not capable of handling unstructured data and high volumes
of real-time datasets. Diverse datasets are unstructured lead to big data, and it is laborious …

TiDB: a Raft-based HTAP database

D Huang, Q Liu, Q Cui, Z Fang, X Ma, F Xu… - Proceedings of the …, 2020 - dl.acm.org
Hybrid Transactional and Analytical Processing (HTAP) databases require processing
transactional and analytical queries in isolation to remove the interference between them. To …

A big data system supporting bosch braga industry 4.0 strategy

MY Santos, JO e Sá, C Andrade, FV Lima… - International Journal of …, 2017 - Elsevier
People, devices, infrastructures and sensors can constantly communicate exchanging data
and generating new data that trace many of these exchanges. This leads to vast volumes of …

Data blocks: Hybrid OLTP and OLAP on compressed storage using both vectorization and compilation

H Lang, T Mühlbauer, F Funke, PA Boncz… - Proceedings of the …, 2016 - dl.acm.org
This work aims at reducing the main-memory footprint in high performance hybrid OLTP &
OLAP databases, while retaining high query performance and transactional throughput. For …

Constant overhead quantum fault tolerance with quantum expander codes

O Fawzi, A Grospellier, A Leverrier - Communications of the ACM, 2020 - dl.acm.org
The threshold theorem is a seminal result in the field of quantum computing asserting that
arbitrarily long quantum computations can be performed on a faulty quantum computer …

A deep dive into common open formats for analytical dbmss

C Liu, A Pavlenko, M Interlandi, B Haynes - Proceedings of the VLDB …, 2023 - dl.acm.org
This paper evaluates the suitability of Apache Arrow, Parquet, and ORC as formats for
subsumption in an analytical DBMS. We systematically identify and explore the high-level …

Big data processing tools: An experimental performance evaluation

M Rodrigues, MY Santos… - … Reviews: Data Mining …, 2019 - Wiley Online Library
Big Data is currently a hot topic of research and development across several business areas
mainly due to recent innovations in information and communication technologies. One of the …

An empirical evaluation of columnar storage formats

X Zeng, Y Hui, J Shen, A Pavlo, W McKinney… - arXiv preprint arXiv …, 2023 - arxiv.org
Columnar storage is a core component of a modern data analytics system. Although many
database management systems (DBMSs) have proprietary storage formats, most provide …

High-speed query processing over high-speed networks

W Rödiger, T Mühlbauer, A Kemper… - arXiv preprint arXiv …, 2015 - arxiv.org
Modern database clusters entail two levels of networks: connecting CPUs and NUMA
regions inside a single server in the small and multiple servers in the large. The huge …

Translation of relational and non-relational databases into RDF with xR2RML

F Michel, L Djimenou, CF Zucker… - … Confenrence on Web …, 2015 - hal.science
With the growing amount of data being continuously produced, it is crucial to come up with
solutions to expose data from ever more heterogeneous databases (eg NoSQL systems) as …