Query evaluation techniques for large databases

G Graefe - ACM Computing Surveys (CSUR), 1993 - dl.acm.org
Database management systems will continue to manage large data volumes. Thus, efficient
algorithms for accessing and manipulating large sets and sequences will be required to …

A survey of data partitioning and sampling methods to support big data analysis

MS Mahmud, JZ Huang, S Salloum… - Big Data Mining and …, 2020 - ieeexplore.ieee.org
Computer clusters with the shared-nothing architecture are the major computing platforms
for big data processing and analysis. In cluster computing, data partitioning and sampling …

Split learning for health: Distributed deep learning without sharing raw patient data

P Vepakomma, O Gupta, T Swedish… - arXiv preprint arXiv …, 2018 - arxiv.org
Can health entities collaboratively train deep learning models without sharing sensitive raw
data? This paper proposes several configurations of a distributed deep learning method …

Searchable symmetric encryption with forward search privacy

J Li, Y Huang, Y Wei, S Lv, Z Liu… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Searchable symmetric encryption (SSE) has been widely applied in the encrypted database
for queries in practice. Although SSE is powerful and feature-rich, it is always plagued by …

Distributed hierarchical gpu parameter server for massive scale deep learning ads systems

W Zhao, D Xie, R Jia, Y Qian, R Ding… - … of Machine Learning …, 2020 - proceedings.mlsys.org
Neural networks of ads systems usually take input from multiple resources, eg query-ad
relevance, ad features and user portraits. These inputs are encoded into one-hot or multi-hot …

[图书][B] Principles of distributed database systems

MT Özsu, P Valduriez - 1999 - Springer
The first edition of this book appeared in 1991 when the technology was new and there were
not too many products. In the Preface to the first edition, we had quoted Michael Stonebraker …

Netherite: Efficient execution of serverless workflows

S Burckhardt, B Chandramouli, C Gillum… - Proceedings of the …, 2022 - dl.acm.org
Serverless is a popular choice for cloud service architects because it can provide scalability
and load-based billing with minimal developer effort. Functions-as-a-service (FaaS) are …

[图书][B] In-memory data management: technology and applications

H Plattner, A Zeier - 2012 - books.google.com
In the last fifty years the world has been completely transformed through the use of IT. We
have now reached a new inflection point. This book presents, for the first time, how in …

Distributed database design methodologies

S Ceri, B Pernici, G Wiederhold - Proceedings of the IEEE, 1987 - ieeexplore.ieee.org
This paper surveys methodological approaches for distributed database design. The design
of distribution can be performed top-down or bottom-up; the first approach is typical of a …

Hyrise: a main memory hybrid storage engine

M Grund, J Krüger, H Plattner, A Zeier… - Proceedings of the …, 2010 - dl.acm.org
In this paper, we describe a main memory hybrid database system called HYRISE, which
automatically partitions tables into vertical partitions of varying widths depending on how the …