PlinyCompute: A platform for high-performance, distributed, data-intensive tool development

J Zou, RM Barnett, T Lorido-Botran, S Luo… - Proceedings of the …, 2018 - dl.acm.org
Proceedings of the 2018 International Conference on Management of Data, 2018
This paper describes PlinyCompute, a system for development of high-performance, data-intensive, distributed computing tools and libraries. In the large, PlinyCompute presents the programmer with a very high-level, declarative interface, relying on automatic, relational-database-style optimization to figure out how to stage distributed computations. However, in the small, PlinyCompute presents the capable systems programmer with a persistent object data model and API (the "PC object model") and an associated memory management system that has been designed from the ground up for high-performance, distributed, data-intensive computing. This contrasts with most other Big Data systems, which are constructed on top of the Java Virtual Machine (JVM), and hence must at least partially cede performance-critical concerns such as memory management (including layout and de/allocation) and virtual method/function dispatch to the JVM. This hybrid approach (declarative in the large, trusting the programmer's ability to utilize the PC object model efficiently in the small) results in a system that is ideal for the development of reusable, data-intensive tools and libraries.