PlinyCompute: A platform for high-performance, distributed, data-intensive tool development- 学术资源搜索

PlinyCompute: A platform for high-performance, distributed, data-intensive tool development

J Zou, RM Barnett, T Lorido-Botran, S Luo… - Proceedings of the …, 2018 - dl.acm.org

J Zou, RM Barnett, T Lorido-Botran, S Luo, C Monroy, S Sikdar, K Teymourian, B Yuan…

Proceedings of the 2018 International Conference on Management of Data, 2018•dl.acm.org

This paper describes PlinyCompute, a system for development of high-performance, data-intensive, distributed computing tools and libraries. \emphIn the large, PlinyCompute presents the programmer with a very high-level, declarative interface, relying on automatic, relational-database style optimization to figure out how to stage distributed computations. However, in the small, PlinyCompute presents the capable systems programmer with a persistent object data model and API (the "PC object model'') and associated memory management system that has been designed from the ground-up for high performance, distributed, data-intensive computing. This contrasts with most other Big Data systems, which are constructed on top of the Java Virtual Machine (JVM), and hence must at least partially cede performance-critical concerns such as memory management (including layout and de/allocation) and virtual method/function dispatch to the JVM. This hybrid approach---declarative in the large, trusting the programmer's ability to utilize PC object model efficiently in the small---results in a system that is ideal for the development of reusable, data-intensive tools and libraries.

ACM Digital Library

展开收起

被引用次数：25 相关文章所有 8 个版本

以上显示的是最相近的搜索结果。查看全部搜索结果