A parallel random forest algorithm for big data in a Spark cloud computing environment

J Chen, K Li, Z Tang, K Bilal, S Yu… - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
With the emergence of the big data age, the issue of how to obtain valuable knowledge from
a dataset efficiently and accurately has attracted increasing attention from both academia …

Analytics, challenges and applications in big data environment: a survey

MR Bendre, VR Thool - Journal of Management Analytics, 2016 - Taylor & Francis
Big data refer to the massive amounts and varieties of information in the structured and
unstructured form, generated by social networking sites, biomedical equipment, financial …

A comparative study of cluster-based Big Data Cube implementations

AFM Caetano, CM Hirata, RR Silva - Future Generation Computer Systems, 2022 - Elsevier
Research on Data Cubes scalability is extensive, yet sparse. Scalable design
patterns for Data Cube implementations are a trend as the technology shifts from centralized …

Data aggregation processes: a survey, a taxonomy, and design guidelines

S Cai, B Gallina, D Nyström, C Seceleanu - Computing, 2019 - Springer
Data aggregation processes are essential constituents for data management in modern
computer systems, such as decision support systems and Internet of Things systems, many …

Analytic queries over geospatial time-series data using distributed hash tables

M Malensek, S Pallickara… - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
As remote sensing equipment and networked observational devices continue to proliferate,
their corresponding data volumes have surpassed the storage and processing capabilities …

An effective hot topic detection method for microblog on Spark

W Ai, K Li, K Li - Applied Soft Computing, 2018 - Elsevier
With the emergence of the big data age, methods for quickly and accurately obtaining
valuable hot topics from the vast amount of digitized textual material have attracted much …

Fast, ad hoc query evaluations over multidimensional geospatial datasets

M Malensek, S Pallickara… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
Networked observational devices and remote sensing equipment continue to proliferate and
contribute to the accumulation of extreme-scale datasets. Both the rate and resolution of the …

Hybrid parallel linguistic fuzzy rules with canopy MapReduce for big data classification in cloud

V Vennila, AR Kannan - International Journal of Fuzzy Systems, 2019 - Springer
With the increasing availability of large amounts of information and the benefits related to
data processing, big data have gained large significance in recent years. With scalable …

Temporal big data analytics: New frontiers for big data analytics research (panel description)

A Cuzzocrea - 28th International Symposium on Temporal …, 2021 - drops.dagstuhl.de
Big data analytics is an emerging research area with many sophisticated contributions in the
actual literature. Big data analytics aims at discovering actionable knowledge from large …

Overlay indexes: Efficiently supporting aggregate range queries and authenticated data structures in off-the-shelf databases

D Pennino, M Pizzonia, A Papi - IEEE Access, 2019 - ieeexplore.ieee.org
Commercial off-the-shelf DataBase Management Systems (DBMSes) are highly optimized to
process a wide range of queries by means of carefully designed indexing and query …