Big Data with Cloud Computing: an insight on the computing environment, MapReduce, and programming frameworks

A Fernández, S del Río, V López… - … : Data Mining and …, 2014 - Wiley Online Library
The term 'Big Data'has spread rapidly in the framework of Data Mining and Business
Intelligence. This new scenario can be defined by means of those problems that cannot be …

Reliability meets big data: opportunities and challenges

WQ Meeker, Y Hong - Quality engineering, 2014 - Taylor & Francis
Reliability field data such as that obtained from warranty claims and maintenance records
have been used traditionally for such purposes as generating predictions for warranty costs …

[图书][B] Interactive web-based data visualization with R, plotly, and shiny

C Sievert - 2020 - taylorfrancis.com
The richly illustrated Interactive Web-Based Data Visualization with R, plotly, and shiny
focuses on the process of programming interactive web graphics for multidimensional data …

Online updating of statistical inference in the big data setting

ED Schifano, J Wu, C Wang, J Yan, MH Chen - Technometrics, 2016 - Taylor & Francis
We present statistical methods for big data arising from online analytical processing, where
large amounts of data arrive in streams and require fast analysis without storage/access to …

Opening the black box: Strategies for increased user involvement in existing algorithm implementations

T Mühlbacher, H Piringer, S Gratzl… - IEEE transactions on …, 2014 - ieeexplore.ieee.org
An increasing number of interactive visualization tools stress the integration with
computational software like MATLAB and R to access a variety of proven algorithms. In …

Random sample partition: a distributed data model for big data analysis

S Salloum, JZ Huang, Y He - IEEE Transactions on Industrial …, 2019 - ieeexplore.ieee.org
With the ever-increasing volume of data, alternative strategies are required to divide big data
into statistically consistent data blocks that can be used directly as representative samples of …

Big data dimension reduction using PCA

T Zhang, B Yang - 2016 IEEE international conference on smart …, 2016 - ieeexplore.ieee.org
Principal component analysis (PCA) is a powerfultool in dimensional reduction for highly
correlated data. ClassicalPCA approaches cannot be applied to big data because ofmemory …

Renewable estimation and incremental inference in generalized linear models with streaming data sets

L Luo, PXK Song - Journal of the Royal Statistical Society Series …, 2020 - academic.oup.com
The paper presents an incremental updating algorithm to analyse streaming data sets using
generalized linear models. The method proposed is formulated within a new framework of …

Does incident solar ultraviolet radiation lower blood pressure?

RB Weller, Y Wang, J He, FW Maddux… - Journal of the …, 2020 - Am Heart Assoc
Background Hypertension remains a leading global cause for premature death and disease.
Most treatment guidelines emphasize the importance of risk factors, but not all are known …

[HTML][HTML] Statistical methods and computing for big data

C Wang, MH Chen, E Schifano, J Wu… - Statistics and its …, 2016 - ncbi.nlm.nih.gov
Big data are data on a massive scale in terms of volume, intensity, and complexity that
exceed the capacity of standard analytic tools. They present opportunities as well as …