Subsampling and jackknifing: a practically convenient solution for large data analysis with...

J Yu, M Ai, Z Ye - Statistical Papers, 2024 - Springer

Subsampling focuses on selecting a subsample that can efficiently sketch the information of
the original data in terms of statistical inference. It provides a powerful tool in big data …

被引用次数：32 相关文章所有 4 个版本

[PDF] tandfonline.com

A selective review on statistical methods for massive data computation: distributed computing, subsampling, and minibatch techniques

X Li, Y Gao, H Chang, D Huang, Y Ma… - Statistical Theory and …, 2024 - Taylor & Francis

This paper presents a selective review of statistical computation methods for massive data
analysis. A huge amount of statistical methods for massive data computation have been …

被引用次数：3 相关文章所有 5 个版本

Supervised Stratified Subsampling for Predictive Analytics

MC Chang - Journal of Computational and Graphical Statistics, 2024 - Taylor & Francis

Predictive analytics involves the use of statistical models to make predictions; however, the
power of these techniques is hindered by ever-increasing quantities of data. The richness …

[PDF] arxiv.org

CluBear: a subsampling package for interactive statistical analysis with massive data on a single machine

K Xu, Y Zhu, Y Liu, H Wang - Communications in Statistics …, 2024 - Taylor & Francis

This article introduces CluBear, a Python-based open-source package for interactive
massive data analysis. The key feature of CluBear is that it enables users to conduct …

On the asymptotic properties of a bagging estimator with a massive dataset

Y Gao, R Zhang, H Wang - Stat, 2022 - Wiley Online Library

Bagging is a useful method for large‐scale statistical analysis, especially when the
computing resources are very limited. We study here the asymptotic properties of bagging …

被引用次数：1 相关文章所有 6 个版本

[PDF] arxiv.org

[PDF] uce.edu.ec

[PDF][PDF] Tendencia corrosiva por CO2 del gas natural basada en su composición mediante Redes Neuronales Artificiales CO2 corrosion trend of natural gas based on …

TD Marín-Velásquez - Investigación y Desarrollo - revistadigital.uce.edu.ec

Para el desarrollo de la investigación se obtuvo una muestra de 46 cromatografías de gas
natural de bases de datos de Petróleos de Venezuela (PDVSA) de la región oriental de …