Big data and big data analytics

Y Shi, Y Shi - Advances in Big Data Analytics: Theory, Algorithms …, 2022 - Springer
Big data now is a common term. However, the evolution of big data comes from twofold. The
creation of the computer in the 1940s gradually provides tools for human beings to collect …

[HTML][HTML] Exploring big data analysis: fundamental scientific problems

Z Xu, Y Shi - Annals of Data Science, 2015 - Springer
Abstract Although Big Data has been one of most popular topics since last several years,
how to effectively conduct Big Data analysis is a big challenge for every field. This paper …

SPARSE k-MEANS WITH ℓ/ℓ0 PENALTY FOR HIGH-DIMENSIONAL DATA CLUSTERING

X Chang, Y Wang, R Li, Z Xu - Statistica Sinica, 2018 - JSTOR
One of the existing sparse clustering approaches, ℓ1-k-means, maximizes the weighted
between-cluster sum of squares subject to the ℓ1 penalty. In this paper, we propose a sparse …

Out-of-sample error estimation: the blessing of high dimensionality

L Oneto, A Ghio, S Ridella, JLR Ortiz… - … Conference on Data …, 2014 - ieeexplore.ieee.org
Dealing with high dimensionality when learning from data is a tough task since, for example,
similarity and correlation in data cannot be properly captured by the conventional notions of …

Community detection for clustered attributed graphs via a variational EM algorithm

X Cao, X Chang, Z Xu - … of the 2014 International Conference on Big …, 2014 - dl.acm.org
Community detection for attributed graphs, also called attributed graph clustering, is a new
challenging issue in data mining due to the increasing emergence of different kinds of real …

[PDF][PDF] Statistica Sinica Preprint No: SS-2015-0261R3

HDD Clustering, X Chang, Y Wang, Z Xu - stat.sinica.edu.tw
One of the existing sparse clustering approaches, i1-k-means, maximizes the weighted
between-cluster sum of squares subject to the i1 penalty. In this paper, we propose a new …