Selective inference for k-means clustering

YT Chen, DM Witten - Journal of Machine Learning Research, 2023 - jmlr.org
We consider the problem of testing for a difference in means between clusters of
observations identified via k-means clustering. In this setting, classical hypothesis tests lead …

A doubly enhanced em algorithm for model-based tensor clustering

Q Mai, X Zhang, Y Pan, K Deng - Journal of the American Statistical …, 2022 - Taylor & Francis
Modern scientific studies often collect datasets in the form of tensors. These datasets call for
innovative statistical analysis methods. In particular, there is a pressing need for tensor …

Trajectories of symptom severity in children with autism: variability and turning points through the transition to school

S Georgiades, PA Tait, PD McNicholas, E Duku… - Journal of Autism and …, 2022 - Springer
This study examined the trajectories of autistic symptom severity in an inception cohort of
187 children with ASD assessed across four time points from diagnosis to age 10. Trajectory …

On parsimonious models for modeling matrix data

S Sarkar, X Zhu, V Melnykov, S Ingrassia - Computational Statistics & Data …, 2020 - Elsevier
Finite mixture modeling is a popular technique for capturing heterogeneity in data. Although
the vast majority of the theory developed in this area up to date deals with vector-valued …

Improved initialization of the em algorithm for mixture model parameter estimation

B Panić, J Klemenc, M Nagode - Mathematics, 2020 - mdpi.com
A commonly used tool for estimating the parameters of a mixture model is the Expectation–
Maximization (EM) algorithm, which is an iterative procedure that can serve as a maximum …

Matrix normal cluster-weighted models

SD Tomarchio, PD McNicholas, A Punzo - Journal of Classification, 2021 - Springer
Finite mixtures of regressions with fixed covariates are a commonly used model-based
clustering methodology to deal with regression data. However, they assume assignment …

Two new matrix-variate distributions with application in model-based clustering

SD Tomarchio, A Punzo, L Bagnato - Computational Statistics & Data …, 2020 - Elsevier
Two matrix-variate distributions, both elliptical heavy-tailed generalization of the matrix-
variate normal distribution, are introduced. They belong to the normal scale mixture family …

Mixtures of matrix-variate contaminated normal distributions

SD Tomarchio, MPB Gallaugher, A Punzo… - … of Computational and …, 2022 - Taylor & Francis
Analysis of matrix-variate data is becoming ever more prevalent in the literature, especially
in the area of clustering and classification. Real data, including real matrix-variate data, are …

A novel two-sample test within the space of symmetric positive definite matrix distributions and its application in finance

Ž Lukić, B Milošević - Annals of the Institute of Statistical Mathematics, 2024 - Springer
This paper introduces a novel two-sample test for a broad class of orthogonally invariant
positive definite symmetric matrix distributions. Our test is the first of its kind, and we derive …

A mixture of coalesced generalized hyperbolic distributions

C Tortora, BC Franczak, RP Browne… - Journal of …, 2019 - Springer
A mixture of multiple scaled generalized hyperbolic distributions (MMSGHDs) is introduced.
Then, a coalesced generalized hyperbolic distribution (CGHD) is developed by joining a …