A white paper on good research practices in benchmarking: The case of cluster analysis

I Van Mechelen, AL Boulesteix, R Dangl… - … : Data Mining and …, 2023 - Wiley Online Library
To achieve scientific progress in terms of building a cumulative body of knowledge, careful
attention to benchmarking is of the utmost importance, requiring that proposals of new …

On the role of benchmarking data sets and simulations in method comparison studies

S Friedrich, T Friede - Biometrical Journal, 2024 - Wiley Online Library
Method comparisons are essential to provide recommendations and guidance for applied
researchers, who often have to choose from a plethora of available approaches. While many …

Pitfalls and potentials in simulation studies: Questionable research practices in comparative simulation studies allow for spurious claims of superiority of any method

S Pawel, L Kook, K Reeve - Biometrical Journal, 2024 - Wiley Online Library
Comparative simulation studies are workhorse tools for benchmarking statistical methods.
As with other empirical studies, the success of simulation studies hinges on the quality of …

Position: Why we must rethink empirical research in machine learning

M Herrmann, FJD Lange, K Eggensperger… - 2024 - epub.ub.uni-muenchen.de
We warn against a common but incomplete understanding of empirical research in machine
learning that leads to non-replicable results, makes findings unreliable, and threatens to …

[HTML][HTML] A framework for benchmarking clustering algorithms

M Gagolewski - SoftwareX, 2022 - Elsevier
The evaluation of clustering algorithms can involve running them on a variety of benchmark
problems, and comparing their outputs to the reference, ground-truth groupings provided by …

Explaining the optimistic performance evaluation of newly proposed methods: A cross‐design validation experiment

C Nießl, S Hoffmann, T Ullmann… - Biometrical …, 2024 - Wiley Online Library
The constant development of new data analysis methods in many fields of research is
accompanied by an increasing awareness that these new methods often perform better in …

Leveraging Sports Analytics and Association Rule Mining to Uncover Recovery and Economic Impacts in NBA Basketball

V Sarlis, G Papageorgiou, C Tjortjis - Data, 2024 - mdpi.com
This study examines the multifaceted field of injuries and their impacts on performance in the
National Basketball Association (NBA), leveraging a blend of Data Science, Data Mining …

Data with Density-Based Clusters: A Generator for Systematic Evaluation of Clustering Algorithms

P Jahn, CMM Frey, A Beer, C Leiber, T Seidl - Joint European Conference …, 2024 - Springer
Mining data containing density-based clusters is well-established and widespread but faces
problems when it comes to systematic and reproducible comparison and evaluation …

SHADE: Deep Density-based Clustering

A Beer, P Weber, L Miklautz, C Leiber, W Durani… - arXiv preprint arXiv …, 2024 - arxiv.org
Detecting arbitrarily shaped clusters in high-dimensional noisy data is challenging for
current clustering methods. We introduce SHADE (Structure-preserving High-dimensional …

Cell Population Identification and Benchmarking of Tools in Single-Cell Data Analysis

A Sonrel - 2024 - zora.uzh.ch
In an era of Biology where modern imaging and sequencing technologies allow to study
almost any biological process at molecular levels, it is now possible to study the content …