A white paper on good research practices in benchmarking: The case of cluster analysis
I Van Mechelen, AL Boulesteix, R Dangl… - … : Data Mining and …, 2023 - Wiley Online Library
To achieve scientific progress in terms of building a cumulative body of knowledge, careful
attention to benchmarking is of the utmost importance, requiring that proposals of new …
attention to benchmarking is of the utmost importance, requiring that proposals of new …
On the role of benchmarking data sets and simulations in method comparison studies
S Friedrich, T Friede - Biometrical Journal, 2024 - Wiley Online Library
Method comparisons are essential to provide recommendations and guidance for applied
researchers, who often have to choose from a plethora of available approaches. While many …
researchers, who often have to choose from a plethora of available approaches. While many …
Pitfalls and potentials in simulation studies: Questionable research practices in comparative simulation studies allow for spurious claims of superiority of any method
Comparative simulation studies are workhorse tools for benchmarking statistical methods.
As with other empirical studies, the success of simulation studies hinges on the quality of …
As with other empirical studies, the success of simulation studies hinges on the quality of …
Position: Why we must rethink empirical research in machine learning
M Herrmann, FJD Lange, K Eggensperger… - 2024 - epub.ub.uni-muenchen.de
We warn against a common but incomplete understanding of empirical research in machine
learning that leads to non-replicable results, makes findings unreliable, and threatens to …
learning that leads to non-replicable results, makes findings unreliable, and threatens to …
[HTML][HTML] A framework for benchmarking clustering algorithms
M Gagolewski - SoftwareX, 2022 - Elsevier
The evaluation of clustering algorithms can involve running them on a variety of benchmark
problems, and comparing their outputs to the reference, ground-truth groupings provided by …
problems, and comparing their outputs to the reference, ground-truth groupings provided by …
Explaining the optimistic performance evaluation of newly proposed methods: A cross‐design validation experiment
The constant development of new data analysis methods in many fields of research is
accompanied by an increasing awareness that these new methods often perform better in …
accompanied by an increasing awareness that these new methods often perform better in …
Leveraging Sports Analytics and Association Rule Mining to Uncover Recovery and Economic Impacts in NBA Basketball
This study examines the multifaceted field of injuries and their impacts on performance in the
National Basketball Association (NBA), leveraging a blend of Data Science, Data Mining …
National Basketball Association (NBA), leveraging a blend of Data Science, Data Mining …
Data with Density-Based Clusters: A Generator for Systematic Evaluation of Clustering Algorithms
Mining data containing density-based clusters is well-established and widespread but faces
problems when it comes to systematic and reproducible comparison and evaluation …
problems when it comes to systematic and reproducible comparison and evaluation …
SHADE: Deep Density-based Clustering
Detecting arbitrarily shaped clusters in high-dimensional noisy data is challenging for
current clustering methods. We introduce SHADE (Structure-preserving High-dimensional …
current clustering methods. We introduce SHADE (Structure-preserving High-dimensional …
Cell Population Identification and Benchmarking of Tools in Single-Cell Data Analysis
A Sonrel - 2024 - zora.uzh.ch
In an era of Biology where modern imaging and sequencing technologies allow to study
almost any biological process at molecular levels, it is now possible to study the content …
almost any biological process at molecular levels, it is now possible to study the content …