Transforming big data into smart data: An insight on the use of the k‐nearest neighbors algorithm to obtain quality data

I Triguero, D García‐Gil, J Maillo… - … : Data Mining and …, 2019 - Wiley Online Library
The k‐nearest neighbors algorithm is characterized as a simple yet effective data mining
technique. The main drawback of this technique appears when massive amounts of data …

Early prediction of learners at risk in self-paced education: A neural network approach

H Waheed, SU Hassan, R Nawaz, NR Aljohani… - Expert Systems with …, 2023 - Elsevier
To address the demands of modern education and increase flexibility, many higher
education institutions are considering self-paced education programs. However, student …

Virtual learning environment to predict withdrawal by leveraging deep learning

SU Hassan, H Waheed, NR Aljohani… - … Journal of Intelligent …, 2019 - Wiley Online Library
The current evolution in multidisciplinary learning analytics research poses significant
challenges for the exploitation of behavior analysis by fusing data streams toward advanced …

A multi-variate heart disease optimization and recognition framework

HM Balaha, AO Shaban, EM El-Gendy… - Neural Computing and …, 2022 - Springer
Cardiovascular diseases (CVD) are the most widely spread diseases all over the world
among the common chronic diseases. CVD represents one of the main causes of morbidity …

A survey on classifying big data with label noise

JM Johnson, TM Khoshgoftaar - ACM Journal of Data and Information …, 2022 - dl.acm.org
Class label noise is a critical component of data quality that directly inhibits the predictive
performance of machine learning algorithms. While many data-level and algorithm-level …

T-shaped partial least squares for high-dosed new active pharmaceutical ingredients in continuous twin-screw wet granulation: Granule size prediction with limited …

K Matsunami, J Meyer, M Rowland, N Dawson… - International Journal of …, 2023 - Elsevier
This work presents a granule size prediction approach applicable to diverse formulations
containing new active pharmaceutical ingredients (APIs) in continuous twin-screw wet …

TPLS as predictive platform for twin-screw wet granulation process and formulation development

A Ryckaert, D Van Hauwermeiren, J Dhondt… - International Journal of …, 2021 - Elsevier
In recent years, the interest in continuous manufacturing techniques, such as twin-screw wet
granulation, has increased. However, the understanding of the influence of the combination …

A data mining experimentation framework to improve six sigma projects

AF Fahmy, HK Mohamed… - 2017 13th International …, 2017 - ieeexplore.ieee.org
The complexity and dynamics have been increased in the last years in the field of telecom
operations. Due to this, many challenges were introduced in this area especially in handling …

Performance analysis of data mining algorithms

DM Amin, A Garg - Journal of Computational and Theoretical …, 2019 - ingentaconnect.com
The globalisation of Internet is creating enormous amount of data on servers. The data
created during last two years is itself equivalent to the data created during all these years …

CohortFinder: an open-source tool for data-driven partitioning of digital pathology and imaging cohorts to yield robust machine-learning models

F Fan, G Martinez, T DeSilvio, J Shin, Y Chen, J Jacobs… - npj Imaging, 2024 - nature.com
Batch effects (BEs) refer to systematic technical differences in data collection unrelated to
biological variations whose noise is shown to negatively impact machine learning (ML) …