Re-thinking data strategy and integration for artificial intelligence: concepts, opportunities, and challenges

A Aldoseri, KN Al-Khalifa, AM Hamouda - Applied Sciences, 2023 - mdpi.com
The use of artificial intelligence (AI) is becoming more prevalent across industries such as
healthcare, finance, and transportation. Artificial intelligence is based on the analysis of …

Conceptual and empirical comparison of dimensionality reduction algorithms (pca, kpca, lda, mds, svd, lle, isomap, le, ica, t-sne)

F Anowar, S Sadaoui, B Selim - Computer Science Review, 2021 - Elsevier
Abstract Feature Extraction Algorithms (FEAs) aim to address the curse of dimensionality
that makes machine learning algorithms incompetent. Our study conceptually and …

Saint: Improved neural networks for tabular data via row attention and contrastive pre-training

G Somepalli, M Goldblum, A Schwarzschild… - arXiv preprint arXiv …, 2021 - arxiv.org
Tabular data underpins numerous high-impact applications of machine learning from fraud
detection to genomics and healthcare. Classical approaches to solving tabular problems …

A latent factor analysis-based approach to online sparse streaming feature selection

D Wu, Y He, X Luo, MC Zhou - IEEE Transactions on Systems …, 2021 - ieeexplore.ieee.org
Online streaming feature selection (OSFS) has attracted extensive attention during the past
decades. Current approaches commonly assume that the feature space of fixed data …

Secureml: A system for scalable privacy-preserving machine learning

P Mohassel, Y Zhang - 2017 IEEE symposium on security and …, 2017 - ieeexplore.ieee.org
Machine learning is widely used in practice to produce predictive models for applications
such as image processing, speech and text recognition. These models are more accurate …

Improved random forest for classification

A Paul, DP Mukherjee, P Das… - … on Image Processing, 2018 - ieeexplore.ieee.org
We propose an improved random forest classifier that performs classification with a
minimum number of trees. The proposed method iteratively removes some unimportant …

Fantastic four:{Honest-Majority}{Four-Party} secure computation with malicious security

A Dalskov, D Escudero, M Keller - 30th USENIX Security Symposium …, 2021 - usenix.org
This work introduces a novel four-party honest-majority MPC protocol with active security
that achieves comparable efficiency to equivalent protocols in the same setting, while having …

Feature selection with the Boruta package

MB Kursa, WR Rudnicki - Journal of statistical software, 2010 - jstatsoft.org
This article describes a R package Boruta, implementing a novel feature selection algorithm
for finding emph {all relevant variables}. The algorithm is designed as a wrapper around a …

Infinite feature selection

G Roffo, S Melzi, M Cristani - Proceedings of the IEEE …, 2015 - openaccess.thecvf.com
Filter-based feature selection has become crucial in many classification settings, especially
object recognition, recently faced with feature learning strategies that originate thousands of …

Making big data open: data sharing in neuroimaging

RA Poldrack, KJ Gorgolewski - Nature neuroscience, 2014 - nature.com
In the last decade, major advances have been made in the availability of shared
neuroimaging data, such that there are more than 8,000 shared MRI (magnetic resonance …