XGBoost-based and tumor-immune characterized gene signature for the prediction of metastatic status in breast cancer

Q Li, H Yang, P Wang, X Liu, K Lv, M Ye - Journal of translational medicine, 2022 - Springer
Background For a long time, breast cancer has been a leading cancer diagnosed in women
worldwide, and approximately 90% of cancer-related deaths are caused by metastasis. For …

Intelligent mining of large-scale bio-data: Bioinformatics applications

FS Golestan Hashemi, M Razi Ismail… - Biotechnology & …, 2018 - Taylor & Francis
Today, there is a collection of a tremendous amount of bio-data because of the
computerized applications worldwide. Therefore, scholars have been encouraged to …

A two-stage sparse logistic regression for optimal gene selection in high-dimensional microarray data classification

ZY Algamal, MH Lee - Advances in data analysis and classification, 2019 - Springer
The common issues of high-dimensional gene expression data are that many of the genes
may not be relevant, and there exists a high correlation among genes. Gene selection has …

[PDF][PDF] Classification of breast cancer histopathology images based on adaptive sparse support vector machine

MA Kahya, W Al-Hayani, ZY Algamal - Journal of Applied …, 2017 - academia.edu
Feature extraction and classification of the histopathological image plays a significant role in
prediction and diagnosis of diseases, such as breast cancer. The common issues of the …

[HTML][HTML] Integration of eQTL and machine learning to dissect causal genes with pleiotropic effects in genetic regulation networks of seed cotton yield

T Zhao, H Wu, X Wang, Y Zhao, L Wang, J Pan, H Mei… - Cell Reports, 2023 - cell.com
The dissection of a gene regulatory network (GRN) that complements the genome-wide
association study (GWAS) locus and the crosstalk underlying multiple agronomical traits …

Cancer diagnosis and disease gene identification via statistical machine learning

L Chen, J Li, M Chang - Current Bioinformatics, 2020 - ingentaconnect.com
Diagnosing cancer and identifying the disease gene by using DNA microarray gene
expression data are the hot topics in current bioinformatics. This paper is devoted to the …

[HTML][HTML] WeDIV–an improved k-means clustering algorithm with a weighted distance and a novel internal validation index

Z Ning, J Chen, J Huang, UJ Sabo, Z Yuan… - Egyptian Informatics …, 2022 - Elsevier
Designing appropriate similarity metrics (distance) and estimating the optimal number of
clusters have been two important issues in cluster analysis. This study proposed an …

A wrapper feature subset selection method based on randomized search and multilayer structure

Y Mao, Y Yang - BioMed research international, 2019 - Wiley Online Library
The identification of discriminative features from information‐rich data with the goal of
clinical diagnosis is crucial in the field of biomedical science. In this context, many machine …

Feature selection for binary classification within functional genomics experiments via interquartile range and clustering

Z Khan, M Naeem, U Khalil, DM Khan… - IEEE …, 2019 - ieeexplore.ieee.org
Datasets produced in modern research, such as biomedical science, pose a number of
challenges for machine learning techniques used in binary classification due to high …

[HTML][HTML] A deep learning model for accurate diagnosis of infection using antibody repertoires

Y Chen, Z Ye, Y Zhang, W Xie, Q Chen… - The Journal of …, 2022 - journals.aai.org
The adaptive immune receptor repertoire consists of the entire set of an individual's BCRs
and TCRs and is believed to contain a record of prior immune responses and the potential …