Impact of word embedding models on text analytics in deep learning environment: a review

DS Asudani, NK Nagwani, P Singh - Artificial intelligence review, 2023 - Springer
The selection of word embedding and deep learning models for better outcomes is vital.
Word embeddings are an n-dimensional distributed representation of a text that attempts to …

iPro-WAEL: a comprehensive and robust framework for identifying promoters in multiple species

P Zhang, H Zhang, H Wu - Nucleic Acids Research, 2022 - academic.oup.com
Promoters are consensus DNA sequences located near the transcription start sites and they
play an important role in transcription initiation. Due to their importance in biological …

CLNN-loop: a deep learning model to predict CTCF-mediated chromatin loops in the different cell lines and CTCF-binding sites (CBS) pair types

P Zhang, Y Wu, H Zhou, B Zhou, H Zhang… - Bioinformatics, 2022 - academic.oup.com
Abstract Motivation Three-dimensional (3D) genome organization is of vital importance in
gene regulation and disease mechanisms. Previous studies have shown that CTCF …

A comprehensive revisit of the machine‐learning tools developed for the identification of enhancers in the human genome

LT Phan, C Oh, T He, B Manavalan - Proteomics, 2023 - Wiley Online Library
Enhancers are non‐coding DNA elements that play a crucial role in enhancing the
transcription rate of a specific gene in the genome. Experiments for identifying enhancers …

Integrative machine learning framework for the identification of cell-specific enhancers from the human genome

S Basith, MM Hasan, G Lee, L Wei… - Briefings in …, 2021 - academic.oup.com
Enhancers are deoxyribonucleic acid (DNA) fragments which when bound by transcription
factors enhance the transcription of related genes. Due to its sporadic distribution and …

Genomic benchmarks: a collection of datasets for genomic sequence classification

K Grešová, V Martinek, D Čechák, P Šimeček… - BMC Genomic …, 2023 - Springer
Background Recently, deep neural networks have been successfully applied in many
biological fields. In 2020, a deep learning model AlphaFold won the protein folding …

XG-ac4C: identification of N4-acetylcytidine (ac4C) in mRNA using eXtreme gradient boosting with electron-ion interaction pseudopotentials

W Alam, H Tayara, KT Chong - Scientific reports, 2020 - nature.com
Abstract N4-acetylcytidine (ac4C) is a post-transcriptional modification in mRNA which plays
a major role in the stability and regulation of mRNA translation. The working mechanism of …

DNA sequences performs as natural language processing by exploiting deep learning algorithm for the identification of N4-methylcytosine

A Wahab, H Tayara, Z Xuan, KT Chong - Scientific reports, 2021 - nature.com
N4-methylcytosine is a biochemical alteration of DNA that affects the genetic operations
without modifying the DNA nucleotides such as gene expression, genomic imprinting …

DeepCap-Kcr: accurate identification and investigation of protein lysine crotonylation sites based on capsule network

J Khanal, H Tayara, Q Zou… - Briefings in …, 2022 - academic.oup.com
Lysine crotonylation (Kcr) is a posttranslational modification widely detected in histone and
nonhistone proteins. It plays a vital role in human disease progression and various cellular …

A robust drug–target interaction prediction framework with capsule network and transfer learning

Y Huang, HY Huang, Y Chen, YCD Lin, L Yao… - International Journal of …, 2023 - mdpi.com
Drug–target interactions (DTIs) are considered a crucial component of drug design and drug
discovery. To date, many computational methods were developed for drug–target …