Machine learning methods for small data challenges in molecular science

B Dou, Z Zhu, E Merkurjev, L Ke, L Chen… - Chemical …, 2023 - ACS Publications
Small data are often used in scientific and engineering research due to the presence of
various constraints, such as time, cost, ethics, privacy, security, and technical limitations in …

Methods for biological data integration: perspectives and challenges

V Gligorijević, N Pržulj - Journal of the Royal Society …, 2015 - royalsocietypublishing.org
Rapid technological advances have led to the production of different types of biological data
and enabled construction of complex networks with various types of interactions between …

Finding the best classification threshold in imbalanced classification

Q Zou, S Xie, Z Lin, M Wu, Y Ju - Big Data Research, 2016 - Elsevier
Classification with imbalanced class distributions is a major problem in machine learning.
Researchers have given considerable attention to the applications in many real-world …

Support vector machines and kernels for computational biology

A Ben-Hur, CS Ong, S Sonnenburg… - PLoS computational …, 2008 - journals.plos.org
The increasing wealth of biological data coming from a large variety of platforms and the
continued development of new high-throughput methods for probing biological systems …

[图书][B] Kernel methods in computational biology

B Schölkopf, K Tsuda, JP Vert - 2004 - books.google.com
A detailed overview of current research in kernel methods and their application to
computational biology. Modern machine learning techniques are proving to be extremely …

[PDF][PDF] Kernel methods for predicting protein–protein interactions

A Ben-Hur, WS Noble - Bioinformatics, 2005 - noble.gs.washington.edu
Motivation: Despite advances in high-throughput methods for discovering protein–protein
interactions, the interaction networks of even well-studied model organisms are sketchy at …

Support vector machine applications in computational biology

WS Noble - 2004 - direct.mit.edu
During the past 3 years, the support vector machine (SVM) learning algorithm has been
extensively applied within the field of computational biology. The algorithm has been used to …

Fast kernels for string and tree matching

SVN Vishwanathan, AJ Smola - 2004 - direct.mit.edu
This algorithm can be extended in various ways to provide linear time prediction cost in the
length of the sequence to be classified. We demonstrate extensions in the case of position …

Semi-supervised protein classification using cluster kernels

J Weston, D Zhou, A Elisseeff… - Advances in neural …, 2003 - proceedings.neurips.cc
A key issue in supervised protein classification is the representation of input sequences of
amino acids. Recent work using string kernels for protein data has achieved state-of-the-art …

Choosing negative examples for the prediction of protein-protein interactions

A Ben-Hur, WS Noble - BMC bioinformatics, 2006 - Springer
The protein-protein interaction networks of even well-studied model organisms are sketchy
at best, highlighting the continued need for computational methods to help direct …