[PDF][PDF] A review of machine learning algorithms for text-documents classification

A Khan, B Baharudin, LH Lee, K Khan - Journal of advances in …, 2010 - academia.edu
With the increasing availability of electronic documents and the rapid growth of the World
Wide Web, the task of automatic categorization of documents became the key method for …

[PDF][PDF] Learning the kernel matrix with semidefinite programming

GRG Lanckriet, N Cristianini, P Bartlett… - Journal of Machine …, 2004 - jmlr.org
Kernel-based learning algorithms work by embedding the data into a Euclidean space, and
then searching for linear relations among the embedded data points. The embedding is …

Statistical topic models for multi-label document classification

TN Rubin, A Chambers, P Smyth, M Steyvers - Machine learning, 2012 - Springer
Abstract Machine learning approaches to multi-label document classification have to date
largely relied on discriminative modeling techniques such as support vector machines. A …

An enhanced support vector machine classification framework by using Euclidean distance function for text document categorization

LH Lee, CH Wan, R Rajkumar, D Isa - Applied Intelligence, 2012 - Springer
This paper presents the implementation of a new text document classification framework that
uses the Support Vector Machine (SVM) approach in the training phase and the Euclidean …

Large-scale sparse logistic regression

J Liu, J Chen, J Ye - Proceedings of the 15th ACM SIGKDD international …, 2009 - dl.acm.org
Logistic Regression is a well-known classification method that has been used widely in
many applications of data mining, machine learning, computer vision, and bioinformatics …

A hybrid text classification approach with low dependency on parameter by integrating K-nearest neighbor and support vector machine

CH Wan, LH Lee, R Rajkumar, D Isa - Expert Systems with Applications, 2012 - Elsevier
This work implements a new text document classifier by integrating the K-nearest neighbor
(KNN) classification approach with the support vector machine (SVM) training algorithm. The …

[PDF][PDF] A Fast Hybrid Algorithm for Large-Scale l1-Regularized Logistic Regression

J Shi, W Yin, S Osher, P Sajda - The Journal of Machine Learning …, 2010 - jmlr.org
Abstract ℓ1-regularized logistic regression, also known as sparse logistic regression, is
widely used in machine learning, computer vision, data mining, bioinformatics and neural …

[PDF][PDF] Maximum Entropy Discrimination Markov Networks.

J Zhu, EP Xing - Journal of Machine Learning Research, 2009 - jmlr.org
The standard maximum margin approach for structured prediction lacks a straightforward
probabilistic interpretation of the learning scheme and the prediction rule. Therefore its …

High Relevance Keyword Extraction facility for Bayesian text classification on different domains of varying characteristic

LH Lee, D Isa, WO Choo, WY Chue - Expert Systems with Applications, 2012 - Elsevier
High Relevance Keyword Extraction (HRKE) facility is introduced to Bayesian text
classification to perform feature/keyword extraction during the classifying stage, without …

Distributed text classification with an ensemble kernel-based learning approach

C Silva, U Lotric, B Ribeiro… - IEEE Transactions on …, 2010 - ieeexplore.ieee.org
Constructing a single text classifier that excels in any given application is a rather inviable
goal. As a result, ensemble systems are becoming an important resource, since they permit …