Word-wise sinhala tamil and english script identification using gaussian kernel svm

PK Singh, R Sarkar, M Nasipuri - Computer Science Review, 2015 - Elsevier

Abstract Offline Script Identification (OSI) facilitates many important applications such as
automatic archiving of multilingual documents, searching online/offline archives of document …

被引用次数：56 相关文章所有 2 个版本

[PDF] ieee.org

Script identification of multi-script documents: a survey

K Ubul, G Tursun, A Aysa, D Impedovo, G Pirlo… - IEEE …, 2017 - ieeexplore.ieee.org

In recent years, with the widespread of Internet and digitized processing of multi-script
documents worldwide, script identification techniques have become more important in the …

被引用次数：79 相关文章所有 4 个版本

[PDF] academia.edu

Offline script recognition from handwritten and printed multilingual documents: a survey

D Sinwar, VS Dhaka, N Pradhan, S Pandey - International Journal on …, 2021 - Springer

Script recognition has many real-life applications like optical character recognition,
document archiving, writer identification, searching within the documents, etc. Automatic …

被引用次数：22 相关文章所有 4 个版本

[PDF] arxiv.org

Survey on publicly available sinhala natural language processing tools and research

N De Silva - arXiv preprint arXiv:1906.02358, 2019 - arxiv.org

Sinhala is the native language of the Sinhalese people who make up the largest ethnic
group of Sri Lanka. The language belongs to the globe-spanning language tree, Indo …

被引用次数：44 相关文章所有 6 个版本

Chinese text classification based on character-level CNN and SVM

H Wu, D Li, M Cheng - International Journal of Intelligent …, 2019 - inderscienceonline.com

Aiming at the problems of curse of dimensionality, sparse data and long computation time in
traditional SVM classification algorithm based on term frequency-inverse document …

被引用次数：30 相关文章所有 4 个版本

Benchmark databases of handwritten Bangla-Roman and Devanagari-Roman mixed-script document images

PK Singh, R Sarkar, N Das, S Basu, M Kundu… - Multimedia Tools and …, 2018 - Springer

Handwritten document image dataset is one of the basic necessities to conduct research on
developing Optical Character Recognition (OCR) systems. In a multilingual country like …

被引用次数：31 相关文章所有 4 个版本

Understanding NFC-Net: a deep learning approach to word-level handwritten Indic script recognition

S Kundu, S Paul, PK Singh, R Sarkar… - Neural Computing and …, 2020 - Springer

This paper presents a deep learning architecture modified for resource-constrained
environments, called Non-Fully-Connected Network or NFC-Net, based on convolutional …

被引用次数：22 相关文章所有 4 个版本

A comprehensive handwritten Indic script recognition system: a tree-based approach

PK Singh, R Sarkar, V Bhateja, M Nasipuri - Journal of Ambient …, 2024 - Springer

A noteworthy achievement has been accomplished in developing optical character
recognition (OCR) systems for different Indic scripts handwritten document images. But in a …

被引用次数：18 相关文章

[PDF] atailab.cn

A similarity-based two-view multiple instance learning method for classification

Y Xiao, Z Yin, B Liu - Knowledge-Based Systems, 2020 - Elsevier

Multiple instance learning (MIL) has been proposed to classify the bag of instances. In
practice, we may meet the problems which have more than one view data. For example, in …

被引用次数：12 相关文章所有 2 个版本

Separation of handwritten and machine-printed texts from noisy documents using contourlet transform

P Sahare, SB Dhok - Arabian Journal for Science and Engineering, 2018 - Springer

To make paperless environment in office, document image analysis, where optical character
recognition is mostly used, plays a major role. The documents such as bank cheques …

被引用次数：11 相关文章所有 2 个版本