Offline script identification from multilingual indic-script documents: a state-of-the-art

PK Singh, R Sarkar, M Nasipuri - Computer Science Review, 2015 - Elsevier
Abstract Offline Script Identification (OSI) facilitates many important applications such as
automatic archiving of multilingual documents, searching online/offline archives of document …

Script identification of multi-script documents: a survey

K Ubul, G Tursun, A Aysa, D Impedovo, G Pirlo… - IEEE …, 2017 - ieeexplore.ieee.org
In recent years, with the widespread of Internet and digitized processing of multi-script
documents worldwide, script identification techniques have become more important in the …

Offline script recognition from handwritten and printed multilingual documents: a survey

D Sinwar, VS Dhaka, N Pradhan, S Pandey - International Journal on …, 2021 - Springer
Script recognition has many real-life applications like optical character recognition,
document archiving, writer identification, searching within the documents, etc. Automatic …

Survey on publicly available sinhala natural language processing tools and research

N De Silva - arXiv preprint arXiv:1906.02358, 2019 - arxiv.org
Sinhala is the native language of the Sinhalese people who make up the largest ethnic
group of Sri Lanka. The language belongs to the globe-spanning language tree, Indo …

Chinese text classification based on character-level CNN and SVM

H Wu, D Li, M Cheng - International Journal of Intelligent …, 2019 - inderscienceonline.com
Aiming at the problems of curse of dimensionality, sparse data and long computation time in
traditional SVM classification algorithm based on term frequency-inverse document …

Benchmark databases of handwritten Bangla-Roman and Devanagari-Roman mixed-script document images

PK Singh, R Sarkar, N Das, S Basu, M Kundu… - Multimedia Tools and …, 2018 - Springer
Handwritten document image dataset is one of the basic necessities to conduct research on
developing Optical Character Recognition (OCR) systems. In a multilingual country like …

Understanding NFC-Net: a deep learning approach to word-level handwritten Indic script recognition

S Kundu, S Paul, PK Singh, R Sarkar… - Neural Computing and …, 2020 - Springer
This paper presents a deep learning architecture modified for resource-constrained
environments, called Non-Fully-Connected Network or NFC-Net, based on convolutional …

A comprehensive handwritten Indic script recognition system: a tree-based approach

PK Singh, R Sarkar, V Bhateja, M Nasipuri - Journal of Ambient …, 2024 - Springer
A noteworthy achievement has been accomplished in developing optical character
recognition (OCR) systems for different Indic scripts handwritten document images. But in a …

A similarity-based two-view multiple instance learning method for classification

Y Xiao, Z Yin, B Liu - Knowledge-Based Systems, 2020 - Elsevier
Multiple instance learning (MIL) has been proposed to classify the bag of instances. In
practice, we may meet the problems which have more than one view data. For example, in …

Separation of handwritten and machine-printed texts from noisy documents using contourlet transform

P Sahare, SB Dhok - Arabian Journal for Science and Engineering, 2018 - Springer
To make paperless environment in office, document image analysis, where optical character
recognition is mostly used, plays a major role. The documents such as bank cheques …