Offline script identification from multilingual indic-script documents: a state-of-the-art
Abstract Offline Script Identification (OSI) facilitates many important applications such as
automatic archiving of multilingual documents, searching online/offline archives of document …
automatic archiving of multilingual documents, searching online/offline archives of document …
Script identification of multi-script documents: a survey
K Ubul, G Tursun, A Aysa, D Impedovo, G Pirlo… - IEEE …, 2017 - ieeexplore.ieee.org
In recent years, with the widespread of Internet and digitized processing of multi-script
documents worldwide, script identification techniques have become more important in the …
documents worldwide, script identification techniques have become more important in the …
Offline script recognition from handwritten and printed multilingual documents: a survey
Script recognition has many real-life applications like optical character recognition,
document archiving, writer identification, searching within the documents, etc. Automatic …
document archiving, writer identification, searching within the documents, etc. Automatic …
Survey on publicly available sinhala natural language processing tools and research
N De Silva - arXiv preprint arXiv:1906.02358, 2019 - arxiv.org
Sinhala is the native language of the Sinhalese people who make up the largest ethnic
group of Sri Lanka. The language belongs to the globe-spanning language tree, Indo …
group of Sri Lanka. The language belongs to the globe-spanning language tree, Indo …
Chinese text classification based on character-level CNN and SVM
H Wu, D Li, M Cheng - International Journal of Intelligent …, 2019 - inderscienceonline.com
Aiming at the problems of curse of dimensionality, sparse data and long computation time in
traditional SVM classification algorithm based on term frequency-inverse document …
traditional SVM classification algorithm based on term frequency-inverse document …
Benchmark databases of handwritten Bangla-Roman and Devanagari-Roman mixed-script document images
Handwritten document image dataset is one of the basic necessities to conduct research on
developing Optical Character Recognition (OCR) systems. In a multilingual country like …
developing Optical Character Recognition (OCR) systems. In a multilingual country like …
Understanding NFC-Net: a deep learning approach to word-level handwritten Indic script recognition
This paper presents a deep learning architecture modified for resource-constrained
environments, called Non-Fully-Connected Network or NFC-Net, based on convolutional …
environments, called Non-Fully-Connected Network or NFC-Net, based on convolutional …
A comprehensive handwritten Indic script recognition system: a tree-based approach
A noteworthy achievement has been accomplished in developing optical character
recognition (OCR) systems for different Indic scripts handwritten document images. But in a …
recognition (OCR) systems for different Indic scripts handwritten document images. But in a …
A similarity-based two-view multiple instance learning method for classification
Y Xiao, Z Yin, B Liu - Knowledge-Based Systems, 2020 - Elsevier
Multiple instance learning (MIL) has been proposed to classify the bag of instances. In
practice, we may meet the problems which have more than one view data. For example, in …
practice, we may meet the problems which have more than one view data. For example, in …
Separation of handwritten and machine-printed texts from noisy documents using contourlet transform
P Sahare, SB Dhok - Arabian Journal for Science and Engineering, 2018 - Springer
To make paperless environment in office, document image analysis, where optical character
recognition is mostly used, plays a major role. The documents such as bank cheques …
recognition is mostly used, plays a major role. The documents such as bank cheques …