ConvPatchTrans: A script identification network with global and local semantics deeply integrated

K Yang, J Yi, A Chen, J Liu, W Chen, Z Jin - Engineering Applications of …, 2022 - Elsevier
An Optical Character Recognition (OCR) system serves the need of reading text from
images. Script identification that identifies the language of the text in the image is an …

LWSNet-a novel deep-learning architecture to segregate Covid-19 and pneumonia from x-ray imagery

A Lasker, M Ghosh, SM Obaidullah… - Multimedia Tools and …, 2023 - Springer
Automatic detection of lung diseases using AI-based tools has become necessary to
handle the huge number of cases occurring across the globe and support the doctors. This …

An augmented reality for an arabic text reading and visualization assistant for the visually impaired

I Ouali, MB Halima, A Wali - Multimedia Tools and Applications, 2023 - Springer
Text, as one of humanity's most influential innovations, has played an important role in
shaping our lives. Reading a text is a difficult task due to several factors, such as …

Scene text understanding: recapitulating the past decade

M Ghosh, H Mukherjee, SM Obaidullah, XZ Gao… - Artificial Intelligence …, 2023 - Springer
Computational perception has indeed been dramatically modified and reformed from
handcrafted feature-based techniques to the advent of deep learning. Scene text …

Classification of geometric forms in mosaics using deep neural network

M Ghosh, SM Obaidullah, F Gherardini, M Zdimalova - Journal of Imaging, 2021 - mdpi.com
The paper addresses an image processing problem in the field of fine arts. In particular, a
deep learning-based technique to classify geometric forms of artworks, such as paintings …

Augmented reality for scene text recognition, visualization and reading to assist visually impaired people

I Ouali, MB Halima, W Ali - Procedia Computer Science, 2022 - Elsevier
Reading traffic signs while driving a car is a very difficult task for visually impaired
people and people with visual problems. This task is encountered every day, sometimes …

Document image analysis using deep multi-modular features

KV Jobin, A Mondal, CV Jawahar - SN Computer Science, 2022 - Springer
Texture or repeating patterns, discriminative patches, and shapes are the salient features for
various document image analysis problems. This article proposes a deep network …

Ensemble stack architecture for lungs segmentation from X-ray images

A Lasker, M Ghosh, SM Obaidullah… - … on Intelligent Data …, 2022 - Springer
In healthcare, chest X-rays are an inexpensive medical imaging diagnostic tool. Segmenting
lung images from chest X-rays (CXRs) is important for screening and diagnosing …

A deep learning-based framework for COVID-19 identification using chest X-Ray images

A Lasker, M Ghosh, SM Obaidullah… - … of Deep Learning …, 2023 - taylorfrancis.com
COVID-19, which originally surfaced in Wuhan, China, quickly propagated over the globe and
became a pandemic. This has had a disastrous consequence for people's regular life …

MOPO-HBT: A movie poster dataset for title extraction and recognition

M Ghosh, SS Roy, B Banik, H Mukherjee… - Multimedia Tools and …, 2024 - Springer
Real-world images often encompass embedded texts that adhere to disparate disciplines
like business, education, and amusement, to name a few. Such images are graphically rich …