LWSINet: A deep learning-based approach towards video script identification

K Yang, J Yi, A Chen, J Liu, W Chen, Z Jin - Engineering Applications of …, 2022 - Elsevier

Abstract Optical Character Recognition (OCR) system serves the need of reading text from
images. Script identification that identifies the language of the text in the image is an …

被引用次数：24 相关文章所有 2 个版本

[HTML] springer.com

[HTML][HTML] LWSNet-a novel deep-learning architecture to segregate Covid-19 and pneumonia from x-ray imagery

A Lasker, M Ghosh, SM Obaidullah… - Multimedia Tools and …, 2023 - Springer

Automatic detection of lung diseases using AI-based tools became very much necessary to
handle the huge number of cases occurring across the globe and support the doctors. This …

被引用次数：16 相关文章所有 8 个版本

An augmented reality for an arabic text reading and visualization assistant for the visually impaired

I Ouali, MB Halima, A Wali - Multimedia Tools and Applications, 2023 - Springer

Text, as one of humanity's most influential innovations, has played an important role in
shaping our lives. Reading a text is a difficult task due to several reasons factors, such as …

被引用次数：9 相关文章所有 3 个版本

Scene text understanding: recapitulating the past decade

M Ghosh, H Mukherjee, SM Obaidullah, XZ Gao… - Artificial Intelligence …, 2023 - Springer

Computational perception has indeed been dramatically modified and reformed from
handcrafted feature-based techniques to the advent of deep learning. Scene text …

被引用次数：4 相关文章所有 2 个版本

[HTML] mdpi.com

[HTML][HTML] Classification of geometric forms in mosaics using deep neural network

M Ghosh, SM Obaidullah, F Gherardini, M Zdimalova - Journal of Imaging, 2021 - mdpi.com

The paper addresses an image processing problem in the field of fine arts. In particular, a
deep learning-based technique to classify geometric forms of artworks, such as paintings …

被引用次数：19 相关文章所有 13 个版本

[PDF] sciencedirect.com

Augmented reality for scene text recognition, visualization and reading to assist visually impaired people

I Ouali, MB Halima, W Ali - Procedia Computer Science, 2022 - Elsevier

Reading traffic signs while driving a car for visually impaired people and people with visual
problems is a very difficult task for them. This task is encountered every day, sometimes …

被引用次数：17 相关文章所有 2 个版本

[PDF] iiit.ac.in

Document image analysis using deep multi-modular features

KV Jobin, A Mondal, CV Jawahar - SN Computer Science, 2022 - Springer

Texture or repeating patterns, discriminative patches, and shapes are the salient features for
various document image analysis problems. This article proposes a deep network …

被引用次数：5 相关文章所有 5 个版本

Ensemble stack architecture for lungs segmentation from X-ray images

A Lasker, M Ghosh, SM Obaidullah… - … on Intelligent Data …, 2022 - Springer

In healthcare, chest X-rays are an inexpensive medical imaging diagnostic tools. The lung
images segmentation from chest X-rays (CXRs) is important for screening and diagnosing …

被引用次数：5 相关文章所有 3 个版本

A deep learning-based framework for COVID-19 identification using chest X-Ray images

A Lasker, M Ghosh, SM Obaidullah… - … of Deep Learning …, 2023 - taylorfrancis.com

COVID-19 originally surfaced in Wuhan China quickly propagated over the globe and
become a pandemic. This has had a disastrous consequence for people's regular life …

被引用次数：3 相关文章所有 2 个版本

MOPO-HBT: A movie poster dataset for title extraction and recognition

M Ghosh, SS Roy, B Banik, H Mukherjee… - Multimedia Tools and …, 2024 - Springer

Real-world images often encompass embedded texts that adhere to disparate disciplines
like business, education, and amusement, to name a few. Such images are graphically rich …