A Deep Learning-Based Bengali Visual Question Answering System
MH Rafi, S Islam, SMHI Labib… - … on Computer and …, 2022 - ieeexplore.ieee.org
Visual Question Answering (VQA) is a challenging task in Artificial Intelligence (AI), where
an AI agent answers questions regarding visual content based on images provided …
an AI agent answers questions regarding visual content based on images provided …
ChitroJera: A Regionally Relevant Visual Question Answering Dataset for Bangla
Visual Question Answer (VQA) poses the problem of answering a natural language question
about a visual context. Bangla, despite being a widely spoken language, is considered low …
about a visual context. Bangla, despite being a widely spoken language, is considered low …
A Deep Learning-Based Bengali Visual Question Answering System Using Contrastive Loss
MT Zaman, MY Zaman, FM Shah… - 2024 6th International …, 2024 - ieeexplore.ieee.org
Visual Question Answering (VQA) is an interdis-ciplinary research area that uses image
recognition, natural language processing, and cognitive understanding to interpret visual …
recognition, natural language processing, and cognitive understanding to interpret visual …
Detection of Schizophrenia Using Attention Mechanism with Convolutional Neural Network on EEG Signal
MKA Bhuiyan, D Biswas - 2024 IEEE International Conference …, 2024 - ieeexplore.ieee.org
Schizophrenia (SZ) is a challenging mental disorder, requiring accurate diagnosis for
effective treatment. Early diagnosis of schizophrenia can facilitate a normal life for patients …
effective treatment. Early diagnosis of schizophrenia can facilitate a normal life for patients …
A Deep Learning-Based Bengali Visual Commonsense Reasoning System
MY Zaman, MT Zaman, S Pandit… - … Graphics and Image …, 2024 - ieeexplore.ieee.org
Visual commonsense reasoning, an integral aspect of human intelligence, extends beyond
mere object identification, encompassing the nuanced inference of actions, intentions, and …
mere object identification, encompassing the nuanced inference of actions, intentions, and …
Novel cloud storage ecosystem for efficient and secured multimedia services
JN Mukta - 2023 - lib.buet.ac.bd
Since massive numbers of images are now being communicated from, and stored in
different cloud systems, faster retrieval has become extremely important. This is more …
different cloud systems, faster retrieval has become extremely important. This is more …
[PDF][PDF] Modular Co-attention Networks in Nepali Visual Question Answering Systems
A Gyanwali, B Sapkota, A Koirala… - Asian Journal of Research … - researchgate.net
Visual question answering (VQA) has been regarded as a challenging task requiring a
perfect blend of computer vision and natural language processing. As no dataset was …
perfect blend of computer vision and natural language processing. As no dataset was …