Multimodal research in vision and language: A review of current and emerging trends

S Uppal, S Bhagat, D Hazarika, N Majumder, S Poria… - Information …, 2022 - Elsevier
Deep Learning and its applications have cascaded impactful research and development
with a diverse range of modalities present in the real-world data. More recently, this has …

Fine-grained image analysis with deep learning: A survey

XS Wei, YZ Song, O Mac Aodha, J Wu… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Fine-grained image analysis (FGIA) is a longstanding and fundamental problem in computer
vision and pattern recognition, and underpins a diverse set of real-world applications. The …

Hierarchical deep click feature prediction for fine-grained image recognition

J Yu, M Tan, H Zhang, Y Rui… - IEEE transactions on …, 2019 - ieeexplore.ieee.org
The click feature of an image, defined as the user click frequency vector of the image on a
predefined word vocabulary, is known to effectively reduce the semantic gap for fine-grained …

Knowledge-embedded routing network for scene graph generation

T Chen, W Yu, R Chen, L Lin - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
To understand a scene in depth not only involves locating/recognizing individual objects, but
also requires to infer the relationships and interactions among them. However, since the …

Learning semantic-specific graph representation for multi-label image recognition

T Chen, M Xu, X Hui, H Wu… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Recognizing multiple labels of images is a practical and challenging task, and significant
progress has been made by searching semantic-aware regions and modeling label …

Attention convolutional binary neural tree for fine-grained visual categorization

R Ji, L Wen, L Zhang, D Du, Y Wu… - Proceedings of the …, 2020 - openaccess.thecvf.com
Fine-grained visual categorization (FGVC) is an important but challenging task due to high
intra-class variances and low inter-class variances caused by deformation, occlusion …

Interpretable and accurate fine-grained recognition via region grouping

Z Huang, Y Li - Proceedings of the IEEE/CVF conference on …, 2020 - openaccess.thecvf.com
We present an interpretable deep model for fine-grained visual recognition. At the core of
our method lies the integration of region-based part discovery and attribution within a deep …

Accurate fine-grained object recognition with structure-driven relation graph networks

S Wang, Z Wang, H Li, J Chang, W Ouyang… - International Journal of …, 2024 - Springer
Fine-grained object recognition (FGOR) aims to learn discriminative features that can
identify the subtle distinctions between visually similar objects. However, less effort has …

Cross-domain facial expression recognition: A unified evaluation benchmark and adversarial graph learning

T Chen, T Pu, H Wu, Y Xie, L Liu… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Facial expression recognition (FER) has received significant attention in the past decade
with witnessed progress, but data inconsistencies among different FER datasets greatly …

Heterogeneous semantic transfer for multi-label recognition with partial labels

T Chen, T Pu, L Liu, Y Shi, Z Yang, L Lin - International Journal of …, 2024 - Springer
Multi-label image recognition with partial labels (MLR-PL), in which some labels are known
while others are unknown for each image, may greatly reduce the cost of annotation and …