Toward stable co-saliency detection and object co-segmentation

H Wang, B Li, S Wu, S Shen, F Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract Dynamic Facial Expression Recognition (DFER) is a rapidly developing field that
focuses on recognizing facial expressions in video format. Previous research has …

被引用次数：31 相关文章所有 5 个版本

[PDF] thecvf.com

Jack of All Tasks Master of Many: Designing General-Purpose Coarse-to-Fine Vision-Language Model

S Pramanick, G Han, R Hou, S Nag… - Proceedings of the …, 2024 - openaccess.thecvf.com

The ability of large language models (LLMs) to process visual inputs has given rise to
general-purpose vision systems unifying various vision-language (VL) tasks by instruction …

被引用次数：8 相关文章所有 3 个版本

[PDF] researchgate.net

Shape-Consistent One-Shot Unsupervised Domain Adaptation for Rail Surface Defect Segmentation

S Ma, K Song, M Niu, H Tian, Y Wang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Deep neural networks have greatly improved the performance of rail surface defect
segmentation when the test samples have the same distribution as the training samples …

被引用次数：22 相关文章所有 2 个版本

Tcnet: Co-salient object detection via parallel interaction of transformers and cnns

Y Ge, Q Zhang, TZ Xiang, C Zhang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

The purpose of co-salient object detection (CoSOD) is to detect the salient objects that co-
occur in a group of relevant images. CoSOD has been significantly prospered by recent …

被引用次数：18 相关文章所有 3 个版本

[PDF] aaai.org

Attack can benefit: An adversarial approach to recognizing facial expressions under noisy annotations

J Zheng, B Li, SC Zhang, S Wu, L Cao… - Proceedings of the AAAI …, 2023 - ojs.aaai.org

Abstract The real-world Facial Expression Recognition (FER) datasets usually exhibit
complex scenarios with coupled noise annotations and imbalanced classes distribution …

被引用次数：4 相关文章所有 3 个版本

Sp-det: Leveraging saliency prediction for voxel-based 3d object detection in sparse point cloud

P An, Y Duan, Y Huang, J Ma, Y Chen… - IEEE Transactions …, 2023 - ieeexplore.ieee.org

Voxel is one of the common structural representation of 3D point cloud. Due to the sparsity of
point cloud generated by light detection and ranging (LiDAR), there is the extreme …

被引用次数：5 相关文章所有 2 个版本

[PDF] arxiv.org

Zero-shot co-salient object detection framework

H Xiao, L Tang, B Li, Z Luo, S Li - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org

Co-salient Object Detection (CoSOD) endeavors to replicate the human visual system's
capacity to recognize common and salient objects within a collection of images. Despite …

被引用次数：4 相关文章所有 3 个版本

[PDF] thecvf.com

Scene Matters: Model-based Deep Video Compression

L Tang, X Zhang, G Zhang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Video compression has always been a popular research area, where many traditional and
deep video compression methods have been proposed. These methods typically rely on …

被引用次数：4 相关文章所有 5 个版本

Multi-View Graph Embedding Learning for Image Co-Segmentation and Co-Localization

A Huang, L Li, L Zhang, Y Niu, T Zhao… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Image co-segmentation and co-localization exploit inter-image information to identify and
extract foreground objects with a batch mode. However, they remain challenging when …

被引用次数：1 相关文章

Predicting 360° Video Saliency: A ConvLSTM Encoder-Decoder Network with Spatio-temporal Consistency

Z Wan, H Qin, R Xiong, Z Li, X Fan… - IEEE Journal on …, 2024 - ieeexplore.ieee.org

360° videos have been widely used with the development of virtual reality technology and
triggered a demand to determine the most visually attractive objects in them, aka 360° video …