Lvos: A benchmark for long-term video object segmentation

L Hong, W Chen, Z Liu, W Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Existing video object segmentation (VOS) benchmarks focus on short-term videos which just
last about 3-5 seconds and where objects are visible most of the time. These videos are …

Onetracker: Unifying visual object tracking with foundation models and efficient tuning

L Hong, S Yan, R Zhang, W Li, X Zhou… - Proceedings of the …, 2024 - openaccess.thecvf.com
Visual object tracking aims to localize the target object of each frame based on its initial
appearance in the first frame. Depending on the input modility tracking tasks can be divided …

Prototypical matching networks for video object segmentation

F Lin, Z Qiu, C Liu, T Yao, H Xie… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Semi-supervised video object segmentation is the task of segmenting the target in
sequential frames given the ground truth mask in the first frame. The modern approaches …

Boosting the transferability of adversarial attacks with global momentum initialization

J Wang, Z Chen, K Jiang, D Yang, L Hong… - Expert Systems with …, 2024 - Elsevier
Abstract Deep Neural Networks (DNNs) are vulnerable to adversarial examples, which are
crafted by adding human-imperceptible perturbations to the benign inputs. Simultaneously …

Reading relevant feature from global representation memory for visual object tracking

X Zhou, P Guo, L Hong, J Li, W Zhang… - Advances in …, 2024 - proceedings.neurips.cc
Reference features from a template or historical frames are crucial for visual object tracking.
Prior works utilize all features from a fixed template or memory for visual object tracking …

Openvis: Open-vocabulary video instance segmentation

P Guo, T Huang, P He, X Liu, T Xiao, Z Chen… - arXiv preprint arXiv …, 2023 - arxiv.org
Open-vocabulary Video Instance Segmentation (OpenVIS) can simultaneously detect,
segment, and track arbitrary object categories in a video, without being constrained to …

Simulflow: Simultaneously extracting feature and identifying target for unsupervised video object segmentation

L Hong, W Zhang, S Gao, H Lu, WQ Zhang - Proceedings of the 31st …, 2023 - dl.acm.org
Unsupervised video object segmentation (UVOS) aims at detecting the primary objects in a
given video sequence without any human interposing. Most existing methods rely on two …

TagOOD: A Novel Approach to Out-of-Distribution Detection via Vision-Language Representations and Class Center Learning

J Li, X Zhou, K Jiang, L Hong, P Guo, Z Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Multimodal fusion, leveraging data like vision and language, is rapidly gaining traction. This
enriched data representation improves performance across various tasks. Existing methods …

LVOS: A Benchmark for Large-scale Long-term Video Object Segmentation

L Hong, Z Liu, W Chen, C Tan, Y Feng, X Zhou… - arXiv preprint arXiv …, 2024 - arxiv.org
Video object segmentation (VOS) aims to distinguish and track target objects in a video.
Despite the excellent performance achieved by off-the-shell VOS models, existing VOS …

Towards Decision-based Sparse Attacks on Video Recognition

K Jiang, Z Chen, X Zhou, J Zhang, L Hong… - Proceedings of the 31st …, 2023 - dl.acm.org
Recent studies indicate that sparse attacks threaten the security of deep learning models,
which modify only a small set of pixels in the input based on the l0 norm constraint. While …