GMMSeg: Gaussian mixture based generative semantic segmentation models

P Li, CW Xie, H Xie, L Zhao, L Zhang… - Advances in neural …, 2024 - proceedings.neurips.cc

Video moment retrieval pursues an efficient and generalized solution to identify the specific
temporal segments within an untrimmed video that correspond to a given language …

Cited by 42 · Related articles · All 6 versions

ClustSeg: Clustering for universal segmentation

J Liang, T Zhou, D Liu, W Wang - arXiv preprint arXiv:2305.02187, 2023 - arxiv.org

We present CLUSTSEG, a general, transformer-based framework that tackles different
image segmentation tasks (i.e., superpixel, semantic, instance, and panoptic) through a …

Cited by 75 · Related articles · All 5 versions

Logic-induced diagnostic reasoning for semi-supervised semantic segmentation

C Liang, W Wang, J Miao… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Recent advances in semi-supervised semantic segmentation have been heavily reliant on
pseudo labeling to compensate for limited labeled data, disregarding the valuable relational …

Cited by 24 · Related articles · All 5 versions

DiffusionRet: Generative text-video retrieval with diffusion model

P Jin, H Li, Z Cheng, K Li, X Ji, C Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com

Existing text-video retrieval solutions are, in essence, discriminant models focused on
maximizing the conditional likelihood, i.e., p(candidates | query). While straightforward, this …

Cited by 42 · Related articles · All 5 versions

Generative semantic segmentation

J Chen, J Lu, X Zhu, L Zhang - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

We present Generative Semantic Segmentation (GSS), a generative learning
approach for semantic segmentation. Uniquely, we cast semantic segmentation as an image …

Cited by 36 · Related articles · All 7 versions

Clustering based point cloud representation learning for 3D analysis

T Feng, W Wang, X Wang, Y Yang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Point cloud analysis (such as 3D segmentation and detection) is a challenging task,
because of not only the irregular geometries of many millions of unordered points, but also …

Cited by 24 · Related articles · All 6 versions

FedSeg: Class-heterogeneous federated learning for semantic segmentation

J Miao, Z Yang, L Fan, Y Yang - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Federated Learning (FL) is a distributed learning paradigm that collaboratively learns a
global model across multiple clients with data privacy-preserving. Although many FL …

Cited by 35 · Related articles · All 4 versions

Local-global context aware transformer for language-guided video segmentation

C Liang, W Wang, T Zhou, J Miao… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

We explore the task of language-guided video segmentation (LVS). Previous algorithms
mostly adopt 3D CNNs to learn video representation, struggling to capture long-term context …

Cited by 73 · Related articles · All 9 versions

CATR: Combinatorial-dependence audio-queried transformer for audio-visual video segmentation

K Li, Z Yang, L Chen, Y Yang, J Xiao - Proceedings of the 31st ACM …, 2023 - dl.acm.org

Audio-visual video segmentation (AVVS) aims to generate pixel-level maps of sound-
producing objects within image frames and ensure the maps faithfully adhere to the given …

Cited by 40 · Related articles · All 4 versions

Sparsely annotated semantic segmentation with adaptive gaussian mixtures

L Wu, Z Zhong, L Fang, X He, Q Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com

Sparsely annotated semantic segmentation (SASS) aims to learn a segmentation model by
images with sparse labels (i.e., points or scribbles). Existing methods mainly focus on …

Cited by 25 · Related articles · All 6 versions