Deep multimodal fusion for semantic image segmentation: A survey

Y Zhang, D Sidibé, O Morel, F Mériaudeau - Image and Vision Computing, 2021 - Elsevier
Recent advances in deep learning have shown excellent performance in various scene
understanding tasks. However, in some complex environments or under challenging …

GeoAI for large-scale image analysis and machine vision: recent progress of artificial intelligence in geography

W Li, CY Hsu - ISPRS International Journal of Geo-Information, 2022 - mdpi.com
GeoAI, or geospatial artificial intelligence, has become a trending topic and the frontier for
spatial analytics in Geography. Although much progress has been made in exploring the …

Open-vocabulary panoptic segmentation with text-to-image diffusion models

J Xu, S Liu, A Vahdat, W Byeon… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present ODISE: Open-vocabulary DIffusion-based panoptic SEgmentation, which unifies
pre-trained text-image diffusion and discriminative models to perform open-vocabulary …

LISA: Reasoning segmentation via large language model

X Lai, Z Tian, Y Chen, Y Li, Y Yuan… - Proceedings of the …, 2024 - openaccess.thecvf.com
Although perception systems have made remarkable advancements in recent years, they still
rely on explicit human instruction or pre-defined categories to identify the target objects …

Convolutions die hard: Open-vocabulary segmentation with single frozen convolutional CLIP

Q Yu, J He, X Deng, X Shen… - Advances in Neural …, 2024 - proceedings.neurips.cc
Open-vocabulary segmentation is a challenging task requiring segmenting and recognizing
objects from an open set of categories in diverse environments. One way to address this …

OneFormer: One transformer to rule universal image segmentation

J Jain, J Li, MT Chiu, A Hassani… - Proceedings of the …, 2023 - openaccess.thecvf.com
Universal Image Segmentation is not a new concept. Past attempts to unify image
segmentation include scene parsing, panoptic segmentation, and, more recently, new …

Block-NeRF: Scalable large scene neural view synthesis

M Tancik, V Casser, X Yan, S Pradhan… - Proceedings of the …, 2022 - openaccess.thecvf.com
We present Block-NeRF, a variant of Neural Radiance Fields that can represent
large-scale environments. Specifically, we demonstrate that when scaling NeRF to render …

Panoptic neural fields: A semantic object-aware neural scene representation

A Kundu, K Genova, X Yin, A Fathi… - Proceedings of the …, 2022 - openaccess.thecvf.com
We present PanopticNeRF, an object-aware neural scene representation that decomposes
a scene into a set of objects (things) and background (stuff). Each object is represented by a …

Masked-attention mask transformer for universal image segmentation

B Cheng, I Misra, AG Schwing… - Proceedings of the …, 2022 - openaccess.thecvf.com
Image segmentation groups pixels with different semantics, e.g., category or instance
membership. Each choice of semantics defines a task. While only the semantics of each task …

Rethinking range view representation for LiDAR segmentation

L Kong, Y Liu, R Chen, Y Ma, X Zhu… - Proceedings of the …, 2023 - openaccess.thecvf.com
LiDAR segmentation is crucial for autonomous driving perception. Recent trends favor point-
or voxel-based methods as they often yield better performance than the traditional range …