Hyperbolic learning with synthetic captions for open-world detection

文章

学术资源搜索

获得 4 条结果（用时0.02秒）

我的图书馆

Hyperbolic learning with synthetic captions for open-world detection

在引用文章中搜索

[PDF] arxiv.org

Seeing Beyond Classes: Zero-Shot Grounded Situation Recognition via Language Explainer

J Lei, L Li, C Wang, J Xiao, L Chen - Proceedings of the 32nd ACM …, 2024 - dl.acm.org

Benefiting from strong generalization ability, pre-trained vision-language models (VLMs), eg,
CLIP, have been widely utilized in zero-shot scene understanding. Unlike simple recognition …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Which Viewpoint Shows it Best? Language for Weakly Supervising View Selection in Multi-view Videos

S Majumder, T Nagarajan, Z Al-Halah… - arXiv preprint arXiv …, 2024 - arxiv.org

Given a multi-view video, which viewpoint is most informative for a human observer?
Existing methods rely on heuristics or expensive``best-view" supervision to answer this …

Learning Visual Hierarchies with Hyperbolic Embeddings

Z Wang, S Ramasinghe, C Xu, J Monteil… - arXiv preprint arXiv …, 2024 - arxiv.org

Structuring latent representations in a hierarchical manner enables models to learn patterns
at multiple levels of abstraction. However, most prevalent image understanding models …

Adversarial Attacks on Hyperbolic Networks

M van Spengler, J Zahálka, P Mettes - arXiv preprint arXiv:2412.01495, 2024 - arxiv.org

As hyperbolic deep learning grows in popularity, so does the need for adversarial
robustness in the context of such a non-Euclidean geometry. To this end, this paper …