Domain generalization: A survey

K Zhou, Z Liu, Y Qiao, T Xiang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Generalization to out-of-distribution (OOD) data is a capability natural to humans yet
challenging for machines to reproduce. This is because most learning algorithms strongly …

Generative artificial intelligence and its applications in materials science: Current situation and future perspectives

Y Liu, Z Yang, Z Yu, Z Liu, D Liu, H Lin, M Li, S Ma… - Journal of …, 2023 - Elsevier
Generative Artificial Intelligence (GAI) is attracting increasing attention from the
materials community for its excellent capability of generating required content. With the …

Multimodal learning with transformers: A survey

P Xu, X Zhu, DA Clifton - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org
The Transformer is a promising neural network learner and has achieved great success in
various machine learning tasks. Thanks to the recent prevalence of multimodal applications …

GLM-130B: An open bilingual pre-trained model

A Zeng, X Liu, Z Du, Z Wang, H Lai, M Ding… - arXiv preprint arXiv …, 2022 - arxiv.org
We introduce GLM-130B, a bilingual (English and Chinese) pre-trained language model
with 130 billion parameters. It is an attempt to open-source a 100B-scale model at least as …

Expert-level detection of pathologies from unannotated chest X-ray images via self-supervised learning

E Tiu, E Talius, P Patel, CP Langlotz, AY Ng… - Nature Biomedical …, 2022 - nature.com
In tasks involving the interpretation of medical images, suitably trained machine-learning
models often exceed the performance of medical experts. Yet such a high level of …

Simple open-vocabulary object detection

M Minderer, A Gritsenko, A Stone, M Neumann… - … on Computer Vision, 2022 - Springer
Combining simple architectures with large-scale pre-training has led to massive
improvements in image classification. For object detection, pre-training and scaling …

Learning to prompt for open-vocabulary object detection with vision-language model

Y Du, F Wei, Z Zhang, M Shi… - Proceedings of the …, 2022 - openaccess.thecvf.com
Recently, vision-language pre-training has shown great potential in open-vocabulary object
detection, where detectors trained on base classes are devised to detect new classes …
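
The entry above describes the open-vocabulary detection setup in which region features from a detector trained on base classes are matched against text embeddings of arbitrary class names. The sketch below illustrates that general idea only; the encoder, tokenizer, and prompt template are placeholder assumptions, not the method proposed in the cited paper.

```python
# Minimal sketch of open-vocabulary region classification: detected region
# features are compared to text embeddings of class-name prompts, so novel
# classes can be recognized without retraining the detector. The text_encoder
# and tokenizer here are assumed placeholder callables returning tensors.
import torch
import torch.nn.functional as F

def classify_regions(region_features, class_names, text_encoder, tokenizer):
    """Assign each detected region the class whose text embedding it matches best."""
    prompts = [f"a photo of a {name}" for name in class_names]  # simple prompt template
    text_emb = F.normalize(text_encoder(tokenizer(prompts)), dim=-1)  # (C, D)
    region_emb = F.normalize(region_features, dim=-1)                 # (R, D)
    scores = region_emb @ text_emb.t()                                # (R, C) similarities
    return scores.argmax(dim=-1)                                      # predicted class per region
```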

Multimodal foundation models: From specialists to general-purpose assistants

C Li, Z Gan, Z Yang, J Yang, L Li… - … and Trends® in …, 2024 - nowpublishers.com
This paper presents a comprehensive survey of the taxonomy and evolution of multimodal
foundation models, focusing on the transition from specialist models to general-purpose …

LiT: Zero-shot transfer with locked-image text tuning

X Zhai, X Wang, B Mustafa, A Steiner… - Proceedings of the …, 2022 - openaccess.thecvf.com
This paper presents contrastive-tuning, a simple method employing contrastive training to
align image and text models while still taking advantage of their pre-training. In our empirical …
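
The LiT entry above describes contrastive tuning that aligns image and text models while keeping the pre-trained image tower locked. The following is a minimal sketch of that training pattern under assumed placeholder encoders and hyperparameters (the temperature of 0.07 is an illustrative choice), not the paper's actual implementation.

```python
# Sketch of locked-image text tuning: the pre-trained image encoder is frozen
# and only the text encoder is updated with a CLIP-style contrastive loss.
import torch
import torch.nn.functional as F

def contrastive_loss(image_emb, text_emb, temperature=0.07):
    """Symmetric InfoNCE loss over a batch of paired image/text embeddings."""
    image_emb = F.normalize(image_emb, dim=-1)
    text_emb = F.normalize(text_emb, dim=-1)
    logits = image_emb @ text_emb.t() / temperature               # (B, B) similarity matrix
    targets = torch.arange(logits.size(0), device=logits.device)  # matching pairs on the diagonal
    return (F.cross_entropy(logits, targets) +
            F.cross_entropy(logits.t(), targets)) / 2

def lit_training_step(image_encoder, text_encoder, optimizer, images, texts):
    # Lock the image tower: no gradients flow into the pre-trained encoder.
    with torch.no_grad():
        image_emb = image_encoder(images)
    text_emb = text_encoder(texts)        # only the text tower is trained
    loss = contrastive_loss(image_emb, text_emb)
    optimizer.zero_grad()                 # optimizer holds text-encoder parameters only
    loss.backward()
    optimizer.step()
    return loss.item()
```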

Fake it till you make it: Learning transferable representations from synthetic ImageNet clones

MB Sarıyıldız, K Alahari, D Larlus… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent image generation models such as Stable Diffusion have exhibited an impressive
ability to generate fairly realistic images starting from a simple text prompt. Could such …
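
The last entry asks whether images generated from simple text prompts can replace real data for representation learning. The sketch below shows the general recipe of building such a synthetic "clone" dataset with an off-the-shelf Stable Diffusion pipeline; the checkpoint name, prompt template, and sample counts are assumptions for illustration, not the recipe used in the paper.

```python
# Illustrative sketch: generate class-conditioned synthetic images from text
# prompts and collect them as a labeled dataset for training a classifier.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

class_names = ["tench", "goldfish", "great white shark"]  # small subset for illustration
images_per_class = 4

synthetic_dataset = []
for label, name in enumerate(class_names):
    prompt = f"a photo of a {name}"            # simple class-name prompt template
    for _ in range(images_per_class):
        image = pipe(prompt).images[0]          # PIL image sampled from the prompt
        synthetic_dataset.append((image, label))

# The resulting (image, label) pairs can stand in for real ImageNet data when
# training an image classifier, which is the setting the entry above studies.
```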