Harnessing GANs for zero-shot learning of new classes in visual speech recognition

A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets

K Bayoudh, R Knani, F Hamdaoui, A Mtibaa - The Visual Computer, 2022 - Springer

The research progress in multimodal learning has grown rapidly over the last decade in
several areas, especially in computer vision. The growing potential of multimodal data …

Cited by 274 · Related articles · All 7 versions

Advances and challenges in deep lip reading

M Oghbaie, A Sabaghi, K Hashemifard… - arXiv preprint arXiv …, 2021 - arxiv.org

Driven by deep learning techniques and large-scale datasets, recent years have witnessed
a paradigm shift in automatic lip reading. While the main thrust of Visual Speech …

Cited by 14 · Related articles · All 3 versions

Visual speech recognition for Kannada language using VGG16 convolutional neural network

S Rudregowda, S Patil Kulkarni, G HL, V Ravi… - Acoustics, 2023 - mdpi.com

Visual speech recognition (VSR) is a method of reading speech by noticing the lip actions of
the narrators. Visual speech significantly depends on the visual features derived from the …

Cited by 13 · Related articles · All 10 versions

A Survey of Long‐Tail Item Recommendation Methods

J Qin - Wireless Communications and Mobile Computing, 2021 - Wiley Online Library

Recommender systems represent a critical field of AI technology applications. The core
function of a recommender system is to recommend items of interest to users, but if it is only …

Cited by 6 · Related articles · All 7 versions

Read my lips: Artificial intelligence word-level Arabic lipreading system

W Dweik, S Altorman, S Ashour - Egyptian Informatics Journal, 2022 - Elsevier

Lipreading is the ability to recognize words or sentences from the mouth movements of a
speaking person. This process is also known as Visual Speech Recognition (VSR) …

Cited by 9 · Related articles

Robust Uncertainty Quantification Using Conformalised Monte Carlo Prediction

D Bethell, S Gerasimou, R Calinescu - Proceedings of the AAAI …, 2024 - ojs.aaai.org

Deploying deep learning models in safety-critical applications remains a very challenging
task, mandating the provision of assurances for the dependable operation of these models …

Cited by 1 · Related articles · All 7 versions

Seamless authentication for online teaching and meeting

M Mohanty, W Yaqub - 2020 IEEE Sixth International …, 2020 - ieeexplore.ieee.org

The lockdowns and travel restrictions in the current coronavirus pandemic situation has
replaced face-to-face teaching and meeting with the online alternatives. Recently, the video …

Cited by 10 · Related articles · All 4 versions

Recent developments in generative adversarial networks: A review (workshop paper)

A Yadav, DK Vishwakarma - 2020 IEEE Sixth International …, 2020 - ieeexplore.ieee.org

In recent times, Generative Adversarial Networks (GANs) have created a lot of buzz in the
research community. GANs are formulated on the zero-sum game theory, where two neural …

Cited by 7 · Related articles · All 3 versions

Trinity: Syncretizing Multi-/Long-tail/Long-term Interests All in One

J Yan, L Jiang, J Cui, Z Zhao, X Bin, F Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org

Interest modeling in recommender system has been a constant topic for improving user
experience, and typical interest modeling tasks (e.g. multi-interest, long-tail interest and long …

DFS: A Diverse Feature Synthesis Model for Generalized Zero-Shot Learning

B Li, Y Hu, C Han, T Guo - 2022 26th International Conference …, 2022 - ieeexplore.ieee.org

Generative based strategy has shown great potential in the Generalized Zero-Shot Learning
task. However, it suffers severe generalization problem due to lacking of feature diversity for …