[HTML][HTML] A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets

K Bayoudh, R Knani, F Hamdaoui, A Mtibaa - The Visual Computer, 2022 - Springer
The research progress in multimodal learning has grown rapidly over the last decade in
several areas, especially in computer vision. The growing potential of multimodal data …

Advances and challenges in deep lip reading

M Oghbaie, A Sabaghi, K Hashemifard… - arXiv preprint arXiv …, 2021 - arxiv.org
Driven by deep learning techniques and large-scale datasets, recent years have witnessed
a paradigm shift in automatic lip reading. While the main thrust of Visual Speech …

[HTML][HTML] Visual speech recognition for kannada language using vgg16 convolutional neural network

S Rudregowda, S Patil Kulkarni, G HL, V Ravi… - Acoustics, 2023 - mdpi.com
Visual speech recognition (VSR) is a method of reading speech by noticing the lip actions of
the narrators. Visual speech significantly depends on the visual features derived from the …

A Survey of Long‐Tail Item Recommendation Methods

J Qin - Wireless Communications and Mobile Computing, 2021 - Wiley Online Library
Recommender systems represent a critical field of AI technology applications. The core
function of a recommender system is to recommend items of interest to users, but if it is only …

[HTML][HTML] Read my lips: Artificial intelligence word-level arabic lipreading system

W Dweik, S Altorman, S Ashour - Egyptian Informatics Journal, 2022 - Elsevier
Lipreading is the ability to recognize words or sentences from the mouth movements of a
speaking person. This process is also known as Visual Speech Recognition (VSR) …

Robust Uncertainty Quantification Using Conformalised Monte Carlo Prediction

D Bethell, S Gerasimou, R Calinescu - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Deploying deep learning models in safety-critical applications remains a very challenging
task, mandating the provision of assurances for the dependable operation of these models …

Seamless authentication for online teaching and meeting

M Mohanty, W Yaqub - 2020 IEEE Sixth International …, 2020 - ieeexplore.ieee.org
The lockdowns and travel restrictions in the current coronavirus pandemic situation has
replaced face-to-face teaching and meeting with the online alternatives. Recently, the video …

Recent developments in generative adversarial networks: A review (workshop paper)

A Yadav, DK Vishwakarma - 2020 IEEE Sixth International …, 2020 - ieeexplore.ieee.org
In recent times, Generative Adversarial Networks (GANs) have created a lot of buzz in the
research community. GANs are formulated on the zero-sum game theory, where two neural …

Trinity: Syncretizing Multi-/Long-tail/Long-term Interests All in One

J Yan, L Jiang, J Cui, Z Zhao, X Bin, F Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Interest modeling in recommender system has been a constant topic for improving user
experience, and typical interest modeling tasks (eg multi-interest, long-tail interest and long …

DFS: A Diverse Feature Synthesis Model for Generalized Zero-Shot Learning

B Li, Y Hu, C Han, T Guo - 2022 26th International Conference …, 2022 - ieeexplore.ieee.org
Generative based strategy has shown great potential in the Generalized Zero-Shot Learning
task. However, it suffers severe generalization problem due to lacking of feature diversity for …