[HTML][HTML] A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets
K Bayoudh, R Knani, F Hamdaoui, A Mtibaa - The Visual Computer, 2022 - Springer
The research progress in multimodal learning has grown rapidly over the last decade in
several areas, especially in computer vision. The growing potential of multimodal data …
several areas, especially in computer vision. The growing potential of multimodal data …
Advances and challenges in deep lip reading
M Oghbaie, A Sabaghi, K Hashemifard… - arXiv preprint arXiv …, 2021 - arxiv.org
Driven by deep learning techniques and large-scale datasets, recent years have witnessed
a paradigm shift in automatic lip reading. While the main thrust of Visual Speech …
a paradigm shift in automatic lip reading. While the main thrust of Visual Speech …
[HTML][HTML] Visual speech recognition for kannada language using vgg16 convolutional neural network
Visual speech recognition (VSR) is a method of reading speech by noticing the lip actions of
the narrators. Visual speech significantly depends on the visual features derived from the …
the narrators. Visual speech significantly depends on the visual features derived from the …
A Survey of Long‐Tail Item Recommendation Methods
J Qin - Wireless Communications and Mobile Computing, 2021 - Wiley Online Library
Recommender systems represent a critical field of AI technology applications. The core
function of a recommender system is to recommend items of interest to users, but if it is only …
function of a recommender system is to recommend items of interest to users, but if it is only …
[HTML][HTML] Read my lips: Artificial intelligence word-level arabic lipreading system
W Dweik, S Altorman, S Ashour - Egyptian Informatics Journal, 2022 - Elsevier
Lipreading is the ability to recognize words or sentences from the mouth movements of a
speaking person. This process is also known as Visual Speech Recognition (VSR) …
speaking person. This process is also known as Visual Speech Recognition (VSR) …
Robust Uncertainty Quantification Using Conformalised Monte Carlo Prediction
Deploying deep learning models in safety-critical applications remains a very challenging
task, mandating the provision of assurances for the dependable operation of these models …
task, mandating the provision of assurances for the dependable operation of these models …
Seamless authentication for online teaching and meeting
The lockdowns and travel restrictions in the current coronavirus pandemic situation has
replaced face-to-face teaching and meeting with the online alternatives. Recently, the video …
replaced face-to-face teaching and meeting with the online alternatives. Recently, the video …
Recent developments in generative adversarial networks: A review (workshop paper)
A Yadav, DK Vishwakarma - 2020 IEEE Sixth International …, 2020 - ieeexplore.ieee.org
In recent times, Generative Adversarial Networks (GANs) have created a lot of buzz in the
research community. GANs are formulated on the zero-sum game theory, where two neural …
research community. GANs are formulated on the zero-sum game theory, where two neural …
Trinity: Syncretizing Multi-/Long-tail/Long-term Interests All in One
Interest modeling in recommender system has been a constant topic for improving user
experience, and typical interest modeling tasks (eg multi-interest, long-tail interest and long …
experience, and typical interest modeling tasks (eg multi-interest, long-tail interest and long …
DFS: A Diverse Feature Synthesis Model for Generalized Zero-Shot Learning
Generative based strategy has shown great potential in the Generalized Zero-Shot Learning
task. However, it suffers severe generalization problem due to lacking of feature diversity for …
task. However, it suffers severe generalization problem due to lacking of feature diversity for …