ContraBAR: Contrastive Bayes-adaptive deep RL
In meta reinforcement learning (meta RL), an agent seeks a Bayes-optimal policy: the
optimal policy when facing an unknown task that is sampled from some known task …
C-SENN: Contrastive self-explaining neural network
Y Sawada, K Nakamura - arXiv preprint arXiv:2206.09575, 2022 - arxiv.org
In this study, we use a self-explaining neural network (SENN), which learns unsupervised
concepts, to acquire concepts that are easy for people to understand automatically. In …
Exploring temporal granularity in self-supervised video representation learning
This work presents a self-supervised learning framework named TeG to explore Temporal
Granularity in learning video representations. In TeG, we sample a long clip from a video …
On Temporal Granularity in Self-Supervised Video Representation Learning.
This work presents an empirical exploration of temporal granularity in self-supervised video
representation learning. While state-of-the-art methods commonly enforce the learned …
Achieving timestamp prediction while recognizing with non-autoregressive end-to-end ASR model
Conventional ASR systems use frame-level phoneme posterior to conduct force-alignment
(FA) and provide timestamps, while end-to-end ASR systems especially AED based ones …
Learning to Represent and Recognize Multimodal Videos
R Qian - 2023 - search.proquest.com
In today's digital landscape, the staggering growth of video resources has resulted in a
wealth of visual, auditory, and textual information readily available on the internet. To fully …
Xian Shi, Yanni Chen, Shiliang Zhang, and Zhijie Yan, Speech Lab, Alibaba Group, Hangzhou, China {shixian.shi, cyn244124, sly.zsl, zhijie.yzj}@alibaba-inc.com
ATP While, ASR End-to-End - Man-Machine Speech …, 2023 - books.google.com
Conventional ASR systems use frame-level phoneme posterior to conduct force-alignment
(FA) and provide timestamps, while end-to-end ASR systems especially AED based ones are …