[PDF][PDF] End to end spoken language diarization with Wav2vec embeddings

J Mishra, JN Patil, A Chowdhury, M Prasanna - Proc. Interspeech, 2023 - isca-archive.org
The performance of the available end-to-end (E2E) spoken language diarization (LD)
systems is biased towards primary language. This is due to the unavailability of sufficient …

Implicit Self-supervised Language Representation for Spoken Language Diarization

J Mishra, SRM Prasanna - IEEE/ACM Transactions on Audio …, 2024 - ieeexplore.ieee.org
The use of spoken language diarization (LD) as a preprocessing system might be essential
in a code-switched (CS) scenario. Furthermore, implicit frameworks are preferable to explicit …