[PDF][PDF] End to end spoken language diarization with Wav2vec embeddings
The performance of the available end-to-end (E2E) spoken language diarization (LD)
systems is biased towards primary language. This is due to the unavailability of sufficient …
systems is biased towards primary language. This is due to the unavailability of sufficient …
Implicit Self-supervised Language Representation for Spoken Language Diarization
J Mishra, SRM Prasanna - IEEE/ACM Transactions on Audio …, 2024 - ieeexplore.ieee.org
The use of spoken language diarization (LD) as a preprocessing system might be essential
in a code-switched (CS) scenario. Furthermore, implicit frameworks are preferable to explicit …
in a code-switched (CS) scenario. Furthermore, implicit frameworks are preferable to explicit …