Automation of language sample analysis
Purpose: A major barrier to the wider use of language sample analysis (LSA) is the fact that
transcription is very time intensive. Methods that can reduce the required time and effort …
Attention-Based Encoder-Decoder End-to-End Neural Diarization With Embedding Enhancer
Deep neural network-based systems have significantly improved the performance of
speaker diarization tasks. However, end-to-end neural diarization (EEND) systems often …
Meeting recognition with continuous speech separation and transcription-supported diarization
T Von Neumann, C Boeddeker… - … , Speech, and Signal …, 2024 - ieeexplore.ieee.org
We propose a modular pipeline for the single-channel separation, recognition, and
diarization of meeting-style recordings and evaluate it on the Libri-CSS dataset. Using a …
Unified modeling of multi-talker overlapped speech recognition and diarization with a sidecar separator
Multi-talker overlapped speech poses a significant challenge for speech recognition and
diarization. Recent research indicated that these two tasks are inter-dependent and …
Optimized speaker change detection approach for speaker segmentation towards speaker diarization based on deep learning
K VijayKumar - Data & Knowledge Engineering, 2023 - Elsevier
Speaker diarization is the partitioning of an audio source stream into homogeneous
segments according to the speaker's identity. It can improve the readability of an automatic …
Sec-gan for robust speaker recognition with emotional state dismatch
D Li, Z Yang, Z Wang, M Hua - Biomedical Signal Processing and Control, 2023 - Elsevier
Speaker recognition is often dependent on the speaker and susceptible to emotional factors,
thus mostly decreasing recognition performance, we propose a framework by combining …