[HTML][HTML] An experimental review of speaker diarization methods with application to two-speaker conversational telephone speech recordings
We performed an experimental review of current diarization systems for the conversational
telephone speech (CTS) domain. In detail, we considered a total of eight different algorithms …
telephone speech (CTS) domain. In detail, we considered a total of eight different algorithms …
[PDF][PDF] Robust entropy-based endpoint detection for speech recognition in noisy environments.
J Shen, J Hung, L Lee - ICSLP, 1998 - labrosa.ee.columbia.edu
This paper presents an entropy-based algorithm for accurate and robust endpoint detection
for speech recognition under noisy environments. Instead of using the conventional energy …
for speech recognition under noisy environments. Instead of using the conventional energy …
NetSTAT: A network-based intrusion detection approach
G Vigna, RA Kemmerer - Proceedings 14th Annual Computer …, 1998 - ieeexplore.ieee.org
Network-based attacks have become common and sophisticated. For this reason, intrusion
detection systems are now shifting their focus from the hosts and their operating systems to …
detection systems are now shifting their focus from the hosts and their operating systems to …
Spot me if you can: Uncovering spoken phrases in encrypted voip conversations
Despite the rapid adoption of Voice over IP (VoIP), its security implications are not yet fully
understood. Since VoIP calls may traverse untrusted networks, packets should be encrypted …
understood. Since VoIP calls may traverse untrusted networks, packets should be encrypted …
Towards a practical lipreading system
A practical lipreading system can be considered either as subject dependent (SD) or subject-
independent (SI). An SD system is user-specific, ie, customized for some particular user …
independent (SI). An SD system is user-specific, ie, customized for some particular user …
Robust endpoint detection algorithm based on the adaptive band-partitioning spectral entropy in adverse environments
In speech processing, endpoint detection in noisy environments is difficult, especially in the
presence of nonstationary noise. Robust endpoint detection is one of the most important …
presence of nonstationary noise. Robust endpoint detection is one of the most important …
[PDF][PDF] Entropy based voice activity detection in very noisy conditions.
P Renevey, A Drygajlo - INTERSPEECH, 2001 - researchgate.net
This paper addresses the problem of robust voice activity detection (VAD) capable for
working at very low signal-to-noise ratios (SNR< 10dB). A new algorithm that we propose is …
working at very low signal-to-noise ratios (SNR< 10dB). A new algorithm that we propose is …
An end-to-end multimodal voice activity detection using wavenet encoder and residual networks
I Ariav, I Cohen - IEEE Journal of Selected Topics in Signal …, 2019 - ieeexplore.ieee.org
Recently, there has been growing use of deep neural networks in many modern speech-
based systems such as speaker recognition, speech enhancement, and emotion …
based systems such as speaker recognition, speech enhancement, and emotion …
A novel approach to robust speech endpoint detection in car environments
L Huang, C Yang - 2000 IEEE International Conference on …, 2000 - ieeexplore.ieee.org
In the process of speech recognition, it is especially crucial to precisely locate endpoints of
the input utterance to be free of non-speech regions. This paper proposes a novel approach …
the input utterance to be free of non-speech regions. This paper proposes a novel approach …
System and method for multimodal utterance detection
AP Rao - US Patent 9,922,640, 2018 - Google Patents
The disclosure describe a system and method for detecting one or more segments of
desired speech utterances from an audio stream using timings of events from other modes …
desired speech utterances from an audio stream using timings of events from other modes …