Deep convolutional neural networks for large-scale speech tasks TN Sainath, B Kingsbury, G Saon, H Soltau, A Mohamed, G Dahl, ... Neural networks 64, 39-48, 2015 | 2010 | 2015 |
Speaker adaptation of neural network acoustic models using i-vectors G Saon, H Soltau, D Nahamoo, M Picheny 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 55-59, 2013 | 769 | 2013 |
Neural speech recognizer: Acoustic-to-word LSTM model for large vocabulary speech recognition H Soltau, H Liao, H Sak arXiv preprint arXiv:1610.09975, 2016 | 396 | 2016 |
fMPE: Discriminatively trained features for speech recognition D Povey, B Kingsbury, L Mangu, G Saon, H Soltau, G Zweig Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech …, 2005 | 370 | 2005 |
Improvements to deep convolutional neural networks for LVCSR TN Sainath, B Kingsbury, A Mohamed, GE Dahl, G Saon, H Soltau, ... 2013 IEEE workshop on automatic speech recognition and understanding, 315-320, 2013 | 306 | 2013 |
Scalable Minimum Bayes Risk Training of Deep Neural Network Acoustic Models Using Distributed Hessian-free Optimization. B Kingsbury, TN Sainath, H Soltau Interspeech, 10-13, 2012 | 281 | 2012 |
A one-pass decoder based on polymorphic linguistic context assignment H Soltau, F Metze, C Fügen, A Waibel IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU …, 2001 | 251 | 2001 |
Method and system for efficient spoken term detection using confusion networks BED Kingsbury, HK Kuo, L Mangu, H Soltau US Patent 9,196,243, 2015 | 218 | 2015 |
Classifier-based system combination for spoken term detection BED Kingsbury, HKJ Kuo, LL Mangu, H Soltau US Patent 9,477,753, 2016 | 210 | 2016 |
Google USM: Scaling automatic speech recognition beyond 100 languages Y Zhang, W Han, J Qin, Y Wang, A Bapna, Z Chen, N Chen, B Li, ... arXiv preprint arXiv:2303.01037, 2023 | 193 | 2023 |
Advances in automatic meeting record creation and access A Waibel, M Bett, F Metze, K Ries, T Schaaf, T Schultz, H Soltau, H Yu, ... 2001 IEEE International Conference on Acoustics, Speech, and Signal …, 2001 | 180 | 2001 |
Recognition of music types H Soltau, T Schultz, M Westphal, A Waibel Proceedings of the 1998 IEEE International Conference on Acoustics, Speech …, 1998 | 173 | 1998 |
The IBM Attila speech recognition toolkit H Soltau, G Saon, B Kingsbury 2010 IEEE Spoken Language Technology Workshop, 97-102, 2010 | 171 | 2010 |
Advances in speech transcription at IBM under the DARPA EARS program SF Chen, B Kingsbury, L Mangu, D Povey, G Saon, H Soltau, G Zweig IEEE Transactions on Audio, Speech, and Language Processing 14 (5), 1596-1608, 2006 | 163 | 2006 |
Analyzing convolutional neural networks for speech activity detection in mismatched acoustic conditions S Thomas, S Ganapathy, G Saon, H Soltau 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 144 | 2014 |
The IBM 2004 conversational telephony system for rich transcription H Soltau, B Kingsbury, L Mangu, D Povey, G Saon, G Zweig Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech …, 2005 | 144 | 2005 |
Joint training of convolutional and non-convolutional neural networks H Soltau, G Saon, TN Sainath 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 116 | 2014 |
Acoustic-to-word neural network speech recognizer H Soltau, H Sak, H Liao US Patent App. 15/834,254, 2018 | 115 | 2018 |
Joint speech recognition and speaker diarization via sequence transduction LE Shafey, H Soltau, I Shafran arXiv preprint arXiv:1907.05337, 2019 | 111 | 2019 |
Method and system for joint training of hybrid neural networks for acoustic modeling in automatic speech recognition GA Saon, H Soltau US Patent 9,665,823, 2017 | 90 | 2017 |