Title | Authors | Venue | Cited by | Year |
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis | D Raj, P Denisov, Z Chen, H Erdogan, Z Huang, M He, S Watanabe, J Du, ... | 2021 IEEE Spoken Language Technology Workshop (SLT), 897-904, 2021 | 85 | 2021 |
ESPnet-SLU: Advancing Spoken Language Understanding Through ESPnet | S Arora, S Dalmia, P Denisov, X Chang, Y Ueda, Y Peng, Y Zhang, ... | ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 70 | 2022 |
Investigations on speech recognition systems for low-resource dialectal Arabic–English code-switching speech | I Hamed, P Denisov, CY Li, M Elmahdy, S Abdennadher, NT Vu | Computer Speech & Language 72, 101278, 2022 | 42 | 2022 |
Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning | P Denisov, NT Vu | Interspeech 2020, 881-885, 2020 | 33 | 2020 |
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning | P Denisov, NT Vu | Interspeech 2019, 4425-4429, 2019 | 25 | 2019 |
Speaker Anonymization with Phonetic Intermediate Representations | S Meyer, F Lux, P Denisov, J Koch, P Tilli, NT Vu | Interspeech 2022, 4925-4929, 2022 | 23 | 2022 |
Unsupervised domain adaptation by adversarial learning for robust speech recognition | P Denisov, NT Vu, MF Font | Speech Communication; 13th ITG-Symposium, 1-5, 2018 | 22 | 2018 |
Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy | S Meyer, P Tilli, P Denisov, F Lux, J Koch, NT Vu | 2022 IEEE Spoken Language Technology Workshop (SLT), 912-919, 2023 | 18 | 2023 |
The IMS Toucan System for the Blizzard Challenge 2023 | F Lux, J Koch, S Meyer, T Bott, N Schauffler, P Denisov, A Schweitzer, ... | 18th Blizzard Challenge Workshop, 2023 | 17 | 2023 |
Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study | X Chang, B Yan, K Choi, JW Jung, Y Lu, S Maiti, R Sharma, J Shi, J Tian, ... | ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 15 | 2024 |
IMS-speech: A speech to text tool | P Denisov, NT Vu | Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung …, 2019 | 14 | 2019 |
Context-aware Neural-based Dialog Act Classification on Automatically Generated Transcriptions | D Ortega, CY Li, G Vallejo, P Denisov, NT Vu | ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 13 | 2019 |
ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and Socially-engaged Conversational Agents | CY Li, D Ortega, D Väth, F Lux, L Vanderlyn, M Schmidt, M Neumann, ... | arXiv preprint arXiv:2005.01777, 2020 | 11 | 2020 |
Prosody Is Not Identity: A Speaker Anonymization Approach Using Prosody Cloning | S Meyer, F Lux, J Koch, P Denisov, P Tilli, NT Vu | ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 9 | 2023 |
Findings of the Second AmericasNLP Competition on Speech-to-Text Translation | A Ebrahimi, M Mager, A Wiemerslage, P Denisov, A Oncevay, D Liu, ... | NeurIPS 2022 Competition Track 220, 217-232, 2022 | 4 | 2022 |
IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task | P Denisov, M Mager, NT Vu | 2021 International Conference on Spoken Language Translation (IWSLT), 175-181, 2021 | 4 | 2021 |
Findings of the AmericasNLP 2024 shared task on the creation of educational materials for indigenous languages | L Chiruzzo, P Denisov, A Molina-Villegas, SF Sabido, R Coto-Solano, ... | Proceedings of the 4th Workshop on Natural Language Processing for …, 2024 | 2 | 2024 |
Cascade of Phonetic Speech Recognition, Speaker Embeddings GAN and Multispeaker Speech Synthesis for the VoicePrivacy 2022 Challenge | S Meyer, P Tilli, F Lux, P Denisov, J Koch, NT Vu | 2nd Symposium on Security and Privacy in Speech Communication, 2022 | 2 | 2022 |
Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training | P Denisov, T Vu | Findings of the Association for Computational Linguistics: NAACL 2024, 814–834, 2024 | 1 | 2024 |
Leveraging Multilingual Self-Supervised Pretrained Models for Sequence-to-Sequence End-to-End Spoken Language Understanding | P Denisov, NT Vu | 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | | 2023 |