Using regional saliency for speech emotion recognition Z Aldeneh, EM Provost Acoustics, Speech and Signal Processing (ICASSP), 2017 IEEE International …, 2017 | 155 | 2017 |
Progressive neural networks for transfer learning in emotion recognition J Gideon, S Khorram, Z Aldeneh, D Dimitriadis, EM Provost arXiv preprint arXiv:1706.03256, 2017 | 148 | 2017 |
Discretized Continuous Speech Emotion Recognition with Multi-Task Deep Recurrent Neural Network. D Le, Z Aldeneh, EM Provost Interspeech, 1108-1112, 2017 | 86 | 2017 |
Pooling acoustic and lexical features for the prediction of valence Z Aldeneh, S Khorram, D Dimitriadis, EM Provost Proceedings of the 19th ACM International Conference on Multimodal …, 2017 | 50 | 2017 |
Capturing long-term temporal dependencies with convolutional networks for continuous emotion recognition S Khorram, Z Aldeneh, D Dimitriadis, M McInnis, EM Provost arXiv preprint arXiv:1708.07050, 2017 | 49 | 2017 |
Improving end-of-turn detection in spoken dialogues by detecting speaker intentions as a secondary task Z Aldeneh, D Dimitriadis, EM Provost 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 25 | 2018 |
Muse-ing on the impact of utterance ordering on crowdsourced emotion annotations M Jaiswal, Z Aldeneh, CP Bara, Y Luo, M Burzo, R Mihalcea, EM Provost ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 18 | 2019 |
Identifying mood episodes using dialogue features from clinical interviews Z Aldeneh, M Jaiswal, M Picheny, M McInnis, EM Provost arXiv preprint arXiv:1910.05115, 2019 | 16 | 2019 |
Self-supervised learning of visual speech features with audiovisual speech enhancement Z Aldeneh, AP Kumar, BJ Theobald, E Marchi, S Kajarekar, D Naik, ... arXiv preprint arXiv:2004.12031, 2020 | 14* | 2020 |
Controlling for confounders in multimodal emotion classification via adversarial learning M Jaiswal, Z Aldeneh, E Mower Provost 2019 International conference on multimodal interaction, 174-184, 2019 | 13 | 2019 |
Aphasic speech recognition using a mixture of speech intelligibility experts M Perez, Z Aldeneh, EM Provost arXiv preprint arXiv:2008.10788, 2020 | 12 | 2020 |
Wild wild emotion: a multimodal ensemble approach J Gideon, B Zhang, Z Aldeneh, Y Kim, S Khorram, D Le, EM Provost Proceedings of the 18th ACM International Conference on Multimodal …, 2016 | 10 | 2016 |
You're Not You When You're Angry: Robust Emotion Features Emerge by Recognizing Speakers Z Aldeneh, EM Provost IEEE Transactions on Affective Computing 14 (2), 1351-1362, 2021 | 9 | 2021 |
End-of-turn detection in spoken dialogues L Polymenakos, DB Dimitriadis, Z Aldeneh, EM Provost US Patent 10,957,320, 2021 | 6 | 2021 |
ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models J Jung, W Zhang, J Shi, Z Aldeneh, T Higuchi, BJ Theobald, AH Abdelaziz, ... arXiv preprint arXiv:2401.17230, 2024 | 4 | 2024 |
Spatial LibriSpeech: An Augmented Dataset for Spatial Audio Learning M Sarabia, E Menyaylenko, A Toso, S Seto, Z Aldeneh, S Pirhosseinloo, ... arXiv preprint arXiv:2308.09514, 2023 | 3 | 2023 |
On the Role of LIP Articulation in Visual Speech Perception Z Aldeneh, M Fedzechkina, S Seto, K Metcalf, M Sarabia, N Apostoloff, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 3* | 2023 |
Naturalistic Head Motion Generation from Speech T Mittal, Z Aldeneh, M Fedzechkina, A Ranjan, BJ Theobald ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 1 | 2023 |
Learning paralinguistic features from audiobooks through style voice conversion Z Aldeneh, M Perez, EM Provost Proceedings of the 2021 Conference of the North American Chapter of the …, 2021 | 1 | 2021 |
Can you Remove the Downstream Model for Speaker Recognition with Self-Supervised Speech Features? Z Aldeneh, T Higuchi, J Jung, S Seto, T Likhomanenko, S Shum, ... arXiv preprint arXiv:2402.00340, 2024 | | 2024 |