All-neural online source separation, counting, and diarization for meeting analysis T Von Neumann, K Kinoshita, M Delcroix, S Araki, T Nakatani, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 105 | 2019 |
Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR T von Neumann, C Boeddeker, L Drude, K Kinoshita, M Delcroix, ... arXiv preprint arXiv:2006.02786, 2020 | 47 | 2020 |
End-to-end training of time domain audio separation and recognition T von Neumann, K Kinoshita, L Drude, C Boeddeker, M Delcroix, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 39 | 2020 |
Deep attractor networks for speaker re-identification and blind source separation L Drude, T von Neumann, R Haeb-Umbach 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 36 | 2018 |
Monaural source separation: From anechoic to reverberant environments T Cord-Landwehr, C Boeddeker, T Von Neumann, C Zorilă, R Doddipatla, ... 2022 international workshop on acoustic signal enhancement (IWAENC), 1-5, 2022 | 27 | 2022 |
Graph-PIT: Generalized permutation invariant training for continuous separation of arbitrary numbers of speakers T von Neumann, K Kinoshita, C Boeddeker, M Delcroix, R Haeb-Umbach arXiv preprint arXiv:2107.14446, 2021 | 25 | 2021 |
On word error rate definitions and their efficient computation for multi-speaker speech recognition systems T von Neumann, C Boeddeker, K Kinoshita, M Delcroix, R Haeb-Umbach ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 18 | 2023 |
SA-SDR: A novel loss function for separation of meeting style data T von Neumann, K Kinoshita, C Boeddeker, M Delcroix, R Haeb-Umbach ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 16 | 2022 |
Multi-path RNN for hierarchical modeling of long sequential data and its application to speaker stream separation K Kinoshita, T von Neumann, M Delcroix, T Nakatani, R Haeb-Umbach arXiv preprint arXiv:2006.13579, 2020 | 10 | 2020 |
MMS-MSG: A multi-purpose multi-speaker mixture signal generator T Cord-Landwehr, T Von Neumann, C Boeddeker, R Haeb-Umbach 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 1-5, 2022 | 9 | 2022 |
An initialization scheme for meeting separation with spatial mixture models C Boeddeker, T Cord-Landwehr, T von Neumann, R Haeb-Umbach arXiv preprint arXiv:2204.01338, 2022 | 8 | 2022 |
Speeding up permutation invariant training for source separation T von Neumann, C Boeddeker, K Kinoshita, M Delcroix, R Haeb-Umbach Speech Communication; 14th ITG Conference, 1-5, 2021 | 7 | 2021 |
A meeting transcription system for an ad-hoc acoustic sensor network T Gburrek, C Boeddeker, T von Neumann, T Cord-Landwehr, ... arXiv preprint arXiv:2205.00944, 2022 | 6 | 2022 |
Segment-less continuous speech separation of meetings: Training and evaluation criteria T von Neumann, K Kinoshita, C Boeddeker, M Delcroix, R Haeb-Umbach IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 576-589, 2022 | 4 | 2022 |
Meeting recognition with continuous speech separation and transcription-supported diarization T von Neumann, C Boeddeker, T Cord-Landwehr, M Delcroix, ... arXiv preprint arXiv:2309.16482, 2023 | 3 | 2023 |
Estimation device, learning device, estimation method, learning method, and recording medium K Kinoshita, M Delcroix, T Nakatani, S Araki, L Drude, TC Von Neumann US Patent 11,456,003, 2022 | 3 | 2022 |
Utterance-by-utterance overlap-aware neural diarization with Graph-PIT K Kinoshita, T von Neumann, M Delcroix, C Boeddeker, R Haeb-Umbach arXiv preprint arXiv:2207.13888, 2022 | 3 | 2022 |
MeetEval: A toolkit for computation of word error rates for meeting transcription systems T von Neumann, C Boeddeker, M Delcroix, R Haeb-Umbach arXiv preprint arXiv:2307.11394, 2023 | 2 | 2023 |
Multi-stage diarization refinement for the CHiME-7 DASR scenario CB Boeddeker, T Cord-Landwehr, T Neumann, R Haeb-Umbach Proc. CHiME 2023, 51-56, 2023 | 1 | 2023 |
Mixture Encoder Supporting Continuous Speech Separation for Meeting Recognition P Vieting, S Berger, T von Neumann, C Boeddeker, R Schlüter, ... arXiv preprint arXiv:2309.08454, 2023 | | 2023 |