How to train your deep multi-object tracker Y Xu, A Osep, Y Ban, R Horaud, L Leal-Taixé, X Alameda-Pineda Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 228 | 2020 |
TransCenter: Transformers with Dense Representations for Multiple-Object Tracking Y Xu*, Y Ban*, G Delorme, C Gan, D Rus, X Alameda-Pineda IEEE Transactions on Pattern Analysis & Machine Intelligence, 1-16, 2022 | 222* | 2022 |
Computer vision in surgery TM Ward, P Mascagni, Y Ban, G Rosman, N Padoy, O Meireles, ... Surgery 169 (5), 1253-1256, 2021 | 116 | 2021 |
Tracking multiple persons based on a variational Bayesian model Y Ban, S Ba, X Alameda-Pineda, R Horaud European Conference on Computer Vision, 52-67, 2016 | 84 | 2016 |
Automated operative phase identification in peroral endoscopic myotomy TM Ward, DA Hashimoto, Y Ban, DW Rattner, H Inoue, KD Lillemoe, ... Surgical Endoscopy 35, 4008-4015, 2021 | 60 | 2021 |
DeepMOT: A differentiable framework for training multiple object trackers Y Xu, Y Ban, X Alameda-Pineda, R Horaud arXiv preprint arXiv:1906.06618 10 (11), 2019 | 57 | 2019 |
Variational Bayesian inference for audio-visual tracking of multiple speakers Y Ban, X Alameda-Pineda, L Girin, R Horaud IEEE Transactions on Pattern Analysis and Machine Intelligence 43 (5), 1761-1776, 2019 | 56 | 2019 |
SAGES consensus recommendations on an annotation framework for surgical video OR Meireles, G Rosman, MS Altieri, L Carin, G Hager, A Madani, N Padoy, ... Surgical Endoscopy 35 (9), 4918-4929, 2021 | 53 | 2021 |
Online localization and tracking of multiple moving speakers in reverberant environments X Li*, Y Ban*, L Girin, X Alameda-Pineda, R Horaud IEEE Journal of Selected Topics in Signal Processing 13 (1), 88-103, 2019 | 44* | 2019 |
Challenges in surgical video annotation TM Ward, DM Fer, Y Ban, G Rosman, OR Meireles, DA Hashimoto Computer Assisted Surgery 26 (1), 58-68, 2021 | 43 | 2021 |
A deep network for arousal-valence emotion prediction with acoustic-visual cues S Peng, L Zhang, Y Ban, M Fang, S Winkler arXiv preprint arXiv:1805.00638, 2018 | 30 | 2018 |
Exploiting the complementarity of audio and visual data in multi-speaker tracking Y Ban, L Girin, X Alameda-Pineda, R Horaud Proceedings of the IEEE International Conference on Computer Vision …, 2017 | 28 | 2017 |
Tracking a varying number of people with a visually-controlled robotic head Y Ban, X Alameda-Pineda, F Badeig, S Ba, R Horaud 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2017 | 25 | 2017 |
Enhancing direct‐path relative transfer function using deep neural network for robust sound source localization B Yang, R Ding, Y Ban, X Li, H Liu CAAI Transactions on Intelligence Technology 7 (3), 446-454, 2022 | 24 | 2022 |
Tracking multiple audio sources with the von Mises distribution and variational EM Y Ban, X Alameda-Pineda, C Evers, R Horaud IEEE Signal Processing Letters 26 (6), 798-802, 2019 | 21 | 2019 |
Accounting for room acoustics in audio-visual multi-speaker tracking Y Ban, X Li, X Alameda-Pineda, L Girin, R Horaud 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 19 | 2018 |
Aggregating Long-Term Context for Learning Laparoscopic and Robot-Assisted Surgical Workflows Y Ban, G Rosman, T Ward, D Hashimoto, T Kondo, H Iwaki, O Meireles, ... IEEE International Conference on Robotics and Automation (ICRA), 2021 | 17* | 2021 |
Artificial intelligence prediction of cholecystectomy operative course from automated identification of gallbladder inflammation TM Ward, DA Hashimoto, Y Ban, G Rosman, OR Meireles Surgical Endoscopy 36 (9), 6832-6840, 2022 | 16 | 2022 |
Transformers with dense queries for multiple-object tracking Y Xu, Y Ban, G Delorme, C Gan, D Rus, XT Alameda-Pineda arXiv preprint arXiv:2103.15145, 2021 | 16 | 2021 |
SUPR-GAN: surgical prediction GAN for event anticipation in laparoscopic and robotic surgery Y Ban, G Rosman, JA Eckhoff, TM Ward, DA Hashimoto, T Kondo, H Iwaki, ... IEEE Robotics and Automation Letters 7 (2), 5741-5748, 2022 | 15* | 2022 |