Confidence Measures in Encoder-Decoder Models for Speech Recognition. A Woodward, C Bonnín, I Masuda, D Varas, E Bou-Balust, JC Riveiro INTERSPEECH, 611-615, 2020 | 19 | 2020 |
Towards automatic generation of question answer pairs from images IM Mora, SP de la Puente, XG Nieto Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2016 | 18 | 2016 |
Vits: video tagging system from massive web multimedia collections D Fernández, D Varas, J Espadaler, I Masuda, J Ferreira, A Woodward, ... Proceedings of the IEEE International Conference on Computer Vision …, 2017 | 17 | 2017 |
Open-ended visual question-answering I Masuda, SP de la Puente, X Giro-i-Nieto arXiv preprint arXiv:1610.02692, 2016 | 11* | 2016 |
Smooth proxy-anchor loss for noisy metric learning C Roig, D Varas, I Masuda, JC Riveiro, E Bou-Balust arXiv preprint arXiv:2006.05142, 2020 | 6 | 2020 |
Generalized local attention pooling for deep metric learning C Roig, D Varas, I Masuda, JC Riveiro, E Bou-Balust 2020 25th International Conference on Pattern Recognition (ICPR), 9951-9958, 2021 | 4 | 2021 |
Multi-modal pyramid feature combination for human action recognition C Roig, M Sarmiento, D Varas, I Masuda, JC Riveiro, E Bou-Balust 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW …, 2019 | 4 | 2019 |
Unsupervised multi-label dataset generation from web data C Roig, D Varas, I Masuda, JC Riveiro, E Bou-Balust arXiv preprint arXiv:2005.05623, 2020 | 2 | 2020 |
What is going on in the world? A display platform for media understanding D Fernandez, J Espadaler, D Varas, I Masuda, A Colom, D Rodriguez, ... 2018 IEEE Conference on Multimedia Information Processing and Retrieval …, 2018 | | 2018 |
Visual Question Answering 2.0 F Roldán Sánchez Universitat Politècnica de Catalunya, 2017 | | 2017 |
Unsupervised Large-Scale World Locations Dataset C Roig, D Varas, I Masuda, M Sarmiento, G Floriach, J Espadaler, ... | | |