Multi30K: Multilingual English-German image descriptions D Elliott, S Frank, K Sima'an, L Specia Proceedings of the 5th Workshop on Vision and Language, 2016 | 625 | 2016 |
Automatic description generation from images: A survey of models, datasets, and evaluation measures R Bernardi, R Cakici, D Elliott, A Erdem, E Erdem, N Ikizler-Cinbis, ... Journal of Artificial Intelligence Research 55, 409-442, 2016 | 450 | 2016 |
Image Description using Visual Dependency Representations D Elliott, F Keller Conference on Empirical Methods in Natural Language Processing, 1292-1302, 2013 | 362 | 2013 |
How2: a large-scale dataset for multimodal language understanding R Sanabria, O Caglayan, S Palaskar, D Elliott, L Barrault, L Specia, ... Workshop on Visually Grounded Interaction and Language, 2018 | 266 | 2018 |
A shared task on multimodal machine translation and crosslingual image description L Specia, S Frank, K Sima'an, D Elliott Proceedings of the First Conference on Machine Translation: Volume 2, Shared …, 2016 | 255 | 2016 |
Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description D Elliott, S Frank, L Barrault, F Bougares, L Specia Proceedings of the Second Conference on Machine Translation, 215-233, 2017 | 240 | 2017 |
Findings of the third shared task on multimodal machine translation L Barrault, F Bougares, L Specia, C Lala, D Elliott, S Frank Proceedings of the Third Conference on Machine Translation, 308-327, 2018 | 167 | 2018 |
Imagination improves multimodal translation D Elliott, A Kádár Proceedings of the Eighth International Joint Conference on Natural Language …, 2017 | 159 | 2017 |
Comparing Automatic Evaluation Measures for Image Description D Elliott, F Keller Proceedings of the 52nd Annual Meeting of the Association for Computational …, 2014 | 154 | 2014 |
Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-Language BERTs E Bugliarello, R Cotterell, N Okazaki, D Elliott Transactions of the Association for Computational Linguistics, 2021 | 147 | 2021 |
Visually Grounded Reasoning across Languages and Cultures F Liu, E Bugliarello, EM Ponti, S Reddy, N Collier, D Elliott Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021 | 108 | 2021 |
Multi-language Image Description with Neural Sequence Models D Elliott, S Frank, E Hasler arXiv preprint arXiv:1510.04709, 2015 | 106 | 2015 |
Adversarial Evaluation of Multimodal Machine Translation D Elliott Proceedings of the 2018 Conference on Empirical Methods in Natural Language …, 2018 | 98 | 2018 |
Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers S Frank, E Bugliarello, D Elliott Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021 | 80 | 2021 |
Multimodal machine translation through visuals and speech U Sulubacak, O Caglayan, SA Grönroos, A Rouhe, D Elliott, L Specia, ... Machine Translation 34, 97-147, 2020 | 75 | 2020 |
Describing Images using Inferred Visual Dependency Representations D Elliott, A de Vries Proceedings of the 53rd Annual Meeting of the Association for Computational …, 2015 | 64 | 2015 |
Revisiting Transformer-based Models for Long Document Classification X Dai, I Chalkidis, S Darkner, D Elliott Findings of Empirical Methods in Natural Language Processing, 2022 | 55 | 2022 |
Adversarial removal of demographic attributes revisited M Barrett, Y Kementchedjhieva, Y Elazar, D Elliott, A Søgaard Proceedings of the 2019 Conference on Empirical Methods in Natural Language …, 2019 | 53 | 2019 |
Measuring the diversity of automatic image descriptions E Van Miltenburg, D Elliott, P Vossen Proceedings of the 27th International Conference on Computational …, 2018 | 50 | 2018 |
Compositional Generalization in Image Captioning M Nikolaus, M Abdou, M Lamm, R Aralikatte, D Elliott Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019 | 48 | 2019 |