What's in a Caption? Dataset-Specific Linguistic Diversity and Its Effect on Visual Description...

L Xu, Q Tang, J Lv, B Zheng, X Zeng, W Li - Neurocomputing, 2023 - Elsevier

Image captioning, also called report generation in medical field, aims to describe visual
content of images in human language, which requires to model semantic relationship …

被引用次数：34 相关文章所有 2 个版本

[PDF] arxiv.org

Ic3: Image captioning by committee consensus

DM Chan, A Myers, S Vijayanarasimhan… - arXiv preprint arXiv …, 2023 - arxiv.org

If you ask a human to describe an image, they might do so in a thousand different ways.
Traditionally, image captioning models are trained to generate a single" best"(most like a …

被引用次数：15 相关文章所有 5 个版本

[PDF] arxiv.org

Distribution aware metrics for conditional natural language generation

DM Chan, Y Ni, DA Ross, S Vijayanarasimhan… - arXiv preprint arXiv …, 2022 - arxiv.org

Traditional automated metrics for evaluating conditional natural language generation use
pairwise comparisons between a single generated text and the best-matching gold-standard …

被引用次数：7 相关文章所有 4 个版本

[PDF] arxiv.org

[引用][C] 深度学习图像描述方法分析与展望

赵永强，金芝，张峰，赵海燕，陶政为，豆乘风，徐新海… - 中国图象图形学报, 2023

被引用次数：2 相关文章所有 2 个版本