Sequential vision to language as story: A storytelling dataset and benchmarking

ZM Malakan, S Anwar, GM Hassan, A Mian - IEEE Access, 2023 - ieeexplore.ieee.org
Storytelling is a remarkable human skill that plays a significant role in learning and
experiencing everyday life. Developing narratives is central to human mental health …

Vision transformer based model for describing a set of images as a story

ZM Malakan, GM Hassan, A Mian - Australasian Joint Conference on …, 2022 - Springer
Abstract Visual Story-Telling is the process of forming a multi sentence story from a set of
images. Appropriately including visual variation and contextual information captured inside …

Sequential Image Storytelling Model Based on Transformer Attention Pooling

ZM Malakan, GM Hassan, A Mian - 2023 38th International …, 2023 - ieeexplore.ieee.org
The Visual Storytelling Task (VST) extends beyond describing a single image, such as
image captioning, to sequential image descriptions in the form of a coherent story. However …

Automatic Generation of a Coherent Story from a Set of Images

ZM Malakan - 2024 - research-repository.uwa.edu.au
This dissertation explores vision and language (V&L) algorithms. While (V&L) succeeds in
image and video captioning tasks, the dynamic Visual Storytelling Task (VST) remains …