A review on video summarization techniques

P Meena, H Kumar, SK Yadav - Engineering Applications of Artificial …, 2023 - Elsevier
The exponential growth of technology has resulted in a profusion of advanced imaging
devices and eases internet accessibility, leading to an increase in the creation and use of …

Video summarization using deep neural networks: A survey

E Apostolidis, E Adamantidou, AI Metsai… - Proceedings of the …, 2021 - ieeexplore.ieee.org
Video summarization technologies aim to create a concise and complete synopsis by
selecting the most informative parts of the video content. Several approaches have been …

The dawn of quantum natural language processing

R Di Sipio, JH Huang, SYC Chen… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
In this paper, we discuss the initial attempts at boosting understanding human language
based on deep-learning models with quantum computing. We successfully train a quantum …

Towards fine-grained citation evaluation in generated text: A comparative analysis of faithfulness metrics

W Zhang, M Aliannejadi, Y Yuan, J Pei… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) often produce unsupported or unverifiable content, known
as" hallucinations." To mitigate this, retrieval-augmented LLMs incorporate citations …

Optimizing numerical estimation and operational efficiency in the legal domain through large language models

JH Huang, CC Yang, Y Shen, AM Pacces… - Proceedings of the 33rd …, 2024 - dl.acm.org
The legal landscape encompasses a wide array of lawsuit types, presenting lawyers with
challenges in delivering timely and accurate information to clients, particularly concerning …

Expert-defined keywords improve interpretability of retinal image captioning

TW Wu, JH Huang, J Lin… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Automatic machine learning-based (ML-based) medical report generation systems for retinal
images suffer from a relative lack of interpretability. Hence, such ML-based systems are still …

Extracting keyframes of breast ultrasound video using deep reinforcement learning

R Huang, Q Ying, Z Lin, Z Zheng, L Tan, G Tang… - Medical Image …, 2022 - Elsevier
Ultrasound (US) plays a vital role in breast cancer screening, especially for women with
dense breasts. Common practice requires a sonographer to recognize key diagnostic …

A novel evaluation framework for image2text generation

JH Huang, H Zhu, Y Shen, S Rudinac… - arXiv preprint arXiv …, 2024 - arxiv.org
Evaluating the quality of automatically generated image descriptions is challenging,
requiring metrics that capture various aspects such as grammaticality, coverage …

Deepopht: medical report generation for retinal images via deep models and visual explanation

JH Huang, CHH Yang, F Liu, M Tian… - Proceedings of the …, 2021 - openaccess.thecvf.com
In this work, we propose an AI-based method that intends to improve the conventional retinal
disease treatment procedure and help ophthalmologists increase diagnosis efficiency and …

Gpt2mvs: Generative pre-trained transformer-2 for multi-modal video summarization

JH Huang, L Murn, M Mrak, M Worring - Proceedings of the 2021 …, 2021 - dl.acm.org
Traditional video summarization methods generate fixed video representations regardless of
user interest. Therefore such methods limit users' expectations in content search and …