Data leakage in cross-modal retrieval training: A case study
The recent progress in text-based audio retrieval was largely propelled by the release of
suitable datasets. Since the manual creation of such datasets is a laborious task, obtaining …
suitable datasets. Since the manual creation of such datasets is a laborious task, obtaining …
Enhancing Audio Retrieval with Attention-based Encoder for Audio Feature Representation
Pretrained audio neural networks (PANNs) has been successful in a range of machine
audition applications. But its limitation in recognising relationships between acoustic scenes …
audition applications. But its limitation in recognising relationships between acoustic scenes …