UCIL: An Unsupervised Class Incremental Learning Approach for Sound Event Detection

Y Xiao, RK Das - arXiv preprint arXiv:2407.03657, 2024 - arxiv.org
This work explores class-incremental learning (CIL) for sound event detection (SED),
advancing adaptability towards real-world scenarios. CIL's success in domains like …

Sequence-level knowledge distillation for class-incremental end-to-end spoken language understanding

U Cappellazzo, M Yang, D Falavigna… - arXiv preprint arXiv …, 2023 - arxiv.org
The ability to learn new concepts sequentially is a major weakness for modern neural
networks, which hinders their use in non-stationary environments. Their propensity to fit the …

Class-Incremental Learning for Multi-Label Audio Classification

M Mulimani, A Mesaros - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
In this paper, we propose a method for class-incremental learning of potentially overlapping
sounds for solving a sequence of multi-label audio classification tasks. We design an …

Online detection method for variable load conditions and anomalous sound of hydro turbines using correlation analysis and PCA-adaptive-K-means

X Xu, H Wen, HJ Lin, ZX Li, C Huang - Measurement, 2024 - Elsevier
The hydro turbine is the critical component of hydropower stations. Sound analysis provides
a versatile non-contact approach for detecting anomalies or faults of hydro turbines …

Unsupervised improvement of audio-text cross-modal representations

Z Wang, C Subakan, K Subramani, J Wu… - … IEEE Workshop on …, 2023 - ieeexplore.ieee.org
Recent advances in using language models to obtain cross-modal audio-text
representations have overcome the limitations of conventional training approaches that use …

DCCN: A dual-cross contrastive neural network for 3D point cloud representation learning

X Wu, G Shi, Z Zhao, M Li, X Gao, X Yan - Expert Systems with Applications, 2024 - Elsevier
The proliferation of depth cameras and LiDAR sensors in actual industrial environments has
fueled the pursuit of an effective and efficient 3D point cloud model that enables us to …

Self-supervised learning for infant cry analysis

A Gorin, C Subakan, S Abdoli, J Wang… - … , Speech, and Signal …, 2023 - ieeexplore.ieee.org
In this paper, we explore self-supervised learning (SSL) for analyzing a first-of-its-kind
database of cry recordings containing clinical indications of more than a thousand …

Continual Contrastive Spoken Language Understanding

U Cappellazzo, E Fini, M Yang, D Falavigna… - arXiv preprint arXiv …, 2023 - arxiv.org
Recently, neural networks have shown impressive progress across diverse fields, with
speech processing being no exception. However, recent breakthroughs in this area require …

Few-Shot Class-Incremental Audio Classification With Adaptive Mitigation of Forgetting and Overfitting

Y Li, J Li, Y Si, J Tan, Q He - IEEE/ACM Transactions on Audio …, 2024 - ieeexplore.ieee.org
Few-shot Class-incremental Audio Classification (FCAC) is a task to continuously identify
incremental classes with only a few training samples after training the model on base classes …

Characterizing Continual Learning Scenarios and Strategies for Audio Analysis

R Bhatt, P Kumari, D Mahapatra, AE Saddik… - arXiv preprint arXiv …, 2024 - arxiv.org
Audio analysis is useful in many application scenarios. The state-of-the-art audio analysis
approaches assume that the data distribution at training and deployment time will be the …