Learning representations for new sound classes with continual self-supervised learning

Y Xiao, RK Das - arXiv preprint arXiv:2407.03657, 2024 - arxiv.org

This work explores class-incremental learning (CIL) for sound event detection (SED),
advancing adaptability towards real-world scenarios. CIL's success in domains like …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Sequence-level knowledge distillation for class-incremental end-to-end spoken language understanding

U Cappellazzo, M Yang, D Falavigna… - arXiv preprint arXiv …, 2023 - arxiv.org

The ability to learn new concepts sequentially is a major weakness for modern neural
networks, which hinders their use in non-stationary environments. Their propensity to fit the …

被引用次数：5 相关文章所有 5 个版本

[PDF] arxiv.org

Class-Incremental Learning for Multi-Label Audio Classification

M Mulimani, A Mesaros - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org

In this paper, we propose a method for class-incremental learning of potentially overlapping
sounds for solving a sequence of multi-label audio classification tasks. We design an …

被引用次数：4 相关文章所有 3 个版本

Online detection method for variable load conditions and anomalous sound of hydro turbines using correlation analysis and PCA-adaptive-K-means

X Xu, H Wen, HJ Lin, ZX Li, C Huang - Measurement, 2024 - Elsevier

The hydro turbine is the critical component of hydropower stations. Sound analysis provides
a versatile non-contact approach for detecting anomalies or faults of hydro turbines …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

Unsupervised improvement of audio-text cross-modal representations

Z Wang, C Subakan, K Subramani, J Wu… - … IEEE Workshop on …, 2023 - ieeexplore.ieee.org

Recent advances in using language models to obtain cross-modal audio-text
representations have overcome the limitations of conventional training approaches that use …

被引用次数：3 相关文章所有 4 个版本

DCCN: A dual-cross contrastive neural network for 3D point cloud representation learning

X Wu, G Shi, Z Zhao, M Li, X Gao, X Yan - Expert Systems with Applications, 2024 - Elsevier

The proliferation of depth cameras and LiDAR sensors in actual industrial environments has
fueled the pursuit of an effective and efficient 3D point cloud model that enables us to …

[PDF] arxiv.org

Self-supervised learning for infant cry analysis

A Gorin, C Subakan, S Abdoli, J Wang… - … , Speech, and Signal …, 2023 - ieeexplore.ieee.org

In this paper, we explore self-supervised learning (SSL) for analyzing a first-of-its-kind
database of cry recordings containing clinical indications of more than a thousand …

被引用次数：4 相关文章所有 3 个版本

[PDF] arxiv.org

Continual Contrastive Spoken Language Understanding

U Cappellazzo, E Fini, M Yang, D Falavigna… - arXiv preprint arXiv …, 2023 - arxiv.org

Recently, neural networks have shown impressive progress across diverse fields, with
speech processing being no exception. However, recent breakthroughs in this area require …

Few-Shot Class-Incremental Audio Classification With Adaptive Mitigation of Forgetting and Overfitting

Y Li, J Li, Y Si, J Tan, Q He - IEEE/ACM Transactions on Audio …, 2024 - ieeexplore.ieee.org

Few-shot Class-incremental Audio Classification (FCAC) is a task to continuously identify
incremental classes with only few training samples after training the model on base classes …

[PDF] arxiv.org

Characterizing Continual Learning Scenarios and Strategies for Audio Analysis

R Bhatt, P Kumari, D Mahapatra, AE Saddik… - arXiv preprint arXiv …, 2024 - arxiv.org

Audio analysis is useful in many application scenarios. The state-of-the-art audio analysis
approaches assume that the data distribution at training and deployment time will be the …