A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining
Language-queried audio source separation (LASS) aims to separate an audio source
guided by a text query, with the signal-to-distortion ratio (SDR)-based metrics being …
FlowSep: Language-Queried Sound Separation with Rectified Flow Matching
Language-queried audio source separation (LASS) focuses on separating sounds using
textual descriptions of the desired sources. Current methods mainly use discriminative …