Music Source Separation
54 papers with code • 3 benchmarks • 7 datasets
Music source separation is the task of decomposing music into its constitutive components, e. g., yielding separated stems for the vocals, bass, and drums.
( Image credit: SigSep )
Libraries
Use these libraries to find Music Source Separation models and implementationsMost implemented papers
A cappella: Audio-visual Singing Voice Separation
The task of isolating a target singing voice in music videos has useful applications.
Unsupervised Music Source Separation Using Differentiable Parametric Source Models
Integrating domain knowledge in the form of source models into a data-driven method leads to high data efficiency: the proposed approach achieves good separation quality even when trained on less than three minutes of audio.
Hybrid Transformers for Music Source Separation
While it performs poorly when trained only on MUSDB, we show that it outperforms Hybrid Demucs (trained on the same data) by 0. 45 dB of SDR when using 800 extra training songs.
The Sound Demixing Challenge 2023 $\unicode{x2013}$ Music Demixing Track
We propose a formalization of the errors that can occur in the design of a training dataset for MSS systems and introduce two new datasets that simulate such errors: SDXDB23_LabelNoise and SDXDB23_Bleeding.
Music Source Separation Based on a Lightweight Deep Learning Framework (DTTNET: DUAL-PATH TFC-TDF UNET)
Music source separation (MSS) aims to extract 'vocals', 'drums', 'bass' and 'other' tracks from a piece of mixed music.
Pre-training Music Classification Models via Music Source Separation
In this paper, we study whether music source separation can be used as a pre-training strategy for music representation learning, targeted at music classification tasks.
SCNet: Sparse Compression Network for Music Source Separation
We use a higher compression ratio on subbands with less information to improve the information density and focus on modeling subbands with more information.
MMDenseLSTM: An efficient combination of convolutional and recurrent neural networks for audio source separation
Deep neural networks have become an indispensable technique for audio source separation (ASS).
Improving DNN-based Music Source Separation using Phase Features
Music source separation with deep neural networks typically relies only on amplitude features.
Semi-Supervised Monaural Singing Voice Separation With a Masking Network Trained on Synthetic Mixtures
We study the problem of semi-supervised singing voice separation, in which the training data contains a set of samples of mixed music (singing and instrumental) and an unmatched set of instrumental music.