Music Source Separation

54 papers with code • 3 benchmarks • 7 datasets

Music source separation is the task of decomposing music into its constitutive components, e. g., yielding separated stems for the vocals, bass, and drums.

( Image credit: SigSep )

Benchmarks

Add a Result

These leaderboards are used to track progress in Music Source Separation

Dataset	Best Model	Compare
MUSDB18	Sparse HT Demucs (fine tuned)	See all
MUSDB18-HQ	BS-RoFormer (L=12, OA)	See all
Slakh2100	LQ-VAE + Scalable Transformer	See all

Libraries

Use these libraries to find Music Source Separation models and implementations

FaceOnLive/Spleeter-Android-iOS

4 papers

190

Datasets

Most implemented papers

Most implemented Social Latest No code

A cappella: Audio-visual Singing Voice Separation

JuanFMontesinos/Acappella-YNet • • 20 Apr 2021

The task of isolating a target singing voice in music videos has useful applications.

Paper
Code

Unsupervised Music Source Separation Using Differentiable Parametric Source Models

schufo/umss • • 24 Jan 2022

Integrating domain knowledge in the form of source models into a data-driven method leads to high data efficiency: the proposed approach achieves good separation quality even when trained on less than three minutes of audio.

Paper
Code

Hybrid Transformers for Music Source Separation

facebookresearch/demucs • • 15 Nov 2022

While it performs poorly when trained only on MUSDB, we show that it outperforms Hybrid Demucs (trained on the same data) by 0. 45 dB of SDR when using 800 extra training songs.

Paper
Code

The Sound Demixing Challenge 2023 $\unicode{x2013}$ Music Demixing Track

zfturbo/mvsep-mdx23-music-separation-model • • 14 Aug 2023

We propose a formalization of the errors that can occur in the design of a training dataset for MSS systems and introduce two new datasets that simulate such errors: SDXDB23_LabelNoise and SDXDB23_Bleeding.

Paper
Code

Music Source Separation Based on a Lightweight Deep Learning Framework (DTTNET: DUAL-PATH TFC-TDF UNET)

junyuchen-cjy/dttnet-pytorch • • 15 Sep 2023

Music source separation (MSS) aims to extract 'vocals', 'drums', 'bass' and 'other' tracks from a piece of mixed music.

Paper
Code

Pre-training Music Classification Models via Music Source Separation

cgaroufis/msspt • • 24 Oct 2023

In this paper, we study whether music source separation can be used as a pre-training strategy for music representation learning, targeted at music classification tasks.

Paper
Code

SCNet: Sparse Compression Network for Music Source Separation

starrytong/SCNet • • 24 Jan 2024

We use a higher compression ratio on subbands with less information to improve the information density and focus on modeling subbands with more information.

Paper
Code

MMDenseLSTM: An efficient combination of convolutional and recurrent neural networks for audio source separation

tsurumeso/vocal-remover • • 7 May 2018

Deep neural networks have become an indispensable technique for audio source separation (ASS).

Paper
Code

Improving DNN-based Music Source Separation using Phase Features

aRI0U/music-source-separation • • 7 Jul 2018

Music source separation with deep neural networks typically relies only on amplitude features.

Paper
Code

Semi-Supervised Monaural Singing Voice Separation With a Masking Network Trained on Synthetic Mixtures

sagiebenaim/Singing • • 14 Dec 2018

We study the problem of semi-supervised singing voice separation, in which the training data contains a set of samples of mixed music (singing and instrumental) and an unmatched set of instrumental music.

Paper
Code

Music Source Separation

Benchmarks Add a Result

Libraries

Datasets

Most implemented papers

Content

Benchmarks

Add a Result