Audio Source Separation Using Variational Autoencoders and Weak Class Supervision

Source Code

The source code is available on GitHub.

Examples of the Separation Results

Mixture

3 | Soft Mask 1 | Original Source 1 | Separated Source 1 | SDR 10.00 | SIR 12.97 | SAR 13.25
9 | Soft Mask 2 | Original Source 2 | Separated Source 2 | SDR 9.37 | SIR 11.24 | SAR 14.26

Mixture

4 | Soft Mask 1 | Original Source 1 | Separated Source 1 | SDR 10.54 | SIR 14.83 | SAR 12.70
9 | Soft Mask 2 | Original Source 2 | Separated Source 2 | SDR 9.14 | SIR 11.08 | SAR 13.90

Mixture

2 | Soft Mask 1 | Original Source 1 | Separated Source 1 | SDR 13.65 | SIR 15.80 | SAR 17.84
6 | Soft Mask 2 | Original Source 2 | Separated Source 2 | SDR 14.94 | SIR 18.17 | SAR 17.82

Mixture

1 | Soft Mask 1 | Original Source 1 | Separated Source 1 | SDR 14.98 | SIR 20.44 | SAR 16.47
5 | Soft Mask 2 | Original Source 2 | Separated Source 2 | SDR 13.51 | SIR 16.13 | SAR 17.05

Mixture

3 | Soft Mask 1 | Original Source 1 | Separated Source 1 | SDR 18.08 | SIR 20.66 | SAR 21.59
5 | Soft Mask 2 | Original Source 2 | Separated Source 2 | SDR 19.45 | SIR 23.44 | SAR 21.68

Mixture

0 | Soft Mask 1 | Original Source 1 | Separated Source 1 | SDR 4.97 | SIR 5.87 | SAR 13.22
5 | Soft Mask 2 | Original Source 2 | Separated Source 2 | SDR 7.46 | SIR 12.84 | SAR 9.17

Mixture

8 | Soft Mask 1 | Original Source 1 | Separated Source 1 | SDR 8.87 | SIR 9.98 | SAR 15.74
9 | Soft Mask 2 | Original Source 2 | Separated Source 2 | SDR 12.68 | SIR 17.09 | SAR 14.72

Mixture

1 | Soft Mask 1 | Original Source 1 | Separated Source 1 | SDR 13.69 | SIR 36.13 | SAR 13.71
6 | Soft Mask 2 | Original Source 2 | Separated Source 2 | SDR 4.60 | SIR 4.75 | SAR 20.55

Mixture

0 | Soft Mask 1 | Original Source 1 | Separated Source 1 | SDR 12.49 | SIR 14.14 | SAR 17.68
1 | Soft Mask 2 | Original Source 2 | Separated Source 2 | SDR 15.49 | SIR 20.63 | SAR 17.11

Mixture

2 | Soft Mask 1 | Original Source 1 | Separated Source 1 | SDR 2.63 | SIR 3.10 | SAR 14.25
8 | Soft Mask 2 | Original Source 2 | Separated Source 2 | SDR 8.93 | SIR 10.80 | SAR 13.84
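
For readers who want to relate the Soft Mask and SDR entries above to a computation, the following is a minimal Python sketch, not the authors' implementation. It assumes ratio-style soft masks (each source estimate divided by the sum of all estimates) applied to a magnitude spectrogram of the mixture. It also uses a simplified SNR-style distortion ratio rather than the full BSS Eval decomposition that is commonly used to report SDR, SIR, and SAR. The function names `soft_masks` and `simple_sdr` and the toy data are hypothetical.

```python
import numpy as np


def soft_masks(source_estimates, eps=1e-8):
    """Ratio masks (assumed form): each estimate divided by the sum of all estimates."""
    est = np.asarray(source_estimates, dtype=float)
    return est / (est.sum(axis=0, keepdims=True) + eps)


def simple_sdr(reference, estimate, eps=1e-8):
    """SNR-style distortion ratio in dB. This is a simplification,
    not the BSS Eval SDR, and it does not produce SIR or SAR."""
    reference = np.asarray(reference, dtype=float)
    estimate = np.asarray(estimate, dtype=float)
    error = reference - estimate
    return 10.0 * np.log10((np.sum(reference ** 2) + eps) / (np.sum(error ** 2) + eps))


# Toy magnitude "spectrograms" (frequency x time); the values are made up.
rng = np.random.default_rng(0)
s1 = rng.random((4, 5))
s2 = rng.random((4, 5))
mixture = s1 + s2

# In practice the estimates would come from a trained model; the true
# sources are used here only to keep the sketch self-contained.
masks = soft_masks([s1, s2])   # shape (2, 4, 5)
separated = masks * mixture    # each mask is applied to the same mixture

print("mask sums close to 1:", np.allclose(masks.sum(axis=0), 1.0, atol=1e-4))
print("simplified SDR of source 1 (dB):", simple_sdr(s1, separated[0]))
```

Because the masks for all sources sum to approximately one in every time-frequency bin, the separated sources add back up to the mixture, which is the usual motivation for ratio masks of this kind.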