Professionally produced music recordings


Introduction


According to many positive requests from the community, we decided to repeat this task.

The task includes the following data:

Results


Test Data

Download test1.zip (external link) (7 MB) (Test data of SiSEC2008 (external link) )
Download test2.zip (external link) (9 MB) (Test data of SiSEC 2010 (external link))
Download test2_full_mix.zip (external link) (79 MB) (full-length recordings for SiSEC2010 data)

The data consist of stereo WAV audio files, that can be imported in Matlab using the wavread command. These files are named {test1,test2}__[<author>]-[<song>]___[<snip>]__{mix,full_mix}.wav , where <author> is the author name, <song> is the song name, and <snip> is a shortcut for snip information.

The data include the following mixtures (snips and full-length recordings):

test1
  • test1__tamy-que_pena_tanto_faz__snip__mix.wav
  • test1__bearlin-roads__snip__mix.wav
test2
  • test2__glen_philips-the_spirit_of_shackleton__snip_163_185__mix.wav
  • test2__nine_inch_nails-the_good_soldier__snip_104_125__mix.wav
  • test2__shannon_hurley-sunrise__snip_62_85__mix.wav
test2_full_mix
  • test2__glen_philips-the_spirit_of_shackleton__full_mix.wav
  • test2__nine_inch_nails-the_good_soldier__full_mix.wav
  • test2__shannon_hurley-sunrise__full_mix.wav

Development Data


Download dev1.zip (external link) (22 MB) (Development data of SiSEC 2008 (external link))
Download dev2.zip (external link) (36 MB) (Development data of SiSEC 2010 (external link))
Download dev2_full_mix.zip (external link) (75 MB) (full-length recordings for SiSEC2010 data)

The data consist of stereo WAV audio files, that can be imported in Matlab using the wavread command. These files are named {dev1,dev2}__ [ <author> ] - [ <song> ]__[ <snip> ]__ {mix,full_mix,<track>}.wav, where <author> is the author name, <song> is the song name, <snip> is a shortcut for snip information, and <track> is the separated track name (e.g., "vocals", "bass", etc.).

The data include the following mixtures (snips and full-length recordings):

dev1
  • dev1__bearlin-roads__snip_85_99__mix.wav
  • dev1__tamy-que_pena_tanto_faz__snip_6_19__mix.wav
dev2
  • dev2__another_dreamer-the_ones_we_love__snip_69_94__mix.wav
  • dev2__fort_minor-remember_the_name__snip_54_78__mix.wav
  • dev2__ultimate_nz_tour__snip_43_61__mix.wav
dev2_full_mix
  • dev2__another_dreamer-the_ones_we_love__full_mix.wav
  • dev2__fort_minor-remember_the_name__full_mix.wav
  • dev2__ultimate_nz_tour__full_mix.wav

Separated tracks files (needed for evaluation in dev1 and dev2) are in the corresponding folders named {dev1,dev2}__[<author>]-[<song>]__tracks .

License


All audio files are distributed under the terms different licenses, as listed below for each recodring:

All the former test and development data (test1 and dev1) are from MTG MASS database (external link) by M. Nxx.

All the remixes of newly proposed data (dev2 and test2) are done by Michel Desnoues from Telecom ParisTech .


Tasks


The following should be taken in to account:
  • Note that only 20 seconds snips are asked to be separated, and not full-length recordings.
  • Some track names below have the following meaning:
    • "vocals" = "a sum of any singing including main vocal, back vocals and singing in the reverb"
    • "drums" = "a sum of any drums including bass drum, hi-hat, snare etc."
    • "bass" = "bass guitar only (i.e., not bass drum)"

Test Tasks


test1__tamy-que_pena_tanto_faz__snip__mix.wav
Extract the following stereo tracks:
  • vocals
  • guitar

test1__bearlin-roads__snip__mix.wav
Extract the following stereo tracks:
  • vocals
  • bass
  • piano

test2__glen_philips-the_spirit_of_shackleton__snip_163_185__mix.wav
Extract the following stereo tracks:
  • vocals
  • drums
  • bass

test2__nine_inch_nails-the_good_soldier__snip_104_125__mix.wav
Extract the following stereo tracks:
  • vocals
  • drums

test2__shannon_hurley-sunrise__snip_62_85__mix.wav
Extract the following stereo tracks:
  • vocals
  • drums
  • bass
  • piano

Development Tasks


dev2__another_dreamer-the_ones_we_love__snip_69_94__mix.wav
Extract the following stereo tracks:
  • vocals
  • drums
  • guitar

dev2__fort_minor-remember_the_name__snip_54_78__mix.wav
Extract the following stereo tracks:
  • vocals
  • drums
  • bass
  • claps

dev2__ultimate_nz_tour__snip_43_61__mix.wav
Extract the following stereo tracks:
  • vocals
  • drums
  • bass

Submission


Participants may submit separation results for any above-mentioned tracks of any above (test and development) mixtures.

In addition, each participant is asked to provide basic information about his/her algorithm (e.g. a bibliographical reference) and to declare its average running time, expressed in seconds per test excerpt and per GHz of CPU.

Note that only 20 seconds snips are asked to be separated, and not full-length recordings.

How to submit


Each participant should make his results available online in the form of a tarball called <YourName>_<dataset>.zip.

The included files must be named as follows:
<dataset>_<author>-<song>__<snip>_<trackname>.wav
where <dataset> is one of the test/test2/dev2, <filename> is a shortcut for the set of source signals, <trackname> is the name of the extracted track.

(E.g., The estimated vocal track for the task file "test2_glen_philips-the_spirit_of_shackleton_snip_163_185_mix.wav" should be named as "test2_glen_philips-the_spirit_of_shackleton_snip_163_185_vocals.wav".)

Each participant should then send an email to "araki.shoko (at) lab.ntt.co.jp" and "nesta (a) fbk.eu" providing:

    • contact information (name, affiliation)
    • basic information about his/her algorithm, including its average running time (in seconds per test excerpt and per GHz of CPU) and a bibliographical reference if possible
    • the URL of the tarball(s)

The submitted audio files will be made available on a website under the terms of the same license as indicated in the section Licenses above. In other words, any modified version inherit exactly the same license as the original.

Evaluation criteria


The same basic evaluation criteria as for the under-determined speech and music mixtures dataset will be used first so that results are comparable. More precisely, the estimated stereo source signals will be evaluated via the criteria used for the Stereo Audio Source Separation Evaluation Campaign (external link), except that the order of the sources is fixed. These criteria distinguish spatial (or filtering) distortion, interference and artifacts.

Additional evaluation will be provided through the perceptual evaluation toolkit PEASS (external link).

Potential participants


  • M. Nxx
  • Vasileios Pantazis
  • Alexey Ozerov (alexey.ozerov (a) irisa_fr)
  • Jeanlouis Durrieu (durrieu (a) enst_fr)
  • Maximo Cobos (mcobos (a) iteam_upv_es)
  • Pablo Cancela (pcancela (a) gmail.com)
  • Antoine Liutkus (antoine.liutkus (a) telecom-paristech.fr)
  • Pierre Leveau (pierre.leveau (a) audionamix.com)
  • Jordi Janer (jordi.janer (a) upf.edu)
  • Nobutaka Ono (onono (a) nii.ac.jp)
Task proposed by Audio Committee

Menu

Rechercher avec Google

 
sisec2011.wiki.irisa.fr
WWW