History: Underdetermined speech and music mixtures

Comparison of version 9 to version 20


@@ -Lines: 3-6 changed to +Lines: 3-9 @@
We propose to repeat the [http://sisec2010.wiki.irisa.fr/tiki-index.php?page=Underdetermined-+speech+and+music+mixtures|underdetermined-speech and music mixtures] task in SiSEC2010 with fresh test data.
+ !! Results
+ *Results for development sets: [http://www.irisa.fr/metiss/SiSEC11/underdetermined/underdetermined_dev1_all.html|dev1], [http://www.irisa.fr/metiss/SiSEC11/underdetermined/underdetermined_dev2_all.html|dev2], [http://www.irisa.fr/metiss/SiSEC11/underdetermined/underdetermined_dev3_all.html|dev3]
+ *Results for test sets: [http://www.irisa.fr/metiss/SiSEC11/underdetermined/underdetermined_test_all.html|test], [http://www.irisa.fr/metiss/SiSEC11/underdetermined/underdetermined_test2_all.html|test2], [http://www.irisa.fr/metiss/SiSEC11/underdetermined/underdetermined_test3_all.html|test3]

!! Test data

@@ -Lines: 9-13 changed to +Lines: 12-16 @@
__Download [http://www.irisa.fr/metiss/SiSEC10/underdetermined/test.zip|test.zip] (22 MB)__ (former test data of [http://sisec2008.wiki.irisa.fr/tiki-index.php?page=Under-determined+speech+and+music+mixtures|SiSEC2008].)
__Download [http://www.irisa.fr/metiss/SiSEC10/underdetermined/test2.zip|test2.zip] (16 MB)__ (former test data of [http://sisec2010.wiki.irisa.fr/tiki-index.php?page=Underdetermined-+speech+and+music+mixtures|SiSEC2010].)
- __Download[https://www.irisa.fr/metiss/members/test3/download|test3.zip] (8.6MB)__(~~red:fresh~~ data for SiSEC2011. This is the 3-ch mixtures of 4 speech sources.)
+ __Download [http://www.irisa.fr/metiss/SiSEC11/underdetermined/test3.zip|test3.zip] (8.6 MB)__ (~~red:fresh~~ data for SiSEC2011. These are 3-ch mixtures of 4 speech sources.)

!!!test.zip

@@ -Lines: 59-63 changed to +Lines: 62-66 @@
* __simulated recordings__ (static sources filtered by impulse responses recorded in a real room situation with loudspeakers and omnidirectional microphones)
- The room dimension for simulated recordings was 4.45 x 3.55 x 2.5 m, and the distances between the sources and the center of the microphone pair was 1.0 m. The reverberation time for simulated recordings was set to either 130 ms or 380 ms and the distance between the two microphones to either 5 cm or 50 cm. Therefore, 5 mixing conditions are considered, together with instantaneous mixtures.
+ The room dimensions for simulated recordings were 4.45 x 3.55 x 2.5 m, and the distance between each source and the center of the microphone array (linear array) was 1.0 m. The reverberation time for simulated recordings was set to either 130 ms or 380 ms and the distance between the two microphones to either 5 cm or 50 cm. Therefore, 5 mixing conditions are considered, together with instantaneous mixtures.

For each mixing condition, 2 mixture signals have been generated from different sets of source signals placed at different spatial positions:

@@ -Lines: 75-79 changed to +Lines: 78-82 @@
__Download [http://www.irisa.fr/metiss/SiSEC10/underdetermined/dev2.zip|dev2.zip] (47 MB)__
(Both are the former development data of [http://sisec2008.wiki.irisa.fr/tiki-index.php?page=Under-determined+speech+and+music+mixtures|SiSEC2008] and [http://sisec2010.wiki.irisa.fr/tiki-index.php?page=Underdetermined-+speech+and+music+mixtures|SiSEC2010])
-
+ __Download [http://www.irisa.fr/metiss/SiSEC11/underdetermined/dev3.zip|dev3.zip] (47 MB)__ (~~red:Fresh~~ development data for 3-ch mixtures.)

The data consist of Matlab MAT-files and WAV audio files, that can be imported in Matlab using the commands load and wavread respectively. These files are named as follows:

@@ -Lines: 87-92 changed to +Lines: 90-107 @@

where <srcset> is a shortcut for the set of source signals, <mixtype> a shortcut for the mixture type, <reverb> the reverberation time, <spacing> the microphone spacing and <j> the source index.
+

All mixture signals and source image signals have 10s duration. Music source signals have 11s duration to avoid border effects within convolutive mixtures. The last 10s are then selected once the mixing system has been applied.
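The trimming step described above can be sketched as follows. This is only an illustration under stated assumptions, not the organizers' actual mixing code: the function names, the toy sampling rate and the toy impulse response are hypothetical, and it assumes that "the last 10s" means the final 10 s of the filtered signal once it has been truncated to the original source length.

```python
def convolve(signal, impulse_response):
    """Full linear convolution of two sequences (plain Python, O(N*M))."""
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for n, s in enumerate(signal):
        for k, h in enumerate(impulse_response):
            out[n + k] += s * h
    return out


def mix_and_keep_last(source, impulse_response, fs, keep_seconds=10):
    """Filter `source`, then keep only the last `keep_seconds` seconds.

    Assumption: the filtered signal is first truncated to the source
    length, so the filter tail after the end of the source is dropped.
    """
    filtered = convolve(source, impulse_response)[:len(source)]
    keep = int(keep_seconds * fs)
    return filtered[-keep:]


# Toy usage: an 11 s source at a deliberately tiny (hypothetical) rate
# of 4 Hz and a 3-tap filter, so the example runs instantly.
fs = 4
source = [1.0] * (11 * fs)
excerpt = mix_and_keep_last(source, [0.5, 0.25, 0.25], fs)
print(len(excerpt) / fs, "seconds kept")
```

Because the first second is discarded, every retained sample already contains the full filter response, which is one way to read "avoid border effects".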
+
+ __Note about dev1 and dev2__
+ The development data __dev1__ and __dev2__ have the same setup as that of __test1__.
+ The development set corresponding to __test2__ is not provided.
+
+ __Note about dev3__
+ The development data __dev3__ consists only of WAV audio files:
+ *dev3_<srcset>_<mixtype>_<reverb>_src_<j>.wav: mono source signal
+ *dev3_<srcset>_<mixtype>_<reverb>_<spacing>_sim_<j>.wav: stereo contribution of a source signal to the two mixture channels
+ *dev3_<srcset>_<mixtype>_<reverb>_<spacing>_mix.wav: stereo mixture signal
+
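The naming scheme listed above lends itself to automatic parsing. The following is a minimal sketch, assuming that no field contains an underscore and that <spacing> may be absent (as in the source-signal names); the regular expression, the function name and the example field values are illustrative assumptions, not part of the dataset documentation.

```python
import re

# Hypothetical pattern for names such as
#   <dataset>_<srcset>_<mixtype>_<reverb>[_<spacing>]_{src_<j>|sim_<j>|mix}.wav
# Assumption: individual fields never contain an underscore.
FILENAME_RE = re.compile(
    r"^(?P<dataset>[^_]+)_(?P<srcset>[^_]+)_(?P<mixtype>[^_]+)_(?P<reverb>[^_]+)"
    r"(?:_(?P<spacing>[^_]+))?"
    r"_(?P<kind>src|sim|mix)(?:_(?P<j>\d+))?\.wav$"
)


def parse_name(filename):
    """Split a data file name into its fields; raise ValueError if it does not fit."""
    match = FILENAME_RE.match(filename)
    if match is None:
        raise ValueError("unrecognized file name: " + filename)
    fields = match.groupdict()
    if fields["j"] is not None:
        fields["j"] = int(fields["j"])  # source index as an integer
    return fields


# The field values below are made up for illustration.
print(parse_name("dev3_female4_liverec_130ms_5cm_sim_1.wav"))
```

The pattern is deliberately permissive: it does not check that a source index is present for `src` and `sim` files or absent for `mix` files.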

__Licensing issue: __ These files are made available under the terms of the Creative Commons [http://creativecommons.org/licenses/by-nc-sa/2.0/|Attribution-NonCommercial-ShareAlike 2.0] license. The authors are Another Dreamer and Alex Q for music source signals and Hiroshi Sawada, Shoko Araki and Emmanuel Vincent for mixture signals.

@@ -Lines: 103-108 changed to +Lines: 118-124 @@

Each participant is asked to submit the results of his/her algorithm for tasks 2 and/or 3
- * over all or part of "test" and "test2".
* and over all or part of "dev2", if his/her algorithm was not previously submitted to the [http://www.irisa.fr/metiss/SASSEC07/|Stereo Audio Source Separation Evaluation Campaign] nor [http://sisec2008.wiki.irisa.fr/tiki-index.php?page=Under-determined+speech+and+music+mixtures|SiSEC2008], so as to assess improvements compared to that campaign.
+ * over all or part of "test", "test2" and "test3".
* over all or part of "dev2", if his/her algorithm was not previously submitted to the [http://www.irisa.fr/metiss/SASSEC07/|Stereo Audio Source Separation Evaluation Campaign] nor [http://sisec2008.wiki.irisa.fr/tiki-index.php?page=Under-determined+speech+and+music+mixtures|SiSEC2008], so as to assess improvements compared to that campaign.
*and all or part of "dev3".

The results for task 1 may also be submitted.

@@ -Lines: 110-115 changed to +Lines: 126-144 @@
In addition, each participant is asked to provide basic information about his/her algorithm (e.g. a bibliographical reference) and to declare its average running time, expressed in seconds per test excerpt and per GHz of CPU.
- "How to submit" will be announced by July, 2011.
+ !!!How to submit
Each participant should make his/her results available online in the form of a tarball called
<YourName>_<dataset>.zip.

The included files must be named as follows:
* <dataset>__<srcset>_<mixtype>_<reverb>_src_<j>.wav: Estimated source <j> for task 2. Mono WAV file sampled at 16 kHz.
* <dataset>__<srcset>_<mixtype>_<reverb>_sim_<j>.wav: Estimated spatial image of source <j> for task 3. Stereo (3ch for test3/dev3) WAV file sampled at 16 kHz.
* task1.txt: Estimated source numbers for task 1. The file's 1st column is the mixture label (dev1_<srcset>_<mixtype>_<reverb>_<spacing>_mix) and 2nd column is the estimated number of sources.

where <dataset> is one of test/test2/test3/dev2/dev3, <srcset> is a shortcut for the set of source signals, <mixtype> a shortcut for the mixture type, <reverb> the reverberation time and <spacing> the microphone spacing.

Each participant should then send an email to "araki.shoko (at) lab.ntt.co.jp" providing:
* contact information (name, affiliation)
* basic information about his/her algorithm, including its average running time (in seconds per test excerpt and per GHz of CPU) and a bibliographical reference if possible
* the URL of the tarball(s)
Note that the submitted audio files will be made available on a website under the terms of the Creative Commons [http://creativecommons.org/licenses/by-nc-sa/2.0/|Attribution-NonCommercial-ShareAlike 2.0] license.
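As an illustration of the submission conventions above, the sketch below builds the archive name and the contents of task1.txt. It is not an official tool: the helper names are hypothetical, and the column separator in task1.txt is not specified on this page, so a tab character is assumed here.

```python
# Dataset identifiers accepted for submission, as listed on this page.
VALID_DATASETS = {"test", "test2", "test3", "dev2", "dev3"}


def archive_name(your_name, dataset):
    """Return '<YourName>_<dataset>.zip' after checking the dataset identifier."""
    if dataset not in VALID_DATASETS:
        raise ValueError("unknown dataset: " + dataset)
    return f"{your_name}_{dataset}.zip"


def task1_text(estimates):
    """Format task 1 results: one 'mixture label <TAB> source count' line per mixture.

    `estimates` maps a mixture label to the estimated number of sources.
    Assumption: columns are separated by a tab.
    """
    lines = [f"{label}\t{int(count)}" for label, count in sorted(estimates.items())]
    return "\n".join(lines) + "\n"


# The participant name and the mixture label are made up for illustration.
print(archive_name("Doe", "test3"))
print(task1_text({"dev1_female3_inst_0ms_0cm_mix": 3}), end="")
```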

History

Date, User, Version
Mon 12 Dec 2011 06:11 CET, admin, 20 (current)
Mon 12 Dec 2011 06:11 CET, admin, 19
Thu 20 Oct 2011 02:54 CEST, admin, 18
Fri 09 Sep 2011 07:48 CEST, admin, 17
Fri 01 Jul 2011 03:51 CEST, admin, 16
Thu 30 Jun 2011 07:15 CEST, admin, 15
Thu 30 Jun 2011 07:14 CEST, admin, 14
Thu 30 Jun 2011 07:13 CEST, admin, 13
Thu 30 Jun 2011 04:46 CEST, admin, 12
Thu 30 Jun 2011 04:46 CEST, admin, 11
Thu 30 Jun 2011 04:44 CEST, admin, 10
Thu 30 Jun 2011 04:34 CEST, admin, 9
