History: Two-channel mixtures of speech and real-world background noise
Comparison of version 83 with version 89
@@ -Lines: 3-6 changed to +Lines: 3-11 @@
This task aims at evaluating source separation and denoising techniques in the context of speech enhancement by merging two datasets: the [http://sisec2010.wiki.irisa.fr/tiki-index.php?page=Source+separation+in+the+presence+of+real-world+background+noise|SiSEC 2010 noisy speech dataset] and the [http://www.dcs.shef.ac.uk/spandh/chime/PCC/datasets.html|CHiME corpus]. Both datasets consist of two-channel mixtures of one speech source and real-world background noise, so that algorithms applicable to one dataset are applicable to the other without additional effort. The source separation results obtained over the latter dataset will be analyzed in light of the speech recognition results obtained over the same dataset as part of the [http://www.dcs.shef.ac.uk/spandh/chime/challenge.html|CHiME Challenge].
+
+
+ !!Results
+
+ __See the results over [http://www.irisa.fr/metiss/SiSEC11/noise/results_test.html|test] and [http://www.irisa.fr/metiss/SiSEC11/noise/results_dev.html|development] data__

@@ -Lines: 62-65 changed to +Lines: 67-72 @@
* -+dev_<env>_<cond>_<take>_DOA.txt+-: DOA of the speech source (see the [http://sisec2010.wiki.irisa.fr/tiki-index.php?page=Source+separation+in+the+presence+of+real-world+background+noise|SiSEC 2010 wiki] for the convention adopted to measure DOA)
Since the source DOAs were measured geometrically in the -+Su+- and -+Ca+- environments, they might contain a measurement error of up to a few degrees; by contrast, there is no such error in the -+Sq+- environment.
+
+ The mixtures dev_Ca1_Co_A_mix.wav and dev_Ca1_Co_B_mix.wav are identical (this is a mistake that will be corrected in future evaluations). Entrants wishing to exploit the context of each sentence in the domestic environment database can also __download the corresponding [http://www.irisa.fr/metiss/SiSEC11/noise/dev_embedded.zip|5 min recordings] (86 MB)__ (same nomenclature as above).

@@ -Lines: 81-85 changed to +Lines: 88-92 @@
* [http://sisec2008.wiki.irisa.fr/tiki-download_file.php?fileId=1|stft_multi.m]: multichannel STFT
* [http://sisec2008.wiki.irisa.fr/tiki-download_file.php?fileId=9|istft_multi.m]: multichannel inverse STFT
- * [http://sisec2011.wiki.irisa.fr/tiki-download_userfile.php?fileId=3|example_denoising.m]: TDOA estimation by GCC-PHATmax, ML target and noise variance estimation under a diffuse noise model, and multichannel Wiener filtering
+ * [http://sisec2011.wiki.irisa.fr/tiki-download_file.php?fileId=3|example_denoising.m]: TDOA estimation by GCC-PHATmax, ML target and noise variance estimation under a diffuse noise model, and multichannel Wiener filtering
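For readers who want a feel for the first step of that pipeline, the sketch below shows a plain GCC-PHAT time-difference-of-arrival estimate between the two channels, written in Python with NumPy. It is only an illustration of the general technique under stated assumptions; the exact variant, windowing and parameter settings of -+example_denoising.m+- may differ, and the function name and arguments below are not part of the challenge material.

{CODE()}
import numpy as np

def gcc_phat_tdoa(x1, x2, fs, max_tau=None):
    """Estimate the time difference of arrival (seconds) between two
    channels by picking the lag that maximizes the phase-transformed
    cross-correlation (illustrative sketch only)."""
    n = len(x1) + len(x2)
    n_fft = int(2 ** np.ceil(np.log2(n)))
    X1 = np.fft.rfft(x1, n_fft)
    X2 = np.fft.rfft(x2, n_fft)
    # Phase transform: keep only the phase of the cross-spectrum
    cross = X1 * np.conj(X2)
    cross /= np.abs(cross) + 1e-12
    cc = np.fft.irfft(cross, n_fft)
    # Re-centre the correlation so that lag 0 sits in the middle
    max_shift = n_fft // 2
    if max_tau is not None:
        max_shift = min(int(fs * max_tau), max_shift)
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    lag = np.argmax(np.abs(cc)) - max_shift
    return lag / fs
{CODE}

The DOA can then be recovered from the estimated TDOA and the microphone spacing by simple trigonometry.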
Due to the specific construction of the dataset, at least four strategies may be employed to process the domestic environment mixtures:

@@ -Lines: 103-110 changed to +Lines: 110-117 @@
For the domestic environment dataset, the CHiME file naming convention is also acceptable.
- Each participant should then send an email to "shoko (at) lab.ntt.co.jp", "nesta (a) fbk.eu" and "emmanuel.vincent (at) inria.fr" providing:
o contact information (name, affiliation)
o basic information about his/her algorithm, including its average running time (in seconds per test excerpt and per GHz of CPU) and a bibliographical reference if possible
o the URL of the tarball
+ Each participant should then send an email to "araki.shoko (at) lab.ntt.co.jp", "nesta (a) fbk.eu" and "emmanuel.vincent (at) inria.fr" providing:
* contact information (name, affiliation)
* basic information about his/her algorithm, including the __employed processing strategy__ among the four strategies outlined above, its average running time (in seconds per test excerpt and per GHz of CPU) and a bibliographical reference if possible
* the URL of the tarball

The submitted audio files will be made available on a website under the terms of the Licensing section below.

@@ -Lines: 117-121 changed to +Lines: 124-128 @@
The estimated speaker DOAs in task 1 will be evaluated in terms of absolute difference with the true DOAs.
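For illustration only, this criterion amounts to a simple absolute angular difference; a minimal sketch, assuming DOAs expressed in degrees and taking the difference modulo 360°, is given below (the actual angle convention is the one defined on the SiSEC 2010 wiki).

{CODE()}
def doa_error(estimated_deg, true_deg):
    """Absolute DOA estimation error in degrees (illustrative sketch;
    the angle convention follows the SiSEC 2010 wiki)."""
    diff = abs(estimated_deg - true_deg) % 360.0
    return min(diff, 360.0 - diff)
{CODE}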
- The estimated speech signals in task 2 will be evaluated via the energy ratio criteria defined in the [http://bass-db.gforge.inria.fr/bss_eval/|BSS_EVAL] toolbox allowing arbitrary filtering between the estimated source and the true source and via the perceptually-motivated criteria in the [http://bass-db.gforge.inria.fr/peass/PEASS-Software.html|PEASS] toolkit.
+ The estimated speech signals in task 2 will be evaluated via the energy ratio criteria defined in the [http://bass-db.gforge.inria.fr/bss_eval/|BSS_EVAL] toolbox allowing arbitrary filtering between the estimated source and the true source.
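As a rough cross-check outside the official MATLAB tools, the same energy ratio criteria are also available in Python through the mir_eval reimplementation of BSS_EVAL; the sketch below assumes single-channel reference and estimated speech signals of equal length, and is not a substitute for the official evaluation scripts.

{CODE()}
import numpy as np
from mir_eval.separation import bss_eval_sources

def energy_ratio_scores(reference, estimate):
    """Compute SDR/SIR/SAR with the BSS_EVAL criteria (mir_eval port).

    reference, estimate: 1-D NumPy arrays of equal length.
    Illustrative cross-check only; note that with a single target
    source the SIR figure is not informative.
    """
    sdr, sir, sar, _ = bss_eval_sources(
        reference[np.newaxis, :], estimate[np.newaxis, :])
    return sdr[0], sir[0], sar[0]
{CODE}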
The estimated speech and noise spatial image signals in task 3 will be evaluated via the energy ratio criteria introduced for the [http://www.irisa.fr/metiss/SASSEC07/?show=criteria|Stereo Audio Source Separation Evaluation Campaign] and via the perceptually-motivated criteria in the [http://bass-db.gforge.inria.fr/peass/PEASS-Software.html|PEASS] toolkit.

@@ -Lines: 124-133 changed to +Lines: 131-140 @@
The above performance criteria and benchmarks are respectively implemented in
- * [http://sisec2011.wiki.irisa.fr/tiki-download_userfile.php?fileId=2|bss_eval_source_denoising.m]
* [http://sisec2011.wiki.irisa.fr/tiki-download_userfile.php?fileId=1|bss_eval_images_nosort.m]
+ * [http://sisec2011.wiki.irisa.fr/tiki-download_file.php?fileId=2|bss_eval_source_denoising.m]
* [http://sisec2011.wiki.irisa.fr/tiki-download_file.php?fileId=1|bss_eval_images_nosort.m]
* [http://bass-db.gforge.inria.fr/peass/PEASS-Software-v1.0.zip|PEASS]
* [http://sisec2008.wiki.irisa.fr/tiki-download_file.php?fileId=15|sep_ibm.m]
* [http://sisec2008.wiki.irisa.fr/tiki-download_file.php?fileId=16|Cochleagram Toolbox]
- An example use is given in [http://sisec2011.wiki.irisa.fr/tiki-download_userfile.php?fileId=3|example_denoising.m].
+ An example use is given in [http://sisec2011.wiki.irisa.fr/tiki-download_file.php?fileId=3|example_denoising.m].
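The -+sep_ibm.m+- benchmark appears to rely on ideal binary masking over a cochleagram representation (it is listed alongside the Cochleagram Toolbox). A minimal, hypothetical Python sketch of the same oracle idea in the STFT domain is shown below; the 1024-sample window and 0 dB local SNR threshold are assumptions, not the settings of -+sep_ibm.m+-, and the oracle setting requires the true speech and noise signals, which are only available for development data.

{CODE()}
import numpy as np
from scipy.signal import stft, istft

def ideal_binary_mask(speech, noise, fs, threshold_db=0.0):
    """Oracle denoising by ideal binary masking (illustrative sketch).

    The official sep_ibm.m benchmark works on a cochleagram; an STFT
    is used here for simplicity and the 0 dB threshold is an assumption.
    """
    _, _, S = stft(speech, fs, nperseg=1024)
    _, _, N = stft(noise, fs, nperseg=1024)
    # Keep time-frequency bins where the local speech-to-noise ratio
    # exceeds the threshold
    local_snr_db = 20 * np.log10(np.abs(S) + 1e-12) \
                 - 20 * np.log10(np.abs(N) + 1e-12)
    mask = (local_snr_db > threshold_db).astype(float)
    # Apply the mask to the mixture STFT (STFT is linear, so S + N
    # equals the STFT of the noisy mixture)
    _, enhanced = istft(mask * (S + N), fs, nperseg=1024)
    return enhanced
{CODE}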