ISCApad Archive » 2021 » ISCApad #273 » Events » Other Events » (2021-08-30) Interspeech 2021 Special Session on the Multilingual and code-switching ASR challenges for low resource Indian languages, Brno, Czechia. |
ISCApad #273 |
Thursday, March 11, 2021 by Chris Wellekens |
Announcing the Multilingual and code-switching ASR challenges for low resource Indian languages - Interspeech 2021 Special Session
Recently, there have been increasing interests in multilingual automatic speech recognition (ASR) where a speech recognition system is built to cater to multiple low resource languages by taking advantage of low amount of labeled corpora in multiple languages. On the other hand, with multilingualism becoming common in today?s world, there has been increasing interest in code-switching ASR as well. In code-switching, multiple languages are freely interchanged within a single sentence or between sentences. The success of low-resource multilingual and code-switching ASR often depends on the variety of languages in terms of their acoustics, linguistic characteristics as well as amount of data available and how these are carefully considered in building the ASR system. In this challenge, we would like to focus on building multilingual and code-switching ASR systems through two different sub-tasks related to a total of seven Indian languages with constraints on the data available for acoustic modeling and language modeling.
Sub-task1
This sub-task involves building a multilingual ASR system in six languages, namely, Hindi, Marathi, Odia, Telugu, Tamil, and Gujarati. The blind test set will comprise recordings from a subset (or all) of these six languages
Sub-task2
This sub-task involves building a code-switching ASR system separately for Hindi-English and Bengali-English code-switched pairs. The blind test set will comprise recordings from these two code-switched language pairs.
Submissions to this special session should show results on one or both of the above mentioned tasks. Submissions on any topic related to building multilingual code-switching ASR are welcome. This includes (but is not limited to):
Organizers:
Kalika Bali (Mircosoft Research)
Prasanta Kumar Ghosh (IISc Bangalore)
Raoul Nanavati (Navana Tech.)
Jai Nanavati (Navana Tech.)
Sirnivasa Raghavan (Navana Tech.)
Vivek Seshadri (Microsoft Research)
Preethi Jyothi (IIT Bombay)
Sunita Sarawagi (IIT Bombay)
Samarth Bharadwaj (IBM Research)
Ashish Mittal (IIT Bombay & IBM Research)
Shreya Khare (IBM Research)
For more details and participation, please visit
https://navana-tech.github.io/IS21SS-indicASRchallenge/
Timeline
February 2, 2021 - Registration for the challenge opens
February 10, 2021 - Release training & test data
February 15, 2021 - Release baseline recipe
February 28, 2021 - Release blind test audio to participants
March 2, 2021 - Test trial upload begins
March 26, 2021 - Abstract submission deadline
March 26, 2021 - Final test trial upload deadline
April 2, 2021 - Interspeech final paper upload deadline
June 15, 2021 - Camera ready paper deadline
For any questions, write to
is21ss.indicasrchallenge@gmail.com
|
Back | Top |