ISCA - International Speech
Communication Association


ISCApad Archive  »  2021  »  ISCApad #275  »  Events  »  Other Events  »  (2021-08-30) Interspeech 2021 Special Session on the Multilingual and code-switching ASR challenges for low resource Indian languages, Brno, Czechia.

ISCApad #275

Thursday, May 13, 2021 by Chris Wellekens

3-3-21 (2021-08-30) Interspeech 2021 Special Session on the Multilingual and code-switching ASR challenges for low resource Indian languages, Brno, Czechia.
  
Announcing the Multilingual and code-switching ASR challenges for low resource Indian languages - Interspeech 2021 Special Session
 
Recently, there have been increasing interests in multilingual automatic speech recognition (ASR) where a speech recognition system is built to cater to multiple low resource languages by taking advantage of low amount of labeled corpora in multiple languages. On the other hand, with multilingualism becoming common in today?s world, there has been increasing interest in code-switching ASR as well. In code-switching, multiple languages are freely interchanged within a single sentence or between sentences. The success of low-resource multilingual and code-switching ASR often depends on the variety of languages in terms of their acoustics, linguistic characteristics as well as amount of data available and how these are carefully considered in building the ASR system. In this challenge, we would like to focus on building multilingual and code-switching ASR systems through two different sub-tasks related to a total of seven Indian languages with constraints on the data available for acoustic modeling and language modeling.
 
Sub-task1
This sub-task involves building a multilingual ASR system in six languages, namely, Hindi, Marathi, Odia, Telugu, Tamil, and Gujarati. The blind test set will comprise recordings from a subset (or all) of these six languages
 
Sub-task2
This sub-task involves building a code-switching ASR system separately for Hindi-English and Bengali-English code-switched pairs. The blind test set will comprise recordings from these two code-switched language pairs.
 
Submissions to this special session should show results on one or both of the above mentioned tasks. Submissions on any topic related to building multilingual code-switching ASR are welcome. This includes (but is not limited to):
 
  • Acoustic modeling for multilingual ASR models
  • Language modeling for multilingual ASR models
  • Multilingual ASR model for code-switching
  • Language modeling for code-switching
  • Linguistically informed models for code-switching
 
 
Organizers:
Kalika Bali (Mircosoft Research)
Prasanta Kumar Ghosh (IISc Bangalore)
Raoul Nanavati (Navana Tech.)
Jai Nanavati (Navana Tech.)
Sirnivasa Raghavan (Navana Tech.)
Vivek Seshadri (Microsoft Research)
Preethi Jyothi (IIT Bombay)
Sunita Sarawagi (IIT Bombay)
Samarth Bharadwaj (IBM Research)
Ashish Mittal (IIT Bombay & IBM Research)
Shreya Khare (IBM Research)
 
 
For more details and participation, please visit
https://navana-tech.github.io/IS21SS-indicASRchallenge/
 
Timeline
February 2, 2021 - Registration for the challenge opens
February 10, 2021 - Release training & test data
February 15, 2021 - Release baseline recipe
February 28, 2021 - Release blind test audio to participants
March 2, 2021 - Test trial upload begins
March 26, 2021 - Abstract submission deadline
March 26, 2021 - Final test trial upload deadline
April 2, 2021 - Interspeech final paper upload deadline
June 15, 2021 - Camera ready paper deadline
 
 
For any questions, write to
is21ss.indicasrchallenge@gmail.com
 

Back  Top


 Organisation  Events   Membership   Help 
 > Board  > Interspeech  > Join - renew  > Sitemap
 > Legal documents  > Workshops  > Membership directory  > Contact
 > Logos      > FAQ
       > Privacy policy

© Copyright 2024 - ISCA International Speech Communication Association - All right reserved.

Powered by ISCA