ISCA - International Speech
Communication Association



ISCApad #264

Wednesday, June 10, 2020 by Chris Wellekens

4-10 FEARLESS STEPS Challenge Phase-2 for ISCA INTERSPEECH-2020
  
Announcing the FEARLESS STEPS Challenge Phase-2 for ISCA INTERSPEECH-2020 (FS#2: taking the next step!)

The Fearless Steps Initiative by UTDallas-CRSS led to the digitization, recovery, and diarization of 19,000 hours of original analog audio data, as well as the development of algorithms to extract meaningful information from this multi-channel naturalistic data resource. As an initial step to motivate a streamlined and collaborative effort from the speech and language community, UTDallas-CRSS is hosting a series of progressively complex tasks to promote advanced research on naturalistic "Big Data" corpora. This began with ISCA INTERSPEECH-2019: 'The FEARLESS STEPS Challenge: Massive Naturalistic Audio (FS#1)'. The first edition of the challenge encouraged the development of core unsupervised/semi-supervised speech and language systems for single-channel data with low resource availability, serving as the "First Step" towards extracting high-level information from such massive unlabeled corpora.
As a natural progression following the successful inaugural Challenge FS#1, the FEARLESS STEPS Challenge Phase-2 (FS#2) focuses on the development of single-channel supervised learning strategies. FS#2 provides 80 hours of ground-truth data through Training and Development sets, with an additional 20 hours of blind-set Evaluation data. Based on feedback from the Fearless Steps participants, additional Tracks for streamlined speech recognition and speaker diarization have been included in FS#2. The results for this Challenge will be presented at the ISCA INTERSPEECH-2020 Special Session. We encourage participants to explore any and all research tasks of interest with the Fearless Steps Corpus, with suggested Task Domains listed below. Research participants can, however, also use the FS#2 corpus to explore additional problems dealing with naturalistic data, which we welcome as part of the special session.

TIMELINE:

    Challenge Start Date (Data Release): February 5th, 2020

    Deadline for INTERSPEECH-2020 papers dealing with FEARLESS STEPS: March 30, 2020

Challenge Tasks in Phase-2 (FS#2):

 

1. Speech Activity Detection (SAD)

2. Speaker Identification using Speaker Segments (SID)

3. Speaker Diarization:

      3a. Track 1: Diarization using system SAD (SD_track1)

      3b. Track 2: Diarization using reference SAD (SD_track2)

4. Automatic Speech Recognition (ASR):

      4a. Track 1: ASR using system Diarization/SAD (ASR_track1)

      4b. Track 2: ASR using Diarized Segments (ASR_track2)

Dataset Download and Registration link for the Challenge: https://bit.ly/2qZ5tic

A link for downloading the Challenge Corpus will appear once the form is submitted. The README file in the Download folder contains the Challenge Rules, Guidelines, and the details needed to get started with the data and challenges.

Researchers who register through the above link will be informed of any updates regarding the Challenge by email from FearlessSteps@utdallas.edu.

More details regarding the data and the Challenge will be posted on the website:

https://fearless-steps.github.io/ChallengePhase2/

