ISCApad #260

Monday, February 10, 2020 by Chris Wellekens

3-3-32 (2020-03-31) The Lifelong learning Speaker Diarization Challenge 2020, Valladolid, Spain

The Lifelong learning Speaker Diarization Challenge 2020
https://lium.univ-lemans.fr/allies-evaluation/
*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*

CHALLENGE TASKS
______________________________________________________________________________________
The ALLIES evaluation focuses on freeing speaker diarization
systems from the need for intervention by machine learning experts, through two tasks:

* Diarization across time:
automatic systems use the stream of incoming data
to update their knowledge and adapt to new data in order
to sustain performance across time.

* Lifelong learning speaker diarization:
systems are evaluated while processing a sequence
of incoming audio documents, with or without
human-assisted learning, including active learning (system initiative)
or interactive learning (user initiative).

Details are provided in the evaluation plan.

The ALLIES corpus is a new collection of audio-visual
documents (news, debates, talk shows) including more
than 200 hours of TV shows from the French channel LCP.

BACKGROUND
______________________________________________________________________________________
Speech segmentation with speaker clustering, referred
to as speaker diarization, is a key pre-processing step
for several speech technologies including enriched automatic
speech recognition (ASR) or spoken document retrieval (SDR)
in very large multimedia repositories. The base accuracy of
such systems is essential for applications to perform
adequately in real-world environments.
Performance of such systems usually degrades across time as
the distribution of incoming data moves away from the initial
training data (changes in accents, in recording conditions, etc.).
Thus, sustaining system performance across time requires frequent
interventions by machine learning experts, which makes the maintenance
of such systems very costly.

SCHEDULE
______________________________________________________________________________________
From now until 31st of March: Registration for participants is open
1st of March: Release of Development data
1st of March: Beat platform is open for development
1st of June: Evaluation data is released
30th of June: Final submission
End of July: Paper submission deadline
November: Iberspeech conference in Valladolid

REGISTRATION
______________________________________________________________________________________
Send an email to:

lifelong-speaker-evaluation@univ-lemans.fr

and specify
* the task(s) you wish to participate in
* the name of your team, which can be the name of your organization or an anonymous identifier

More details are available in the evaluation plan.

EVALUATION PLAN
______________________________________________________________________________________
The evaluation plan and license agreement of the dataset can be downloaded here:

https://lium.univ-lemans.fr/allies-evaluation/

ORGANIZERS
______________________________________________________________________________________
Anthony Larcher, Le Mans Université, France
Olivier Galibert, LNE, France
Andre Anjos, IDIAP, Switzerland
Marta Ruiz Costa, UPC, Spain
Loïc Barrault, Sheffield University, UK
