ISCA - International Speech
Communication Association


ISCApad Archive  »  2024  »  ISCApad #316  »  Resources  »  Database  »  ELRA - ALLIES Corpus for the evaluation of Broadcast News

ISCApad #316

Thursday, October 10, 2024 by Chris Wellekens

5-2-2 ELRA - ALLIES Corpus for the evaluation of Broadcast News
  

Press Release - Immediate

Paris, France, October 2, 2024


ELDA releases the ALLIES Corpus for the evaluation of Broadcast News across time diarization systems

ELDA is pleased to announce the release of the ALLIES Corpus, as a result from the collaboration between the University of Le Mans (France) and ELDA. Overall, the ALLIES Corpus contains about 900 hours of news broadcast, including orthographic transcriptions, speaker annotations and segmentation. This corpus is based on the material that was used for the ESTER, REPERE and ETAPE evaluation packages (see ELRA Catalogue: http://catalogue.elra.info for respective packages). 

The ALLIES corpus was built as an extension of the previous produced corpora. It contains corrected annotations from the previous evaluation materials as well as new audio data with corresponding transcriptions. Corrections include corrected names of speakers and re-segmentation.


The dataset can be found in the ELRA Catalogue of Language Resources under the following links and references: ALLIES CorpusISLRN: 397-116-696-859-2

For more information and/or questions, please write to contact@elda.org.


About ALLIES project

The ALLIES Corpus was produced within the European CHIST-Era project ALLIES (Autonomous Lifelong learnIng intelLigent Systems). The ALLIES project enabled to carry out a campaign for the evaluation of Broadcast News across time diarization systems using French data. This project is an extension of the previous ESTER, REPERE and ETAPE evaluation campaigns that were carried out for the French language in this field. The University of Le Mans (France) and ELDA collaborated on the production of this new corpus.

To find out more about ALLIES project, please visit: https://www.chistera.eu/projects/allies


About ELDA


The Evaluation and Language resources Distribution Agency (ELDA) identifies, collects, markets, and distributes language resources, along with the dissemination of general information in the field of Human Language Technologies (HLT). ELDA has considerable knowledge and skills in HLT applications. ELDA is part of major French, European and international projects in the field of HLT.

To find out more about ELDA, please visit our web site: http://www.elda.org/


 




 

 

Back  Top


 Organisation  Events   Membership   Help 
 > Board  > Interspeech  > Join - renew  > Sitemap
 > Legal documents  > Workshops  > Membership directory  > Contact
 > Logos      > FAQ
       > Privacy policy

© Copyright 2024 - ISCA International Speech Communication Association - All right reserved.

Powered by ISCA