ISCA - International Speech
Communication Association


ISCApad Archive  »  2015  »  ISCApad #199  »  Resources  »  Database  »  ELRA - Language Resources Catalogue - Update (2014-09)

ISCApad #199

Sunday, January 18, 2015 by Chris Wellekens

5-2-1 ELRA - Language Resources Catalogue - Update (2014-09)
  
ELRA - Language Resources Catalogue - Update
*****************************************************************
 
We are happy to announce that 1 new Speech Resource and 3 new Written Corpora are now available in our catalogue.  
 
ELRA-S0371 PortMedia French and Italian corpus
This corpus contains 700 transcribed dialogues from about 140 French speakers and 604 transcribed dialogues from about 150 Italian speakers (several dialogues per speaker). The method chosen for the corpus construction process is that of a ‘Wizard of Oz’ (WoZ) system. This consists of simulating a natural language man-machine dialogue. The scenario was built in the domain of touristic information and reservation. A manual transcription and semantic annotation of the corpus are provided with corresponding wave files.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1224&language=en
 
ELRA-W0078 NE3L named entities Arabic corpus
The Arabic corpus contains 103,363 words coming from articles extracted from “Le Monde Diplomatique” newspaper, and published in 2004. 2 named entity categories were taken into account: Time and Amount.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1226&language=en
 
ELRA-W0079 NE3L named entities Chinese corpus
The Chinese corpus contains 79,302 words coming from articles extracted from “Le Monde Diplomatique” newspaper, and published in 2001. 3 named entity categories were taken into account: Person, Place and Organisation.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1227&language=en
 
ELRA-W0080 NE3L named entities Russian corpus
The Russian corpus contains 75,784 words coming from articles extracted from “Izvestia” newspaper, and published in 1995. 2 named entity categories were taken into account: Time and Amount.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1228&language=en

For more information on the catalogue, please contact Valérie Mapelli mailto:mapelli@elda.org
 

Visit our On-line Catalogue: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info
Archives of ELRA Language Resources Catalogue Updates: http://www.elra.info/LRs-Announcements.html
 
 


Back  Top


 Organisation  Events   Membership   Help 
 > Board  > Interspeech  > Join - renew  > Sitemap
 > Legal documents  > Workshops  > Membership directory  > Contact
 > Logos      > FAQ
       > Privacy policy

© Copyright 2024 - ISCA International Speech Communication Association - All right reserved.

Powered by ISCA