ISCA - International Speech
Communication Association


ISCApad Archive  »  2021  »  ISCApad #277  »  Resources  »  Database  »  ELRA - Language Resources Catalogue - Update (June 2021)

ISCApad #277

Saturday, July 10, 2021 by Chris Wellekens

5-2-2 ELRA - Language Resources Catalogue - Update (June 2021)
  

We are happy to announce that 6 new written corpora and 8 new bilingual dictionaries are now available in our catalogue.

ELRA-W0310 Monolingual Vietnamese Annotated Corpus
ISLRN: 004-081-406-421-7
The Monolingual Vietnamese Annotated Corpus consists of 100,000 sentences, manually annotated with word boundaries, POS, named entities, with an average length of 20 words per sentence. The corpus is provided in XML format and is annotated according to TEI-encoding guidelines.
For more information, see: http://catalog.elra.info/en-us/repository/browse/ELRA-W0310/

ELRA-W0311 English-Vietnamese Parallel Corpus
ISLRN: 893-470-491-825-6
The English-Vietnamese Parallel Corpus consists of 1,000,000 sentence pairs, with an average length of 20 words per sentence. The corpus is provided in XML format and is annotated according to TEI-encoding guidelines.
For more information, see: http://catalog.elra.info/en-us/repository/browse/ELRA-W0311/


ELRA-W0312 Chinese-Vietnamese Parallel Corpus
ISLRN: 128-772-037-486-0
The Chinese-Vietnamese Parallel Corpus consists of 200,000 sentence pairs, with an average length of 15 words per sentence. The corpus is provided in XML format and is annotated according to TEI-encoding guidelines.
For more information, see: http://catalog.elra.info/en-us/repository/browse/ELRA-W0312/


ELRA-W0313 Korean-Vietnamese Parallel Corpus
ISLRN: 365-128-449-700-7
The Korean-Vietnamese Parallel Corpus consists of 200,000 sentence pairs, with an average length of 15 words per sentence. The corpus is provided in XML format and is annotated according to TEI-encoding guidelines.
For more information, see: http://catalog.elra.info/en-us/repository/browse/ELRA-W0313/

ELRA-W0314 English-Chinese-Vietnamese Trilingual Parallel Corpus
ISLRN: 637-630-726-817-9
The English-Chinese-Vietnamese Trilingual Parallel Corpus consists of 20,046 trilingual sets of sentence pairs. The corpus is provided in XML format and is annotated according to TEI-encoding guidelines.
For more information, see: http://catalog.elra.info/en-us/repository/browse/ELRA-W0314/


ELRA-W0315 Persian Ezafe Construction Dataset
ISLRN: 663-014-610-121-2
This database includes gold Ezafe tags in almost 30 thousand Persian sentences. The sentences were manually annotated by six annotators who where all native Persian speakers and linguists.
For more information, see: http://catalog.elra.info/en-us/repository/browse/ELRA-W0315/

ELRA-M0078 English-Vietnamese Dictionary
ISLRN: 853-782-057-600-0
The English-Vietnamese Dictionary consists of 125,000 entries containing the following information: phonetics (using IPA), morphology, grammar, semantics, pragmatics and examples for the source language only. The dictionary is provided in XML format.
For more information, see: http://catalog.elra.info/en-us/repository/browse/ELRA-M0078/

ELRA-M0079 Vietnamese-English Dictionary
ISLRN: 747-175-261-587-4
The Vietnamese-English Dictionary consists of 156,000 entries containing the following information: phonetics (using IPA), morphology, grammar, semantics, pragmatics and examples for source language only. The dictionary is provided in XML format.
For more information, see: http://catalog.elra.info/en-us/repository/browse/ELRA-M0079/

ELRA-M0080 Chinese-Vietnamese Dictionary
ISLRN: 120-577-487-890-2
The Chinese-Vietnamese Dictionary consists of 52,470 entries containing the following information: phonetics (using IPA), morphology, grammar, semantics, pragmatics and examples. The dictionary is provided in XML format.
For more information, see: http://catalog.elra.info/en-us/repository/browse/ELRA-M0080/


ELRA-M0081 Vietnamese-Chinese Dictionary
ISLRN: 481-792-486-258-2
The Vietnamese-Chinese Dictionary consists of 50,911 entries containing the following information: phonetics (using IPA), morphology, grammar, semantics, pragmatics and examples for the source language only. The dictionary is provided in XML format.
For more information, see: http://catalog.elra.info/en-us/repository/browse/ELRA-M0081/

ELRA-M0082 Japanese-Vietnamese Dictionary
ISLRN: 056-033-674-079-4
The Japanese-Vietnamese Dictionary consists of 59,369 entries containing the following information: phonetics (using IPA), morphology, grammar, semantics, pragmatics and examples for source language only. The dictionary is provided in XML format.
For more information, see: http://catalog.elra.info/en-us/repository/browse/ELRA-M0082/

ELRA-M0083 Vietnamese-Japanese Dictionary

ISLRN: 719-247-130-680-9
The Vietnamese-Japanese Dictionary consists of 65,000 entries containing the following information: phonetics (using IPA), morphology, grammar, semantics, pragmatics and examples available for source language only. The dictionary is provided in XML format.
For more information, see: http://catalog.elra.info/en-us/repository/browse/ELRA-M0083/

ELRA-M0084 Korean-Vietnamese Dictionary

ISLRN: 409-454-902-511-3
The Korean-Vietnamese Dictionary consists of 37,678 entries containing the following information: phonetics (using IPA), morphology, grammar, semantics, pragmatics and examples available only for source language. The dictionary is provided in XML format.
For more information, see: http://catalog.elra.info/en-us/repository/browse/ELRA-M0084/


ELRA-M0085 Vietnamese-Korean Dictionary
ISLRN: 349-337-956-980-9
The Vietnamese-Korean Dictionary consists of 27,449 entries containing the following information: phonetics (using IPA), morphology, grammar, semantics, pragmatics and examples available only for the source language. The dictionary is provided in XML format.
For more information, see: http://catalog.elra.info/en-us/repository/browse/ELRA-M0085/

For more information on the catalogue, please contact Valérie Mapelli mailto:mapelli@elda.org


If you would like to enquire about having your resources distributed by ELRA, please do not hesitate to contact us.

Visit our On-line Catalogue: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info
Archives of ELRA Language Resources Catalogue Updates: http://www.elra.info/en/catalogues/language-resources-announcements





 












Back  Top


 Organisation  Events   Membership   Help 
 > Board  > Interspeech  > Join - renew  > Sitemap
 > Legal documents  > Workshops  > Membership directory  > Contact
 > Logos      > FAQ
       > Privacy policy

© Copyright 2024 - ISCA International Speech Communication Association - All right reserved.

Powered by ISCA