ISCA - International Speech Communication Association

ISCApad #240

Tuesday, June 12, 2018 by Chris Wellekens

4-8 ELRA and LDC partner on a joint distribution of Language Resources

Press Release - Immediate
Paris, France, January 24, 2018

ELRA and LDC are pleased to announce the release of
1) three corpora from the 2007 CoNLL Shared Task, consisting of dependency treebanks, used as part of the CoNLL 2007 shared task on multi-lingual dependency parsing and the domain adaptation track. The source data in the treebanks consists for the most part of various texts (e.g., textbooks, news, literature) annotated in dependency format, and
2) one TRAD Chinese-French Parallel Text, which includes French translations of a subset of approximately 10,000 Chinese words from LDC's GALE Phase 1 Chinese Blog Parallel.

These packages can be found in the ELRA and LDC catalogues under the following references:

2007 CoNLL Shared Task - Basque, Catalan, Czech & Turkish
ISLRN: 769-620-932-723-2
ELRA ID: ELRA-W0121
LDC ID: LDC2018T06

2007 CoNLL Shared Task - Greek, Hungarian and Italian
ISLRN: 270-733-242-642-3
ELRA ID: ELRA-W0122
LDC ID: LDC2018T07

2007 CoNLL Shared Task - Arabic & English
ISLRN: 505-782-255-628-8
ELRA ID: ELRA-W0123
LDC ID: LDC2018T08

TRAD Chinese-French Parallel Text - Blog
ISLRN: 713-266-631-883-0
ELRA ID: ELRA-W0125
LDC ID: LDC2018T02

*** About CoNLL and the 2007 shared task ***
The Conference on Computational Natural Language Learning (CoNLL) is accompanied every year by a shared task intended to promote NLP applications and evaluate them in a standard setting. In 2007, the shared task was devoted to the parsing of syntactic dependencies using corpora from up to thirteen languages. The task aimed to define and extend the then-current state of the art in dependency parsing, a technology that complemented previous tasks by producing a different kind of syntactic description of input text. The 2007 shared task also included a domain adaptation track for English.
More on CoNLL and its shared tasks: http://www.signll.org/conll/ and http://www.conll.org/previous-tasks/

*** About the PEA-TRAD project ***
PEA-TRAD (Translation as a Support for Document Analysis) is a project supported by the French Ministry of Defense (DGA) whose purpose was to develop speech-to-speech translation technology for multiple languages (e.g., Arabic, Chinese, Pashto) from a variety of domains.
More on the PEA TRAD project: http://www.elra.info/en/projects/archived-projects/pea-trad/.

*** About ELRA ***
The European Language Resources Association (ELRA) is a non-profit-making organisation founded by the European Commission in 1995, with the mission of providing a clearing house for language resources and promoting Human Language Technologies (HLT).
To find out more about ELRA and its respective catalogue, please visit: http://www.elra.info and http://catalogue.elra.info

*** About LDC ***
The Linguistic Data Consortium (LDC) is an open consortium of universities, libraries, corporations and research laboratories that creates and distributes linguistic resources for language-related education, research and technology development.
To find out more about LDC and its respective catalogue, please visit: https://www.ldc.upenn.edu and https://catalog.ldc.upenn.edu
