ISCA - International Speech
Communication Association


ISCApad Archive  »  2018  »  ISCApad #236  »  Resources  »  Database  »  Speechocean – update (January 2017)

ISCApad #236

Saturday, February 10, 2018 by Chris Wellekens

5-2-13 Speechocean – update (January 2017)
  

 

Speechocean – update (Jan 2017):

 

 

 

Speechocean: A global language resources and data services supplier

 

 

 

About Speechocean

 

Speechocean is one of the world well-known language related resources & services provider in the fields of Human Computer Interaction and Human Language Technology. At present, we can provide data services with 110+ languages and dialects across the world.

 

 

 

KingLine Data Center ---Data Sharing Platform

 

Kingline Data Center is operated and supervised by Speechocean, which is mainly focused on language resources creating and providing for research and development of human language technology.

 

These diversified corpora are widely used for the research and development in the fields of Speech Recognition, Speech Synthesis, Natural Language Processing, Machine Translation, Web Search, etc. All corpora are openly accessible for users all over the world, including users from scientific research institutions, enterprises or individuals.

 

For more detailed information, please visit our website: http://kingline.speechocean.com

 

 

 

New released corpora:

 

  1. Uighur Speech Recognition Corpus-Sentence/Conversation (Telephone)-300 Speakers

 

ID: King-ASR-450

 

The Uighur Telephone Speech Recognition Corpus was collected in China. It contains the voices of 300 different speakers (150 males, 150 females) who were balanced distributed in age (mainly 18-35, 36-45, >46), gender and regional accents (for the details, please see the technical document). The script contains 120,000(approx.) utterances in total (for more details of script structure design, please check the specification), specially designed to provide materials for both training and testing of many classes of speech recognizers. Each speaker was recorded in quiet environments (home/office). Telephone platform, i.e. IVR was used for speech collection. Each utterance wave was stored in a separate file and uncompressed. A pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available with specification.

 

 

 

  1. Hindi Speech Recognition Corpus- Conversation (Mobile)- 200 Speakers

 

ID: King-ASR-323

 

The Hindi Mobile Speech Recognition Corpus was collected in India. It contains the voices of 200 different speakers (108 males, 92 females) who were balanced distributed in age (16-30, 31-45, 46-60), gender and regional accents (for the details, please see the technical document). More than 20 topics were included in total (for more details of script structure design, please check the specification), specially designed to provide materials for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet office environment. Mobile platforms, i.e. iOS, Android and Windows were used for speech collection. Each utterance wave was stored in a separate file and uncompressed. A pronunciation lexicon is available with a phonemic transcription in SAMPA. All manually checked. All audio files were manually transcribed and annotated by native transcribers. Details are available with specification.

 

 

 

  1. Kids Mandarin Speech Recognition Corpus-Sentence (Mobile, Desktop & Smart TV)-575 Speakers

 

ID: King-ASR-409

 

The Kids Mandarin Speech Recognition Corpus was collected in China Liaoning Province and Hebei Province. It contains the voices of 575 different native speakers (283 males, 292 females) who were balanced distributed in age (4-9 years old), gender and regional accents (for the details, please see the technical document). The script contains 396,000(approx.) utterances in total( for more details of script structure design, please check the specification), specially designed to provide material for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet office room. Mobile phone, desktop and smart TV were used for speech collection. Each utterance wave is stored in a separate file and uncompressed. A pronunciation lexicon is available with a phonemic transcription in Pinyin. All manually checked. All audio files were manually transcribed and annotated by native transcribers. Details are available with specification.

 

 

 

  1. Hong Kong English Speech Recognition Corpus-Sentence (Mobile)-200 Speakers

 

ID: King-ASR-287

 

The Hong Kong English Mobile Speech Recognition Corpus was collected in Hong Kong. It contains the voices of 200 different speakers (99 males, 101 females) who were balanced distributed in age (18-30, 31-45, 46-60), gender and regional accents (for the details, please see the technical document). The script contains 179,406(approx.) utterances in total (for more details of script structure design, please check the specification), specially designed to provide materials for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet environment. Mobile platforms, i.e. iOSAndroidWindows were used for speech collection. Each utterance wave was stored in a separate file and uncompressed. A pronunciation lexicon is available with a phonemic transcription in Hepburn. All manually checked. All audio files were manually transcribed and annotated by native transcribers. Details are available with specification.

 

 

 

Contact Information

 

Xianfeng Cheng

 

VP

 

Tel: +86-10-62660928; +86-10-62660053 ext.8080

 

Mobile: +86 13681432590

 

Skype: xianfeng.cheng1

 

Email: chengxianfeng@speechocean.com; cxfxy0cxfxy0@gmail.com

 

Website: www.speechocean.com


 

 


Back  Top


 Organisation  Events   Membership   Help 
 > Board  > Interspeech  > Join - renew  > Sitemap
 > Legal documents  > Workshops  > Membership directory  > Contact
 > Logos      > FAQ
       > Privacy policy

© Copyright 2024 - ISCA International Speech Communication Association - All right reserved.

Powered by ISCA