ISCApad #234 |
Monday, December 11, 2017 by Chris Wellekens |
Speechocean – update (Jan 2017):
Speechocean: A global language resources and data services supplier
About Speechocean
Speechocean is one of the world well-known language related resources & services provider in the fields of Human Computer Interaction and Human Language Technology. At present, we can provide data services with 110+ languages and dialects across the world.
KingLine Data Center ---Data Sharing Platform
Kingline Data Center is operated and supervised by Speechocean, which is mainly focused on language resources creating and providing for research and development of human language technology.
These diversified corpora are widely used for the research and development in the fields of Speech Recognition, Speech Synthesis, Natural Language Processing, Machine Translation, Web Search, etc. All corpora are openly accessible for users all over the world, including users from scientific research institutions, enterprises or individuals.
For more detailed information, please visit our website: http://kingline.speechocean.com
New released corpora:
ID: King-ASR-450
The Uighur Telephone Speech Recognition Corpus was collected in China. It contains the voices of 300 different speakers (150 males, 150 females) who were balanced distributed in age (mainly 18-35, 36-45, >46), gender and regional accents (for the details, please see the technical document). The script contains 120,000(approx.) utterances in total (for more details of script structure design, please check the specification), specially designed to provide materials for both training and testing of many classes of speech recognizers. Each speaker was recorded in quiet environments (home/office). Telephone platform, i.e. IVR was used for speech collection. Each utterance wave was stored in a separate file and uncompressed. A pronunciation lexicon is available. All audio files were manually transcribed and annotated by native transcribers. Details are available with specification.
ID: King-ASR-323
The Hindi Mobile Speech Recognition Corpus was collected in India. It contains the voices of 200 different speakers (108 males, 92 females) who were balanced distributed in age (16-30, 31-45, 46-60), gender and regional accents (for the details, please see the technical document). More than 20 topics were included in total (for more details of script structure design, please check the specification), specially designed to provide materials for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet office environment. Mobile platforms, i.e. iOS, Android and Windows were used for speech collection. Each utterance wave was stored in a separate file and uncompressed. A pronunciation lexicon is available with a phonemic transcription in SAMPA. All manually checked. All audio files were manually transcribed and annotated by native transcribers. Details are available with specification.
ID: King-ASR-409
The Kids Mandarin Speech Recognition Corpus was collected in China Liaoning Province and Hebei Province. It contains the voices of 575 different native speakers (283 males, 292 females) who were balanced distributed in age (4-9 years old), gender and regional accents (for the details, please see the technical document). The script contains 396,000(approx.) utterances in total( for more details of script structure design, please check the specification), specially designed to provide material for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet office room. Mobile phone, desktop and smart TV were used for speech collection. Each utterance wave is stored in a separate file and uncompressed. A pronunciation lexicon is available with a phonemic transcription in Pinyin. All manually checked. All audio files were manually transcribed and annotated by native transcribers. Details are available with specification.
ID: King-ASR-287
The Hong Kong English Mobile Speech Recognition Corpus was collected in Hong Kong. It contains the voices of 200 different speakers (99 males, 101 females) who were balanced distributed in age (18-30, 31-45, 46-60), gender and regional accents (for the details, please see the technical document). The script contains 179,406(approx.) utterances in total (for more details of script structure design, please check the specification), specially designed to provide materials for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet environment. Mobile platforms, i.e. iOSAndroidWindows were used for speech collection. Each utterance wave was stored in a separate file and uncompressed. A pronunciation lexicon is available with a phonemic transcription in Hepburn. All manually checked. All audio files were manually transcribed and annotated by native transcribers. Details are available with specification.
Contact Information
Xianfeng Cheng
VP
Tel: +86-10-62660928; +86-10-62660053 ext.8080
Mobile: +86 13681432590
Skype: xianfeng.cheng1
Email: chengxianfeng@speechocean.com; cxfxy0cxfxy0@gmail.com
Website: www.speechocean.com
|
Back | Top |