ISCA - International Speech
Communication Association



ISCApad #241

Tuesday, July 10, 2018 by Chris Wellekens

5-2-12 Speechocean – update (July 2018)
  

 


 

 

 

Speechocean: A global language resources and data services supplier

 

 

 

About Speechocean

 

Speechocean is one of the world's well-known providers of language-related resources and services in the fields of Human Computer Interaction and Human Language Technology. At present, we provide data services in more than 110 languages and dialects from around the world.

 

 

 

KingLine Data Center: Data Sharing Platform

 

KingLine Data Center is operated and supervised by Speechocean and focuses mainly on creating and providing language resources for research and development in human language technology.

 

These diverse corpora are widely used for research and development in Speech Recognition, Speech Synthesis, Natural Language Processing, Machine Translation, Web Search, etc. All corpora are openly accessible to users all over the world, including scientific research institutions, enterprises and individuals.

 

 

 

For more detailed information, please visit our website: http://kingline.speechocean.com

 

 

 

Newly released data:

 

 

 

1. Chinese Mandarin Speech Recognition Corpus (Mobile)-Conversation-1250 Speakers

 

 

 

S.N: King-ASR-408

 

 

 

Details:

 

 

 

The Chinese Mandarin Speech Recognition Corpus was collected in China.

 

 

 

The script contains a total of 625 pairs of spontaneous everyday conversational speech data, specially designed to provide material for both training and testing speech recognizers.

 

 

 

This corpus contains the voices of 1,250 different speakers (571 males, 679 females), balanced across age groups (16–30, 31–45, 46–60), gender and regional accent. Each speaker was recorded in a quiet office or home environment.

 

 

 

Speech was collected on a mobile platform (Android). A pronunciation lexicon with phonemic transcriptions in zh-cn_pinyin is available; all entries were manually checked. All audio files were manually transcribed and annotated by native transcribers.

 

 

 

 

 

2. Guangdong Cantonese Speech Recognition Corpus (Mobile)-Sentences-1014 Speakers

 

 

 

S.N: King-ASR-241

 

 

 

Details:

 

 

 

The Guangdong Cantonese Speech Recognition Corpus was collected in Guangdong.

The script contains approximately 2,339,859 utterances in total, specially designed to provide material for both training and testing speech recognizers. Each utterance is stored uncompressed in a separate waveform file.

This corpus contains the voices of 1,014 different speakers (450 males, 564 females), balanced across age groups (16–30, 31–45, >45), gender and regional accent. Each speaker was recorded in both quiet (office/home) and noisy (restaurant/street) environments.

Speech was collected on a mobile platform (Android). A pronunciation lexicon with phonemic transcriptions in Jyutping is available; all entries were manually checked. All audio files were manually transcribed and annotated by native transcribers.

 

 

 

3. Russian Speech Synthesis Corpus - Male

 

 

 

S.N: King-TTS-020

 

 

 

Details:

 

 

 

Size: 8.12 GB

 

Recording Hours: 13.69 Hours

 

Parameters: 44.1 kHz, 16-bit, 2 channels
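
As a rough sanity check on the figures above, the size of uncompressed PCM audio follows from sample rate × bytes per sample × channels × duration. The sketch below assumes the listed parameters mean 44.1 kHz, 16-bit, 2-channel linear PCM and ignores file headers and label files; the function name is illustrative and not part of the corpus. Under those assumptions the audio alone comes to about 8.69 × 10^9 bytes (roughly 8.10 GiB), in the same range as the listed 8.12 GB.

```python
def pcm_size_bytes(sample_rate_hz: int, bits_per_sample: int,
                   channels: int, duration_seconds: int) -> int:
    """Return the raw size of uncompressed linear PCM audio in bytes."""
    bytes_per_second = sample_rate_hz * (bits_per_sample // 8) * channels
    return bytes_per_second * duration_seconds


# 13.69 recording hours, rounded to whole seconds (assumption: no silence trimmed).
duration_seconds = round(13.69 * 3600)

size = pcm_size_bytes(44_100, 16, 2, duration_seconds)
print(f"{size} bytes = {size / 1e9:.2f} GB (decimal) = {size / 2**30:.2f} GiB")
```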

 

 

 

The Russian Speech Synthesis Corpus contains recordings of one male voice talent. He is a broadcaster who was 34 years old at the time of recording and was born and raised in Moscow.

 

 

 

The corpus contains 9,212 utterances. It was recorded in a professional studio on two channels: the waveform and the electroglottography (EGG) signal. Speech rate, energy and timbre were strictly controlled during the recording process.

 

 

 

Each utterance was carefully proofread by linguists and stored in Windows uncompressed PCM format. Prosody labeling and phone boundary labeling are included. A pronunciation dictionary is available. All data were manually checked.

 

 

 

 

 

4. Pronunciation Lexicon of Loan Words of US English

 

 

 

S.N: King-Lexicon-079

 

 

 

Details:

 

 

 

Entries: 350,000

 

Phoneme Inventory: Computer Readable IPA (can be converted to other phone sets such as SAMPA or X-SAMPA on demand)

 

Stress: Included

 

Syllable Boundary: Included
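
To illustrate what converting between phone sets involves, here is a minimal sketch of a character-level mapping from Unicode IPA to X-SAMPA. It is an assumption-laden example, not the vendor's conversion: the table covers only a small subset of symbols, the lexicon's actual "Computer Readable IPA" encoding may differ from Unicode IPA, and the sample transcriptions are hypothetical rather than taken from the lexicon.

```python
# Partial IPA -> X-SAMPA table (illustrative subset only).
IPA_TO_XSAMPA = {
    "ˈ": '"',  # primary stress
    "ˌ": "%",  # secondary stress
    "ː": ":",  # length mark
    "ə": "@",
    "æ": "{",
    "ɪ": "I",
    "ʊ": "U",
    "ɛ": "E",
    "ɔ": "O",
    "ɑ": "A",
    "ʌ": "V",
    "ʃ": "S",
    "ʒ": "Z",
    "θ": "T",
    "ð": "D",
    "ŋ": "N",
}


def ipa_to_xsampa(ipa: str) -> str:
    """Map each IPA character to X-SAMPA.

    Characters missing from the table are passed through unchanged, which is
    only correct for symbols that are identical in both notations (plain
    lowercase letters and the "." syllable boundary, for example).
    """
    return "".join(IPA_TO_XSAMPA.get(ch, ch) for ch in ipa)


# Hypothetical entries with stress and syllable boundary marks.
print(ipa_to_xsampa("kæ.ˈfeɪ"))
print(ipa_to_xsampa("ʃɛf"))
```

A production converter would need a complete symbol table and handling for multi-character symbols and diacritics, which this sketch leaves out.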

 

 

 

 

 

Contact Information

 

Xianfeng Cheng

 

VP

 

Tel: +86-10-62660928; +86-10-62660053 ext.8080

 

Mobile: +86 13681432590

 

Skype: xianfeng.cheng1

 

Email: chengxianfeng@speechocean.com; cxfxy0cxfxy0@gmail.com

 

Website: www.speechocean.com

 


 


 

 



© Copyright 2024 - ISCA International Speech Communication Association - All rights reserved.
