ISCA - International Speech
Communication Association


ISCApad Archive  »  2015  »  ISCApad #200  »  Resources  »  Database  »  Speechocean – update (February 2015)

ISCApad #200

Friday, February 13, 2015 by Chris Wellekens

5-2-14 Speechocean – update (February 2015)
  

Speechocean – update (Feb 2015):

 

Speechocean: A global language resources and data services supplier

 

Speechocean has over 500 large-scale databases available in 110+ languages and accents with the platform of desktop, in-car, telephony and tablet PC. Our data repository is enormous and diversified, which includes ASR Databases, TTS Databases, Lexica, Text Corpora, etc.

 

Speechocean is glad to announce more resources that have been released:

ASR Databases

Speechocean provides 110+ regional languages corpora, available in a variety of formats, situational styles, scene environments and platform systems, covering In-car speech recognition corpora, mobile phone speech recognition corpora, fixed-line speech recognition corpora, desktop speech recognition corpora, etc. This month we are glad to introduce our most popular databases which were made for the tuning and testing purpose of speech recognition systems for speech ASR applications.

    1. In-Car

Serial Number

Kingline Data Names

Sound Parameter

King-ASR-125

Japanese Speech Recognition Database -(In car) 300 Speakers

48k,16bit

Four Channels

King-ASR-120

Chinese Mandarin Speech Recognition Database-(in car )1200 Speakers

16 K16 bit

Four Channels



    1. Mobile

Serial Number

Kingline Data Names

Sound Parameter

King-ASR-216

Chinese Mandarin Speech Recognition Database-Sentences (Mobile)--(5048 speakers)

16K,16bit

One Channel

King-ASR-044

Taiwanese Speech Recognition Database—(Mobile)--654 speakers

16K,16bit

One Channel

King-ASR-113

Chinese Mandarin Speech Recognition Database ----(Mobile)-4000 Speakers

16K,16bit

One Channel

 

    1. Telephony

Serial Number

Kingline Data Names

Sound Parameter

King-ASR-222

Japanese Speech Recognition Database ----
spontaneous dialog (Telephony)-200 Speakers

8k,16bit

King-ASR-027

Chinese Mandarin Speech Recognition Database ---- Spontaneous Speech (Telephone)-649 Speakers

8k,16bit

 

    1. Desktop

      Serial Number

      Kingline Data Names

      Sound Parameter

      King-ASR-087

      Taiwanese Speech Recognition Database ----
      Sentences (Desktop)-200 Speakers

      44.1k,16bit

      King-ASR-111

      Mandarin Speech Recognition Database ----
      spontaneous dialog (Desktop)-1013 Speakers

      44.1k,16bit

      King-ASR-175

      Japanese Speech Recognition Database ----
      Sentences (Desktop)-505 Speakers

      44.1k,16bit

  1. TTS Databases

Speechocean licenses a variety of databases in more than 40 languages for speech synthesis broadcasting speech, emotional speech, etc. which can be used in different algorithms.

Serial No.

Kingline Data Names

Sound Parameter

Recording Hours

King-TTS-020

Russian Speech Synthesis Database (Male)

44.1K,16bit
Two Channels

12.3

King-TTS-027

Taiwanese Speech Synthesis Database (Female)

44.1K,16bit
Two Channels

9.8

King-TTS-030

British English Speech Synthesis Database (Female)

44.1K,16bit
Two Channels

12.6

 

 

  1. Text Corpora

Speechocean licenses many kinds of text corpora in many languages which is superb for language model training.

ID

Kingline Data Names

Size

King-NLP-022

Japanese Name Variants Corpus

4000000 Words

King-NLP-023

Japanese Lexical Database

290000 Words

King-NLP-034

Japanese Organizations Names Corpus

580000 Words

 

  1. Lexica

Speechocean builds pronunciation lexica in many languages which can be licensed to customers.

No.

Name

Phoneset

King-Lexicon-032

Urdu Pronunciation Lexicon

SAMPA

King-Lexicon-033

Vietnamese Pronunciation Lexicon

SO

King-Lexicon-037

Brazilian Portugese Pronunciation Lexicon

SAMPA

 

Contact Information

Xianfeng Cheng

Business Manager of Commercial Department

Tel: +86-10-62660928; +86-10-62660053 ext.8080

Mobile: +86 13681432590

Skype: xianfeng.cheng1

Email: chengxianfeng@speechocean.com; cxfxy0cxfxy0@gmail.com

Website: www.speechocean.com

 

 

 

 

 

 

 

 




 

 

 

 

 

 


Back  Top


 Organisation  Events   Membership   Help 
 > Board  > Interspeech  > Join - renew  > Sitemap
 > Legal documents  > Workshops  > Membership directory  > Contact
 > Logos      > FAQ
       > Privacy policy

© Copyright 2024 - ISCA International Speech Communication Association - All right reserved.

Powered by ISCA