ISCA - International Speech
Communication Association


ISCApad Archive  »  2014  »  ISCApad #198  »  Resources  »  Database  »  Speechocean – update (November 2014)

ISCApad #198

Sunday, December 14, 2014 by Chris Wellekens

5-2-14 Speechocean – update (November 2014)
  

 

Speechocean – update (Nov 2014):

 

 

 

Speechocean: A global language resources and data services supplier

 

 

 

Speechocean has over 500 large-scale databases available in 110+ languages and accents with the platform of desktop, in-car, telephony and tablet PC. Our data repository is enormous and diversified, which includes ASR Databases, TTS Databases, Lexica, Text Corpora, etc.

 

 

 

Speechocean is glad to announce more resources that have been released:

 

ASR Databases

 

Speechocean provides 110+ regional languages corpora, available in a variety of formats, situational styles, scene environments and platform systems, covering In-car speech recognition corpora, mobile phone speech recognition corpora, fixed-line speech recognition corpora, desktop speech recognition corpora, etc. This month we are glad to introduce our most popular Asia Languages Databases which were made for the tuning and testing purpose of speech recognition systems for speech ASR applications.

 

    1. In-Car

 

Serial Number

Kingline Data Names

Sound Parameter

King-ASR-122

Chinese Mandarin Speech Recognition Database -(In car) 100 Speakers

48k,16bit

Four Channels

King-ASR-120

Chinese Mandarin Speech Recognition Database-(in car )160 Speakers

16 K16 bit

Four Channels

 



 

    1. Mobile

 

Serial Number

Kingline Data Names

Sound Parameter

King-ASR-216

Chinese Mandarin Speech Recognition Database-Sentences (Mobile)--(5048 speakers)

16K,16bit

One Channel

King-ASR-044

Taiwanese Speech Recognition Database—(Mobile)--654 speakers

16K,16bit

One Channel

King-ASR-113

Chinese Mandarin Speech Recognition Database ----(Mobile)-4000 Speakers

16K,16bit

One Channel

 

 

 

    1. Telephony

 

Serial Number

Kingline Data Names

Sound Parameter

King-ASR-222

Japanese Speech Recognition Database ----
spontaneous dialog (Telephony)-200 Speakers

8k,16bit

King-ASR-027

Chinese Mandarin Speech Recognition Database ---- Spontaneous Speech (Telephone)-649 Speakers

8k,16bit

 

 

 

    1. Desktop

      Serial Number

      Kingline Data Names

      Sound Parameter

      King-ASR-062

      Thai Speech Recognition Database ----
      Sentences (Decktop)-200 Speakers

      44.1k,16bit

      King-ASR-111

      Mandarin Speech Recognition Database ----
      spontaneous dialog (Desktop)-1013 Speakers

      44.1k,16bit

      King-ASR-175

      Japanese Speech Recognition Database ----
      Sentences (Desktop)-505 Speakers

      44.1k,16bit

  1. TTS Databases

 

Speechocean licenses a variety of databases in more than 40 languages for speech synthesis broadcasting speech, emotional speech, etc. which can be used in different algorithms.

 

Serial No.

Kingline Data Names

Sound Parameter

Recording Hours

King-TTS-024

Japanese Speech Synthesis Database (Male)

44.1K,16bit
Two Channels

7.5

King-TTS-025

Portugal Portuguese Speech Synthesis Database (Male)

44.1K,16bit
Two Channels

9.1

King-TTS-026

Russian Speech Synthesis Database (Female)

44.1K,16bit
Two Channels

8.6

 

 

 

 

 

  1. Text Corpora

 

Speechocean licenses many kinds of text corpora in many languages which is superb for language model training.

 

ID

Kingline Data Names

Size

King-NLP-037

Traditional Chinese Email Corpus

2700000 Chinese Characters

King-NLP-038

Traditional Chinese Chatting Corpus

2600000 Chinese Characters

King-NLP-039

Traditional Chinese SMS Corpus

12000000 Chinese Characters

 

 

 

  1. Lexica

 

Speechocean builds pronunciation lexica in many languages which can be licensed to customers.

 

No

Name

Phoneset

King-Lexicon-031

Catalan Pronunciation Lexicon

UPC

King-Lexicon-037

Brazilian Portuguese Pronunciation Lexicon

SAMPA

King-Lexicon-043

Norwegian Pronunciation Lexicon

XSAMPA

 

 

 

 

 

Contact Information

 

Xianfeng Cheng

 

Business Manager of Commercial Department

 

Tel: +86-10-62660928; +86-10-62660053 ext.8080

 

Mobile: +86 13681432590

 

Skype: xianfeng.cheng1

 

Email: chengxianfeng@speechocean.com; cxfxy0cxfxy0@gmail.com

 

Website: www.speechocean.com

 

 

 

 

 




 

 

 

 

 

 


Back  Top


 Organisation  Events   Membership   Help 
 > Board  > Interspeech  > Join - renew  > Sitemap
 > Legal documents  > Workshops  > Membership directory  > Contact
 > Logos      > FAQ
       > Privacy policy

© Copyright 2024 - ISCA International Speech Communication Association - All right reserved.

Powered by ISCA