ISCApad #203 |
Saturday, May 16, 2015 by Chris Wellekens |
Speechocean – update (April 2015):
Speechocean: A global language resources and data services supplier
Speechocean has over 500 large-scale databases available in 110+ languages and accents with the platform of desktop, in-car, telephony and tablet PC. Our data repository is enormous and diversified, which includes ASR Databases, TTS Databases, Lexica, Text Corpora, etc.
Speechocean is glad to announce more resources that available in its catalogue:
ID: King-ASR-134 This Turkish in-car speech recognition database was collected by Speechocean’s project team in Turkey. It contains the voices of 316 different native speakers who were balanced distributed in Age (mainly 18-30, 31-45, 46-60), Gender (156 males, 160 females) and regional accents. The script was specially designed to provide material for both training and testing of many classes of speech recognizers which contain 320 utterances covering 15 categories and 35 sub-categories for each speaker (for the detail script structure design, please see the technical document).
Each speaker was recorded under two environments from three environments (Parked, City Driving and Highway driving) with kinds of recording conditions such as motor running, fan on/off, window up/down and etc. and totally 320 utterances were recorded for each speaker under two environments (160 utterances and spontaneous sentences per environment). Four high quality audio channels (C1: SHURE SM10A, C2: SENNHEISER ME104, F1: AKG Q400, F2: AKG Q400) were used in a car, at least three popular cars were adopted and the speech data are stored as sequences of 16 kHz, 16 bit and uncompressed. Each utterance is stored in a separate file and each signal file is accompanied by an ASCII SAM label file which contains the relevant descriptive information. A pronunciation lexicon with a phonemic transcription in SAMPA is also included. All the data was transcribed and labeled.
ID: King-ASR-222 This is a Cantonese speech database, which is collected over 3 different mobile Operating Systems: iOS, Android and Windows Mobile. This database is owned by Speechocean. For each platform, 2400 speakers were recorded, and each speaker recorded 320 utterances. With discarding some unqualified utterances, the whole corpus contains the recordings of 965,128 utterances of Cantonese. For the whole corpus, the pure recording time is about 1261.56 hours, including the leading and trailing silence (about 500 ms). The total size of this database is about 135G.
Xianfeng Cheng Business Manager of Commercial Department Tel: +86-10-62660928; +86-10-62660053 ext.8080 Mobile: +86 13681432590 Skype: xianfeng.cheng1 Email: chengxianfeng@speechocean.com; cxfxy0cxfxy0@gmail.com Website: www.speechocean.com
|
Back | Top |