ISCA - International Speech
Communication Association



ISCApad #163

Wednesday, January 11, 2012 by Chris Wellekens

5-2-4 Speechocean December 2011 update
  

Speechocean - Language Resource Catalogue - New Releases (2011-10)
Speechocean, as a global provider of language resources and data services, has more than 200 large-scale databases available in 80+ languages and accents covering the fields of Text to Speech, Automatic Speech Recognition, Text, Machine Translation, Web Search, Videos, Images etc.
Speechocean is glad to announce that more speech resources have been released:
Canadian French Speech Recognition Database - Sentences (Desktop) -- 200 speakers
This Canadian French speech recognition database was collected by Speechocean’s project team in Canada. It contains the voices of 200 native speakers, demographically balanced by age (mainly 16 – 30, 31 – 45, 46 – 60), gender (50±5% male, 50±5% female) and regional accent. A script pool of 20,000 simple sentences was phonetically designed for both training and testing of speech recognizers. Each speaker recorded 300 sentences randomly selected from the script pool. All speakers were recorded in a quiet office room through two professional microphones. Each prompted utterance is stored in a separate file, and each signal file is accompanied by an ASCII SAM label file containing the relevant descriptive information.
A pronunciation lexicon with a phonemic SAMPA transcription is also included.
For more information, please see the technical document at the following link:
http://www.speechocean.com/en-ASR-Corpora/616.html
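
As a rough illustration of the prompt assignment described above, the following sketch draws 300 sentences per speaker from a pool of 20,000. The function name, the fixed seed, and the choice to sample without repetition within a speaker but independently across speakers are assumptions made for the example; the announcement does not describe the actual selection procedure.

```python
import random

def assign_prompts(pool_size=20_000, speakers=200, per_speaker=300, seed=0):
    """Hypothetical prompt assignment: each speaker gets a random subset of the pool.

    Assumption: no sentence repeats within one speaker's list, and speakers
    are sampled independently, so sentences may recur across speakers.
    """
    rng = random.Random(seed)
    return [rng.sample(range(pool_size), per_speaker) for _ in range(speakers)]

assignments = assign_prompts()
total_utterances = sum(len(prompts) for prompts in assignments)
mean_readings_per_sentence = total_utterances / 20_000

print(total_utterances, mean_readings_per_sentence)
```

With the figures from the description, 200 speakers reading 300 sentences each gives 60,000 utterances, so a sentence in the pool is read three times on average.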


UK English Speech Recognition Database - Sentences (Desktop) -- 200 speakers
This UK English desktop speech recognition database was collected by Speechocean’s project team in the UK. It is one of the databases of our Speech Data - Desktop Project (SDD), which presently contains database collections for 30 languages.
It contains the voices of 200 native speakers, balanced by age (mainly 16 – 30, 31 – 45, 46 – 60), gender (106 males, 94 females) and regional accent (for details, please see the technical document). The script was specially designed to provide material for both training and testing of many classes of speech recognizers. Each speaker was recorded in a quiet office environment, reading 300 phonetically rich sentences randomly selected from a specially designed pool.
The speech data are stored as uncompressed 16-bit sequences sampled at 48.1 kHz. A pronunciation lexicon with a phonemic transcription in SAMPA is also included. The database contains 189.1 hours of pure recording. Phoneme labelling was done manually for 6843 sentences (100034 words) chosen from 24 speakers.
For more information, please see the technical document at the following link:
http://www.speechocean.com/en-ASR-Corpora/792.html
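
To give a sense of scale, the sketch below estimates the raw storage implied by the figures quoted above (189.1 hours, 48.1 kHz, 16 bit, uncompressed). The function name is hypothetical, and the single-channel default is an assumption, since the description does not state how many channels are stored; file headers and label files are ignored.

```python
def uncompressed_bytes(hours, sample_rate_hz, bits_per_sample, channels=1):
    """Size of raw PCM audio in bytes, ignoring file headers and label files."""
    seconds = hours * 3600
    return round(seconds * sample_rate_hz * channels * bits_per_sample / 8)

# Figures taken from the description; channels=1 is an assumption.
size = uncompressed_bytes(hours=189.1, sample_rate_hz=48_100, bits_per_sample=16)
print(f"{size / 1e9:.1f} GB per channel")
```

Under these assumptions the audio alone is on the order of tens of gigabytes per channel, which is worth bearing in mind when planning download or delivery.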

UK TTS Speech database (Female)
The UK English Speech Corpus consists of recordings of one native UK English professional broadcaster (female, 35 years old), made in a studio with high SNR (>35dB) over two channels (AKG C4000B microphone and electroglottography (EGG) sensor).
The Corpus includes the following sub-corpora:
1. Sentence sub-corpus: 3000 short sentences (7~12 words) and 2000 sentences of normal length (13~20 words). To cover a wide range of linguistic phenomena, all sentences were extracted from everyday articles published in England, such as national and international news and articles about life, travel, and so on. Sentences containing political, religious, obscene or pornographic words which might lead to negative emotions were carefully excluded.
2. Emotional sub-corpus: 100 exclamatory sentences and 100 interrogative sentences which can be used for emotional TTS study.
3. Digit sub-corpus: many kinds of digit data, such as isolated digits, connected digits in blocks, and natural and ordinal number readings.
4. Expression sub-corpus: general expressions, such as date, time, money and measure expressions.
5. Spell sub-corpus: letters of the alphabet, Greek characters and general abbreviations.
All reading prompts were manually revised, and prosody annotations were made according to the actual speech.
All speech data are segmented and labeled at the phone level. A pronunciation lexicon and pitch extracted from the EGG signal can also be provided on demand.
For more information, please see the technical document at the following link:
http://www.speechocean.com/en-TTS-Corpora/799.html

US TTS speech database (Female)
The US English Speech Corpus consists of recordings of one native US English professional broadcaster (female, 36 years old), made in a studio with high SNR (>35dB) over two channels (AKG C4000B microphone and electroglottography (EGG) sensor).
The Corpus includes the following sub-corpora:
1. Sentence sub-corpus: 3000 short sentences (7~12 words) and 2000 sentences of normal length (13~20 words). To cover a wide range of linguistic phenomena, all sentences were extracted from everyday articles published in the USA, such as national and international news and articles about life, travel, and so on. Sentences containing political, religious, obscene or pornographic words which might lead to negative emotions were carefully excluded.
2. Emotional sub-corpus: 100 exclamatory sentences and 100 interrogative sentences which can be used for emotional TTS study.
3. Digit sub-corpus: many kinds of digit data, such as isolated digits, connected digits in blocks, and natural and ordinal number readings.
4. Expression sub-corpus: general expressions, such as date, time, money and measure expressions.
5. Spell sub-corpus: letters of the alphabet, Greek characters and general abbreviations.
All reading prompts were manually revised, and prosody annotations were made according to the actual speech.
All speech data are segmented and labeled at the phone level. A pronunciation lexicon and pitch extracted from the EGG signal can also be provided on demand.

Italian TTS speech database (Female)
The Italian Speech Corpus consists of recordings of one native Italian professional broadcaster (female, 26 years old), made in a studio with high SNR (>35dB) over two channels (AKG C4000B microphone and electroglottography (EGG) sensor).
The Corpus includes the following sub-corpora:
1. Sentence sub-corpus: 3000 short sentences (7~12 words) and 2000 sentences of normal length (13~20 words). To cover a wide range of linguistic phenomena, all sentences were extracted from everyday articles published in Italy, such as national and international news and articles about life, travel, and so on. Sentences containing political, religious, obscene or pornographic words which might lead to negative emotions were carefully excluded.
2. Emotional sub-corpus: 100 exclamatory sentences and 100 interrogative sentences which can be used for emotional TTS study.
3. Digit sub-corpus: many kinds of digit data, such as isolated digits, connected digits in blocks, and natural and ordinal number readings.
4. Expression sub-corpus: general expressions, such as date, time, money and measure expressions.
5. Spell sub-corpus: letters of the alphabet, Greek characters and general abbreviations.
All reading prompts were manually revised, and prosody annotations were made according to the actual speech.
All speech data are segmented and labeled at the phone level. A pronunciation lexicon and pitch extracted from the EGG signal can also be provided on demand.

For more information about our databases and services, please visit our website www.speechocean.com or our online catalogue at http://www.speechocean.com/en-Product-Catalogue/Index.html
If you have any inquiries regarding our databases and services, please feel free to contact us:
Xianfeng Cheng: Chengxianfeng@speechocean.com
Marta Gherardi: Marta@speechocean.com



