ISCA - International Speech
Communication Association



ISCApad #239

Friday, May 11, 2018 by Chris Wellekens

5-2-23 Datatang
  

Datatang is a leading global data provider that specializes in customized data solutions, focusing on a variety of speech, image, and text data collection, annotation, and crowdsourcing services.

 

1. Speech data collection

2. Speech data synthesis

3. Speech data transcription

I have attached our company introduction for reference. Our available speech datasets are listed below:

US English Speech Data

300 people, about 200 hours

Uyghur Speech Data

2,500 people, about 1,000 hours

German Speech Data

100 people, about 40 hours

French Speech Data

100 people, about 40 hours

Spanish Speech Data

100 people, about 40 hours

Korean Speech Data

100 people, about 40 hours

Italian Speech Data

100 people, about 40 hours

Thai Speech Data

100 people, about 40 hours

Portuguese Speech Data

300 people, about 100 hours

Chinese Mandarin Speech Data

4,000 people, about 1,200 hours

Chinese Speaking English Speech Data

3,700 people, 720 hours

Cantonese Speech Data

5,000 people, about 1,400 hours

Japanese Speech Data

800 people, about 270 hours

Chinese Mandarin In-car Speech Data

690 people, about 245 hours

Shanghai Dialect Speech Data

2,500 people, about 1,000 hours

Southern Fujian Dialect Speech Data

2,500 people, about 1,000 hours

Sichuan Dialect Speech Data

2,500 people, about 860 hours

Henan Dialect Speech Data

400 people, about 150 hours

Northeastern Dialect Speech Data

300 people, 80 hours

Suzhou Dialect Speech Data

270 people, about 110 hours

Hangzhou Dialect Speech Data

400 people, about 170 hours

Non-Native Speaking Chinese Speech Data

1,100 people, about 73 hours

Real-world Call Center Chinese Speech Data

More than 5,000 people, 650 hours

Mobile-end Real-world Voice Assistant Chinese Speech Data

More than 2,000,000 people, 4,000 hours

Heavy Accent Chinese Speech Data

2,000 people, more than 1,000 hours

 

If any of these datasets are of particular interest to you, we can provide samples along with pricing.

 

Regards

 

Runze Zhao

zhaorunze@datatang.com 

Overseas Sales Manager | Datatang Technology

China

M: +86 185 1698 2583

18 Zhongguancun St.

Kemao Building Tower B 18F

Beijing 100190

 

US

M: +1 617 763 4722 

640 W California Ave, Suite 210

Sunnyvale, CA 94086



