ISCA - International Speech
Communication Association



ISCApad #240

Tuesday, June 12, 2018 by Chris Wellekens

6-47 (2018-03-16) SPEECH RESEARCH SCIENTIST (TTS) at ObEN Inc., Pasadena, California, USA
  

SPEECH RESEARCH SCIENTIST (TTS)
 
Come join us and build Personal Artificial Intelligence (PAI) -- intelligent 3D avatars that look, sound, and behave like the individual user!
 ObEN is an artificial intelligence company developing a decentralized AI platform for Personal AI (PAI). Founded in 2014, ObEN is a K11, Tencent, Softbank Ventures Korea and HTC Vive X portfolio company.
 
As a Speech Research Scientist focusing on Text-to-Speech, you will develop cutting-edge deep learning algorithms for voice personalization. This includes developing structured acoustic models for synthesis that allow control of factors such as voice timbre, voice quality, language, accent, expressiveness and speaking style, as well as adaptation/conversion towards a target voice using a reduced amount of data.
 
Responsibilities:
 
● Develop and extend ObEN’s proprietary TTS system to improve the quality and naturalness of the synthesized voice and its similarity to the target voice, and to reduce the amount of data needed for speaker adaptation;
● Develop deep generative models of the raw speech waveform;
● Develop cross-lingual approaches (e.g. phonetic posteriorgrams).
 
Requirements:

● PhD with strong research experience in adaptation of DNN-based TTS systems, demonstrated by publications in top speech journals and conferences (ICASSP, Interspeech, etc.);
● Strong machine learning background and familiarity with standard statistical modeling techniques applied to speech;
● Research experience in deep generative models of raw audio (WaveNet) and Generative Adversarial Networks (WGAN);
● Fluency in Python and C++, and expert knowledge of deep learning packages (TensorFlow, Theano, Keras, etc.);
● Familiarity with linguistic phonetics;
● Knowledge of basic digital signal processing techniques for audio.
 Contact: pierre@oben.com

