ISCA - International Speech
Communication Association


ISCApad Archive  »  2012  »  ISCApad #172  »  Jobs

ISCApad #172

Sunday, October 07, 2012 by Chris Wellekens

6 Jobs
6-1(2012-04-02) Research position in Spoken Language Dialogue Systems Development for Serious Games ; University of Ulm Germany

Research Position with perspective of a PhD degree in Spoken Language
Dialogue Systems Development for Serious Games

The Dialogue Systems Group (www.dialogue-systems.org) in the Faculty of
Engineering and Computer Sciences, University of Ulm is seeking a
researcher at MSc level to work in the area of Spoken Dialogue
Management for Serious Games. The research topic will fit into the
scientific context of the group (including Intelligent, Adaptive and
Proactive Spoken Language Dialogue Interaction, Semantic Analysis, and
Dialogue Modelling) but will be adapted to the expertise of the candidate.

The dialogue management system will manage the communication between
mobile nodes that are connected via a mobile adhoc network (MANET). Due
to the mobility of the nodes and limited range of wireless transmissions
the underlying network topology and link quality frequently changes. In
order to build an adaptive dialogue management system the network can
provide the system with information about the available resources (such
as segmentation and link quality). In turn the dialogue manager can
request different Quality of Service for individual communications (such
as low latency, delay tolerance or high reliability)

Perspective: PhD Thesis.

Requirements: Good programming skills in C, C++, Perl, VoiceXML, Java,
JavaScript and experience with Unix/Linux are highly desirable;
expertise in speech and dialogue technologies would also be appreciated.

The appointment (0,5 TVL) has a fixed duration of 36 months.

Candidates should send their application electronically to
wolfgang.minker@uni-ulm.de. The application should include a short
resume, the names of two referees and a transcript of records with the
results of exams relevant to the MSc Degree. A pdf-version of the MSc
Thesis may also be included.

Dialogue Systems Group
Institute of Communications Engineering
Faculty of Engineering and Computer Sciences
University of Ulm, Germany

-- 
Wolfgang Minker
Ulm University
Communications Engineering - Dialogue Systems
Albert-Einstein-Allee 43
D-89081 Ulm
Phone: +49 731 502 6254/-6251
Fax:   +49 731 501 226254
http://dialogue-systems.org

Back  Top

6-2(2012-04-04) PhD fellowship- Fondazione Bruno Kessler (FBK), Trento, Italy



A PhD fellowship  is available for conducting research  studies in the
field of Automatic Speech Recognition at the Human Language Technology
Research    Unit    (http://hlt.fbk.eu/en/openpositions/phd-ict)    of
Fondazione Bruno  Kessler (FBK), Trento, Italy. Research  work will be
carried out  at FBK as  part of the  PhD Program of  the International
Doctorate  School   in  Information  and   Communication  Technologies
(http://www.ict.unitn.it)   of  the   University  of   Trento,  Italy.
Interested  candidates need to  specify in  the application  form that
they intend  apply for the  project-specific grant offered by  FBK for
the  Automatic  Speech  Recognition  (ASR)  project. FBK  has  a  long
tradition  in developing automatic  transcription systems  for several
languages (information about the research group and ongoing activities
can be  found at  http://hlt.fbk.eu), the aim  of the project  will be
advancing  beyond  the  state-of-the-art  the existing  FBK  automatic
transcription  technology.  Possible research  topics include  but are
not limited  to: improving acoustic modeling for  large vocabulary ASR
(e.g.  discriminative  training algorithms, speaker  adaptive acoustic
modeling,  methods  for  fast  and efficient  adaptation  to  changing
domains, data selection methods for AM training, bootstrap methods for
under resourced  languages) and improving language  modeling for large
vocabulary ASR  (data selection  for LM training,  domain adaptation).
Details   about   requirements  of   candidates   can   be  found   at
http://hlt.fbk.eu/en/openpositions/phd-ict.

Contact: Daniele Falavigna (falavi@fbk.eu)

Back  Top

6-3(2012-04-04) Post-Doctoral Research Position, Aalto University

Post-Doctoral Research Position, Aalto University

 

Title: Statistical speech synthesis

Department: Department of Signal Processing and Acoustics

 

URL: http://spa.aalto.fi/en/

Start date: August-October 2012

Duration: 12-18 months contract

 

Department of Signal Processing and Acoustics, Aalto University (Espoo, Finland), invites applications for a post-doctoral researcher position in speech technology. The position is funded by the Simple4all project (http://simple4all.org/), which is a collaboration between Aalto University, University of Edinburgh (coordinator), University of Helsinki, Universidad Politécnica de Madrid, and Universitatea Tehnica Cluj-Napoca. Simple4All is a 3 year project, funded by EC’s FP7 ICT Programme, whose general aim is to create speech synthesis technology that learns from data with little or no expert supervision and continually improves itself, simply by being used.

 

The work at the Department of Signal Processing and Acoustics focuses on novel vocoding technologies in statistical parametric speech synthesis. More specifically, we are interested in utilizing such speech models in statistical speech synthesis that are closer to the human speech production mechanism and are inherently able to produce many voice qualities. Applicants for the post-doctoral researcher position must have a PhD (or equivalent experience) in speech processing, digital signal processing or computer science. They must have background in statistical speech synthesis, experience in the development of vocoders is particularly appreciated. In addition, experience of project development and project leadership in a research context, together with excellent communication, presentation, and organisational skills are highly desirable.

 

To apply, please send your CV (.pdf format) including a list of publications and your contact information, a statement of research interests and contact information for at least 2 references. Applications must be sent to paavo.alku@aalto.fi using the subject line: Post-doc position in statistical speech synthesis

Application deadline: 30 June 2012

 

Back  Top

6-4(2012-04-15) Full Time Research Programmer, Dialog Research Center, CMU Pittsburgh

Full Time Research Programmer, Dialog Research Center
Language Technologies Institute, CMU Pittsburgh

Minimum Education Level: Bachelor's Degree

The Dialog Research Center (dialrc.org) provides infrastructure support for spoken dialog systems, including distribution of data and software. DIALRC is funded by the US National Science Foundation, and is hosted at the Language Technologies Institute in the School of Computer Science at Carnegie Mellon University's main campus in Pittsburgh PA, US. In addition to distributing open source software and dialog data, DIALRC offers live dialog platforms for researchers to evaluate their techniques with real users in live situations. The person filling the position will be responsible for developing and supporting existing dialog software, managing distributions, supporting other researchers using the system and other programming and support tasks as necessary for the center.

Primary Tasks

Work with graduate student researchers to maintain and further develop existing dialog systems based on the open-source Olympus Spoken Dialog System framework.  Maintain and distribute data corpus.  General programming and research support.

Preferred Skills

Two or more years experience in research programming; some experience in supporting research software systems; experience with the CMU Olympus Spoken Dialog System; other spoken dialog systems; and/or speech recognition and speech synthesis.

Experience in Speech Processing/Natural Language Dialog: Python, Perl, C/C++

More information is avilable from Maxine Eskenazi (max@cs.cmu.edu) and Alan W Black (awb@cs.cmu.edu).

Or goto http://www.cmu.edu/jobs/postings/index.html and search for Job Number 9039

Back  Top

6-5(2012-04-20) PhD grant: Prosodic markers at IRIT Toulouse

Modélisation de trajectoires de marqueurs prosodiques et linguistiques ; application à la caractérisation des intentions des intervenants dans les discours audiovisuels

 

Contact

Jérôme Farinas, jfarinas@irit.fr équipe SAMOVA http://www.irit.fr/recherches/SAMOVA/

 

Description du sujet

Dans le domaine du traitement automatique de l'audio, les systèmes actuels sont parvenus à une assez grande maturité pour extraire de façon plutôt fiable des informations sur les locuteurs présents, la langue utilisée et la transcription de la parole. Un des objectifs de la recherche actuelle consiste à utiliser ces informations afin de structurer les interventions des locuteurs et plus largement le contenu radiophonique et télévisuel.

 

Dans ce contexte, l'équipe SAMOVA de l'IRIT a acquis ces dernières années de fortes compétences en modélisation et segmentation automatique en locuteurs [Louradour 2007, El Khoury 2010], en identification automatique de langues [Pellegrino 1998, Farinas 2002, Rouas 2005], en segmentation parole/musique/chant [Pinquier 2004, Lachambre 2009], en extraction de jingle [Pinquier 2004], en transcription de la parole [Campagne ESTER 2004], en recherche de zones de parole conversationnelle [Projet EPAC 2010] et de mots-clés [Le Blouch 2009]. En s'appuyant sur ces travaux, l'équipe travaille sur la structuration des émissions en se basant sur les interventions des locuteurs et leurs interactions [Bigot 2011] ainsi que sur la vidéo [Ercolessi 2011].

 

A partir d'une caractérisation du rôle des intervenants (présentateur, locuteur dominant...) notre objectif est d'étudier plus précisément les interactions entre locuteurs afin de distinguer ce qui dans le message relève de l'interaction (ouverture, clôture, présentation d'un invité, gestion des tours de parole) et des échanges d'opinion. Plus largement, le sujet de thèse proposé vise à étudier l'intention dans les interventions audiovisuelles de personnes.

La modélisation des intentions est principalement basée sur la modélisation de la prosodie, qui a travers l'intonation et le rythme permet d'influer sur la forme du discours. Cette modélisation devra prendre en compte la prosodie à court ou long terme [Farinas2002,Rouas2004]. Deux niveaux de modélisations seront donc mis en œuvre afin de caractériser la modalité de la phrase et la modification de la prosodie des mots. Cela passera par la choix de paramètres prosodiques appropriée (F0, energie) et la modélisation statistique de ces paramètres. L'évolution temporelle pourra être prise en compte en utilisant des modélisations stochastiques, des modélisations de trajectoires.

Cette étude se déroulera en deux phases :

  • dans un premier temps elle portera sur la détermination de marqueurs linguistiques (par le biais de la détection d'expressions clés) et prosodiques (emphase, modalité de la phrase, intonation locale) caractéristiques de certaines fonctions communicatives présentes dans les interactions entre personnes. Ces indicateurs permettront de localiser les zones du document dans lesquelles les informations sur l'intervenant (son nom, son statut) sont potentiellement présentes et apporteront des précisions sur le contexte dans lequel la personne intervient (interview, débat, …). Ces informations pourront d'une part aider à mieux décrire le contenu et d'autre part renforcer les résultats issues de la reconnaissance de la parole particulièrement difficile en situation de débat et de parole spontanée.

  • dans un second temps, à partir des informations disponibles sur les intervenants, l'étude portera sur l'analyse de leurs intentions. Par exemple, pour un présentateur il s'agira de déterminer les zones qui correspondent à la présentation des invités, la gestion des tours de parole, l'ouverture ou la clôture des débats tandis que pour un invité, il s'agira plutôt de qualifier ses tours de parole afin de caractériser l'objectif de son intervention (donner un avis, apporter une contradiction...) à travers, entre autres, du message, du ton, du comportement, du mode d'expression, de la prosodie locale mais également d'indications issues de la vidéo (texte incrusté, ...).

 

Les applications de cette recherche concernent la structuration de contenus audiovisuels pour aider à l'archivage documentaire et la recherche d'information dans ces contenus. Cette structuration et caractérisation de zones d'interaction présente également un intérêt pour la constitution de résumés audio-visuels.

 

Le candidat devra posséder un Master avec de fortes compétences en informatique. Des connaissances en traitement du signal, en reconnaissance de la parole seraient souhaitables (reconnaissance de la parole et prosodie).

 

Références

[Louradour 2007] Noyaux de séquences pour la vérification du locuteur par Machines à Vecteurs de Support. Thèse de doctorat, Université Paul Sabatier, janvier 2007

[El Khoury 2010] Unsupervised Video Indexing based on Audiovisual Characterization of Persons. Thèse de doctorat, Université de Toulouse, juin 2010

[Pellegrino 1998] Une approche phonétique en identification automatique des langues : la modélisation acoustique des systèmes vocaliques. Thèse de doctorat, Université Paul Sabatier, décembre / december 1998.

[Farinas 2002] Une modélisation automatique du rythme pour l'identification des langues. Thèse de doctorat, Université Paul Sabatier, novembre 2002.

[Rouas 2005] Caractérisation et identification automatique des langues. Thèse de doctorat, Université Paul Sabatier, mars 2005.

[Pinquier 2004] Indexation sonore : recherche de composantes primaires pour une structuration audiovisuelle. Thèse de doctorat, Université Paul Sabatier, décembre 2004.

[Lachambre 2009] Caractérisation de l'environnement musical dans les documents audiovisuels. Thèse de doctorat, Université de Toulouse, décembre 2009.

[Campagne ESTER 2004] G. Gravier, J.F. Bonastre, S. Galliano, E. Geoffrois, K. Mc Tait and K. Choukri. ESTER, une campagne d'évaluation des systèmes d'indexation d'émissions radiophoniques, Proc. Journées d'Etude sur la Parole, Avril 2004.

[projet EPAC 2010] Yannick Estève, Thierry Bazillon, Jean-Yves Antoine, Frédéric Béchet, Jérôme Farinas. The EPAC corpus: manual and automatic annotations of conversational speech in French broadcast news (regular paper). Dans : Language Resources and Evaluation Conference (LREC 2010), Valletta, Malte, 19/05/2010-21/05/2010, Nicoletta Calzolari, Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk (Eds.), European Language Resources Association (ELRA), p. 1686-1689, 2011.

[Le Blouch 2009] Décodage acoustico-phonétique et applications à l'indexation audio automatique. Thèse de doctorat, Université Paul Sabatier, juin 2009.

[Bigot 2011] Benjamin Bigot, Isabelle Ferrané, Julien Pinquier, Régine André-Obrecht. Speaker Role Recognition to help Spontaneous Conversational Speech Detection (regular paper). Dans : International workshop on Searching Spontaneous Conversational Speech SCSS (SCSS 2010), Firenze, Italy, 25/10/2010-29/10/2010, ACM, p. 5-10, octobre 2010.

[Ercolessi 2011] Philippe Ercolessi, Hervé Bredin, Christine Sénac and Philippe Joly, Segmenting TV series into scenes using speaker diarization, WIAMIS 12th International Workshop on Image Analysis for Multimedia Interactive Services, Delft, Pays-Bas,13-15 avril 2011.

 

Mots clés

Traitement automatique de la parole, décodage phonétique, recherche de mots clés, prosodie, acoustique, structuration en émissions, vidéo

 

Kewords

Automatic Speech Processing, Phonetic Decoding, Keyword Spotting, Prosody, Acoustic, Structuring Programs, Video

Back  Top

6-6(2012-04-20) Ingénieur at INRIA France

Inria recherche un ingénieur jeune diplômé pour développer sa boîte à outils de séparation de sources audio FASST (http://bass-db.gforge.inria.fr/fasst/) et effectuer un travail de recherche sur la reconnaissance de la parole robuste au bruit.

Qualification: ingénieur ou master 2, diplôme obtenu en 2011 ou 2012
Durée: 2 ans
Lieu de travail: Nancy, France
Date prévisionnelle d’embauche: 01/12/2012
Salaire: 2527 € brut mensuel

Les candidatures seront examinées au fil de l'eau. Informations supplémentaires et formulaire de candidature sur
http://www.inria.fr/institut/recrutement-metiers/offres/ingenieurs-jeunes-diplomes/%28view%29/details.html?nPostingTargetID=11988

Back  Top

6-7(2012-05-01) PhD Reconnaissance automatique de la parole continue : parole spontanée LORIA Nancy France

Sujet de thèse :Reconnaissance automatique de la parole continue : parole spontanée

 

Encadrants pour ce sujet :
Irina Illina, Maitre de conférences, HDR, Université de Lorraine, bureau C147, tel. 03 83 59 84 90, mel. illina@loria.fr
– Denis Jouvet, Directeur de recherches INRIA, HDR bureau C147, tel. 03 54 95 86 26, mel. denis.jouvet@inria.fr
Type de financement  CONTRAT DOCTORAL

Lieu : Inria-LORIA Nancy

 

Le sujet est affiché sur le site de l'école doctorale IAEM http://www.iaem.uhp-nancy.fr/ , rubrique 'propositions contrats doctoraux'.

Date limite du depot de candidature : le 1-er juin

 


 

Conetxte : La reconnaissance de la parole est un processus par lequel un ordinateur transforme le signal acoustique de la parole prononcée en texte. Pendant ce processus, le système de reconnaissance utilise des modèles acoustiques, des modèles de langage et un lexique de prononciations.
La parole spontanée est définie comme un énoncé perçu et conçu au fil de son élocution. Par rapport à la parole préparée, la parole spontanée se caractérise par :
– des insertions (hésitations, répétitions, pauses, reprises, faux départs) ;
– des variations de prononciations (contraction de mots ou de phonèmes) ;
– des environnements difficiles (rires, parole superposée) ;
– des phrases agrammaticales.
La parole spontanée est présente sous plusieurs formes : interviews, débats, dialogues. Ces spécificités sont peu ou pas prises en compte dans les systèmes de reconnaissance de la parole.
Afin d’améliorer la performance de systèmes de reconnaissance il est nécessaire de s’attaquer à deux problèmes ouverts :
– d’un part, détecter automatiquement ces événements de la parole spontanée ;
– et d’autre part, les prendre en compte dans le système de reconnaissance au niveau acoustique ainsi qu’au niveau linguistique.
Pour caractériser et détecter la parole spontanée, [Dufour et al.2009] propose un ensemble de caractéristiques acoustiques (la durée et le débit phonétique) et linguistiques (morphèmes spécifiques, répétitions et faux départs). Concernant la prise en compte de la parole spontanée, certaines pistes de recherche se sont avérées intéressantes telles que l’analyse de prononciation latente avec les connaissances à priori [Lin2007], l’utilisation de dictionnaires avec des prononciations multiples issues de la parole spontanée et l’étude de différents contextes acoustiques de phonèmes [Dupont et al.2005].

 

 

L’objet de cette thèse est d’apporter des éléments de solution à ce problème en proposant de nouvelles méthodes qui permettent de mieux prendre en compte les caractéristiques de la prononciation spontanée dans le cadre de la reconnaissance automatique de la parole.
Le premier objectif de cette thèse concerne l’augmentation de nos connaissances de la variabilité de la parole spontanée dans différents types de parole (interviews, dialogues, etc.). Nous allons surtout nous intéresser aux aspects segmentaux et acoustiques du problème. Les aspects prosodiques pourraient également être envisagés.
Le second objectif concerne la détection et la localisation de ces phénomènes de parole spontanée, et surtout leur prise en compte pour améliorer la reconnaissance de la parole. Ceci reposera sur l’enrichissement des modèles pour tenir compte des connaissance acquises, ainsi que sur la mise en ouvre de techniques de détection de ces phénomènes. Le travail s’effectuera au sein de l’équipe PAROLE au LORIA en utilisant le système ANTS [Brun et al.2005]. Après une étude bibliographique, l’étudiant aura à analyser des corpus de parole, à développer des modules de traitement de la parole spontanée et à les intégrer dans notre système de reconnaissance de la parole. Puis il devra d’évaluer les améliorations sur différents
corpus de parole. Notre équipe possède déjà un corpus riche en parole spontanée : le corpus d’émissions radiophoniques et télévisées, issu des campagnes d’évaluation ESTER et ETAPE.
Les validations éventuelles sur un corpus de parole de personnes âgées (dans un but d’assistance aux personnes à domicile) nous permettraient probablement de dégager et d’étudier d’autres phénomènes de la parole spontanée.
Les domaines abordés par ce sujet sont : la reconnaissance automatique de la parole,
la modélisation probabiliste, la parole spontanée, modélisation acoustique, modèle de langage.

 

 

Références : [Brun et al.2005] A. Brun, C. Cerisara, D. Fohr et I. Illina. ANTS : le système de transcription automatique du LORIA. WorkShop ESTER, 2005.
[Dufour et al.2009] R. Dufour, V. Jousse, Y. Estève, F. Bechet et G. Linares. Spontaneous speech characterization and detection in large audio database. SpeCom, 2009.
[Dupont et al.2005] S. Dupont, C. Ris, L. Couvreur et J.-M. Boite. A study of implicit and explicit modeling of coarticulation and pronunciation variation. Interspeech, 2005.
[Lin2007] L.-S. Lin, C.-K. Lee. Pronunciation modeling for spontaneous speech recognition using latent analysis (LPA) and prior knowledge. ICASSP, 2007

Back  Top

6-8(2012-05-13) PhD position: Caractérisation de l'ambiance sonore dans des enregistrements ethnomusicologiques IRIT Toulouse France

Titre : Caractérisation de l?ambiance sonore dans des enregistrements ethnomusicologiques

 

Responsables : Régine André-Obrecht et Julien Pinquier (IRIT, équipe SAMoVA) obrecht@irit.fr et pinquier@irit.fr

 

Cette thèse concerne le traitement de données ethnomusicologiques issues des archives du CNRS-Musée de l?Homme, gérées par le Centre de Recherche en EthnoMusicologie (CREM) du Laboratoire d'Ethnologie et de Sociologie Comparative (LESC). Il s?agit de documents en cours de numérisation et d?informatisation (3500 heures d?enregistrements inédits, de 1900 à nos jours, de musiques traditionnelles et d?enquêtes ethnographiques du monde entier et 3500 heures de documents anciens et rares). Cette collection est d?une grande importance historique et est unique au monde. Dans ce contexte applicatif, il est nécessaire de mettre au point un ensemble d'outils de traitement automatique de l'audio (parole, musique, chant, bruits?) afin de produire une indexation (semi)automatique pour un accès intelligent à la collection d'enregistrements sonores. Ce travail est principalement à destination de chercheurs (experts) en ethnomusicologie.

 

L?étude envisagée a pour objectif une caractérisation plus fine des composantes  Parole, Musique, Chant, Bruits afin de définir l?environnement sonore générique. De plus, l?introduction d?une approche semi-supervisée (prise en compte de métadonnées disponibles ou de l?utilisateur) doit permettre la caractérisation d?environnements sonores spécifiques.

 

Après s?être approprié les différents systèmes précédemment développées à l?IRIT, concernant la détection de parole et de musique, le doctorant aura en charge leur adaptation au corpus du projet. L?analyse des zones de parole et de voix chantée détectées doit conduire à une segmentation en tours de parole et en tours de chant, suivie du regroupement de ces segments par recherche de similarité des voix. Dès lors que les enregistrements sonores sont effectués dans des conditions naturelles et lorsque les zones de parole, de musique et de chant sont identifiées, restent des zones sonores digne d?un intérêt pour un ethnomusicologique car leur écoute permet de préciser le contexte sonore de la session de l?enregistrement, ce que l?on appelle « l?ambiance sonore ». Il est proposé de localiser ces zones de bruit d?intérêt et de spécifier un étiquetage. Pour ce faire, deux stratégies sont envisagées :

- un mode supervisé en utilisant les attributs acoustiques classiques (approche générique),

- un mode non-supervisé en introduisant des connaissances issues des ethnomusicologues (approche spécifique) via la plateforme Telemeta (http://crem.telemeta.org/).

 

Ce doctorat sera financé par le projet ANR DIADEMS qui démarrera en octobre 2012. Il serait appréciable que le candidat ait des connaissances en reconnaissance de formes et en traitements de la parole et de la musique.


Date limite de réponse
: 15 juin 2012

Back  Top

6-9(2012-06-01) Two positions at Nuance Belgium

Nuance is the leading provider of speech and imaging solutions for businesses and consumers around the world. Our technologies, applications and services make the user experience more compelling by transforming the way people interact with information and how they create, share and use documents. Every day, millions of users and thousands of businesses, experience Nuance by calling directory assistance, getting account information, dictating patient records, telling a navigation system their destination, or digitally reproducing documents that can be shared and searched. Making each of those experiences productive and compelling is what Nuance is all about.

Speech Recognition Specialist

Merelbeke, Belgium Permanent role Response to Craig.Robertson@Nuance.com

Nuance Mobile builds innovative, intelligent and intuitive touch and speech interfaces to simplify and enhance the way people interact with mobile devices, applications, and services.  Nuance Mobile solutions make mobile devices and in-car systems easier to use, automate customer self-service, and optimize the access and discovery of even the most advanced mobile applications and content - regardless of technical know-how, location, environment, or physical and literacy capabilities.

As a contributing member of Nuance, you will work within a dynamic team environment to develop, support, market and sell our award-winning software applications. We offer competitive compensation packages and a challenging technical but casual work environment. Join our dynamic, entrepreneurial team that operates worldwide (Europe, US, APAC). Be a part of our fast growing track of continuing success.

For more information, please see www.nuance.com.

Nuance is an equal opportunity employer.

Responsibilities

As a Speech Recognition specialist at Nuiance you will work with peers from other teams arround  the world to investigate new & best usage of speech recognition for music and/or POI vertical domains. You will be closely working together with our R&D dpt to understand what is and what is not doable with the current limitation of the technology, and help customer's and Nuance internal integration teams to include Nuance technologies into successful products in an efficient way.

Representative tasks will include:

  • Investigate new & best usage of our speech recognition technologies for entering a POI (Point of Interest) by voice, considering the platform & technology constraints
  • Investigate new & best usage of our speech recognition technologies for accessing music by voice, considering the platform & technology constraints
  • Crunch navigation POI data from map providers to build proof of concepts and experiments
  • Contribute to research and technology agendas by providing input and improvement requests to our r&d dpt
  • Support customer projects integrating ASR technologies for POI and/or music

 Qualifications

  • Bachelors or Graduate University degree in Electrical Engineering, Computer Engineering, Computer Science or equivalent / related Technical Degree
  • first working experience
  • Strong C/C++ programming skills; proven software/system problem-solving skills.
  • Excellent oral and written communication skills in English is a must
  • Good listener and communicator, who can represent Nuance professional services at the customer’s premises or in written and oral communications with customers.
  • Positive 'can-do' attitude, well organized, focusing on achieving results cost-effectively
  • Ability and willingness to travel 
  • Ability to work independently, including at customer premises, but always as part of the embedded team. 
  • Self learner, with sense of initiative, and perseverance to deliver high quality work.

 

Preferred:

•  Experience with embedded hardware platforms, embedded operating systems, and embedded software development is desirable

•  Experience with Python and SQLite is highly desirable

•  Windows CE or Linux or QNX OS

 

Nuance is the leading provider of speech and imaging solutions for businesses and consumers around the world. Our technologies, applications and services make the user experience more compelling by transforming the way people interact with information and how they create, share and use documents. Every day, millions of users and thousands of businesses, experience Nuance by calling directory assistance, getting account information, dictating patient records, telling a navigation system their destination, or digitally reproducing documents that can be shared and searched. Making each of those experiences productive and compelling is what Nuance is all about.

NLP Processing Engineer

Merelbeke, Belgium Permanent role Response to Craig.Robertson@Nuance.com

Qualifications

Excellent background in statistics, pattern recognition, and/or signal processing

• Expertise in natural language processing, computational linguistics, statistical language modeling, search, and/or machine translation

• Strong programming skills, ideally in Python, Java, and/or C.

• Skills related to text processing, scripting languages, regular expressions

• Excellent oral and written communications skills in English.

• Ability to carry out focused and goal-oriented research and development, ability to assume responsibility for one’s work

• Ability to work in an international team as well as independently in fast-paced environment

• Ability to creatively solve problems while leveraging existing technology with an eye for efficiency.

 

 

PhD or equivalent research experience are a strong asset

 

• Good knowledge of speech recognition theory, acoustics, and/or psychoacoustics

 

• User interface, human—machine interaction, and dialogue system development experience

 

• Operational knowledge of languages other than English

 

MSc, ideally PhD in computer science, engineering, physics, mathematics, or other technical field

 

Craig Robertson

Recruitment Manager EMEA

 

Back  Top

6-10(2012-06-11) PhD Student 'Increasing Robustness of Speech Recognition' Radbout University Nijmegen NL

 

PhD Student 'Increasing Robustness of Speech Recognition' (1,0 fte)

Renewed Job opening !

Faculty of Arts Vacancy number: 23.02.12 Closing date: 7 July 2012

Responsibilities

As a PhD student you will participate in the FP7 Marie Curie Initial Training Network Investigating Speech Processing In Realistic Environments (INSPIRE). This network provides research opportunities for 13 PhD students and 3 postdocs. You will become a member of an international team of researchers whose aim is to gain a better understanding of how listeners recognize speech, even under non-ideal circumstances. You will contribute to urgently needed solutions that help alleviate the serious communication problems that arise, especially for older and hearing-impaired persons, when different combinations of 'adverse' conditions affect the speech processing system. You will conduct your research in the framework of of a project called ’Increasing robustness of speech recognition by using multiple signal representations’. Speech processing in the human brain presumably involves competition between multiple, intermediate signal representations. The redundancy of these different representations are assumed to help improve the robustness of recognition. In some cases, however, they may lead to conflicting interpretations resulting in intelligibility problems. The goal of this PhD project is to investigate to what extent human recognition errors with regard to speech in ’adverse’ conditions can be replicated by machines that were trained on multiple input representations which are partially redundant.

Work environment

The project will be carried out at the Centre for Language and Speech Technology (CLST). CLST is a research unit within the Faculty of Arts of Radboud University Nijmegen and hosts a large international group of senior researchers and PhD students who conduct research at the frontier of science and develop innovative applications.

What we expect from you

You should: - hold a Master's degree in engineering or science; - have a strong background in machine learning (experience with dynamic Bayesian networks would be an advantage), mathematical and/or statistical modelling, and signal processing; - have excellent programming skills; - be willing to spend several months at the Technical University of Denmark. Prior exposure to courses in linguistics or speech- or hearing-related fields would be an advantage. Furthermore, you should comply with the rules set forward by the FP7 Marie Curie ITNs, i.e. you should: - not have resided of performed your main research activity in the Netherlands for more than 12 months in the last three years; - be willing to work in at least one other country in the INSPIRE network; - have less than 4 years of research experience since you obtained your Master’s degree, and not hold a PhD.

What we have to offer

We offer you: - employment: 1,0 fte; - in addition to the salary: an 8% holiday allowance and an 8.3% end-of-year bonus; - the starting salary is €2,042 per month on a full-time basis; the salary will increase to €2,492 per month in the third year; - in addition to the salary, you will receive travel and training allowances on the basis of generous Marie Curie ITN provisions; - duration of the contract: 18 months with the possibility of extension by another 18 months.

Are you interested in our excellent employment conditions

(http://www.ru.nl/newstaff/working_at_radboud/conditions_of/)?

The Radboud University is an equal opportunity employer. Female researchers are strongly encouraged to apply for this vacancy.

Would you like to know more?

Further information on: Investigating Speech Processing In Realistic Environments (http://www.inspire-itn.eu/) Dr. Bert Cranen, assistant professor Speech science Telephone: +31 24 3612904 E-mail: B.Cranen@let.ru.nl

Applications

Are you interested? Please include with your application: - a CV; - a 2-page description of your research interests explaining why the INSPIRE goals appeal to you, how the INSPIRE team may benefit from your participation, and your career perspectives as expected from INSPIRE; - university transcripts; - names and email addresses of two potential referees (or alternatively letters of recommendation). It is Radboud University Nijmegen's policy to only accept applications by e-mail. Please send your application, stating vacancy number 23.02.12, to vacatures@let.ru.nl, for the attention of drs. M.J.M. van Nijnatten, before 7 July 2012. No commercial propositions ple

 

Back  Top

6-11(2012-06-11) Speech Recognition Scientist at Sunnyvale

Job Description

Title: 

Speech Recognition Scientist

Location: 

Sunnyvale

Status: 

Regular, Full-time, Exempt

We are a well-funded start-up with cutting-edge speech recognition with a disruptive technology platform applicable to a variety of markets and environments for spoken dialog interaction.  With an exciting mix of evolving speech-enabled products, we offer excellent opportunities for 'rock star' scientists to grow and share in our success.  We offer competitive compensation, excellent benefits and an ultra-creative work environment. 

We are currently seeking a talented speech recognition scientist to join our hands-on team developing our platform spoken dialog interactions. The ideal candidate has a proven track record for optimizing speech recognition performance. This work includes creating the necessary scripts and tools to experiment with novel algorithms to optimize recognition and natural language understanding throughout all stages of a multi-modal dialog system. Among others, you will be asked to work on statistical language modeling, as well as language model and acoustic model adaptation.

 

Responsibilities

Develop tools and enhance algorithms that facilitate deployment and tuning of spoken dialog systems

Analyze speech recognition performance andimplement solutions to provide optimum accuracy

Use, improve and create research tools to create, update and optimize language models and speech recognition systems for multiple domains

Evaluate and develop different language modeling and rescoring based on machine learning algorithms

Document language model development and adaptation process

Work with the team to design future product releases

 


Required Skills & Experience

Ph.D. or Master’s degree in computer science, electrical engineering, comp. linguistics, or equivalent

Speech and/or language processing background (in research and/or industry)

In-depth scripting experience with Python, Perl or similar

Ability to own and drive experimental definition, investigations and ultimately be responsible for the speech recognition performance

Passion for solving difficult problems

Strong planning and communication skills

Strong analytical and problem solving skills and ability to troubleshoot issues

 


Pluses:

Background in natural language processing, machine learning and/or computational linguistics

Programming experience in C/C++


Qualified candidates are encouraged to send your resume and cover letter to

swright@fluentialinc.com.   Fluential, LLC, is an Equal Opportunity Employer.  To learn more, please visit us online at http://www.fluentialinc.com

 

Back  Top

6-12(2012-06-11) Voice Developer (m/f) Speech Technology Automotive, Nuance Turin (Italy)

Nuance is a leading provider of speech and imaging solutions for businesses and consumers around the world. Our technologies, applications and services make the user experience more compelling by transforming the way people interact with information and how they create, share and use documents. Every day, millions of users and thousands of businesses, experience Nuance by calling directory assistance, getting account information, dictating patient records, telling a navigation system their destination, or digitally reproducing documents that can be shared and searched. Making each of those experiences productive and compelling is what Nuance is all about.

 

As a contributing member of Nuance, you will work within a dynamic team environment to develop, support, market and sell our award-winning software applications. We offer competitive compensation packages and a casual work environment. Join our dynamic, entrepreneurial team that operates worldwide (Europe, US, APAC). Be a part of our fast growing track of continuing success.

 

For more information, please see www.nuance.com.

Nuance is an equal opportunity employer.

 

For our office in Turin / Italy we are currently looking for a fulltime

 

 

Voice Developer (m/f)

Speech Technology Automotive

 

Key responsibilities.

  • ·         Develop voices for the range of Nuance Text-to-Speech Products.
  • Processing speech data for   text-to-speech speech data bases
  • Supervising native contractors in   annotation of speech databases
  • Creating speech   databases
  • Building text-to-speech   voices
  • Testing and product development of   the TTS voice
  • Testing and quality assessment of   the integrated system

Qualifications.

  • MSc degree in phonetics,   computational linguistics or another relevant field.
  • first working   experience
  • Working in Windows and Unix/Linux   environment
  • Strong sense of precision and   quality in your daily job
  • Fluent in   English
  • Understanding of phonological and   phonetic concepts
  • Ability to interpret   spectrograms
  • Ability to write high quality   documentation
  • Ability to work independently as   well as in a team
  • Good problem solving, analytic   skills and troubleshooting skills
  • Basic experience with scripting   languages like Perl or Python
  • Experience with acoustic   phonetics
  • Experience in text-to-speech voice   development
  • Speaking other   languages
  • Experience using tools such as   Audition or Praat

We offer.

At Nuance Communications we believe our people are our most valuable asset.

We offer competitive compensation packages and we offer you career development opportunities in a challenging technical but casual work environment.

As a Nuance team member you will work within a dynamic international team operating worldwide.

Does Nuance speak to you?

If you are interested in joining our team, please send your English CV including earliest starting date and salary expectations via our Recruiting tool https://jobs-nuance.icims.com/jobs/7804/job.

 

Nuance's Mobility Division builds innovative, intelligent and intuitive touch and speech interfaces to simplify and enhance the way people interact with mobile devices, applications, and services.  Nuance Mobile solutions make mobile devices and in-car systems easier to use, automate customer self-service, and optimize the access and discovery of even the most advanced mobile applications and content - regardless of technical know-how, location, environment, or physical and literacy capabilities.

 

 

ILONA ALEXANDRA HOLTZ

Recruiter - Employment Specialist DACH

Human Resources

Nuance Communications Aachen GmbH

Site Ulm

Soeflingerstr. 100

D-89077 Ulm, Germany

Fon       +49 731 - 379 50 1166

Fax       +49 731 - 379 50 1106 (Zentrale)

Mobil     +49 170 56 15 235

 

WWW.NUANCE.COMThe experience speaks for itself ™

Geschäftsführung/Director: Jan Anthierens

Sitz der Gesellschaft/Registered Office: Aachen

Registergericht/Court of Registration: Aachen

Reg. Nr.: HRB 11872

USt-ID/VAT: DE 813191696

This electronic transmission and any files transmitted with it are confidential. It is transmitted for the sole use of the person(s) to whom it is addressed. Any further distribution or copying is prohibited. If you receive this message in error, please inform the sender immediately, do not use it or disclose its contents and delete it from your system. Please note that Nuance cannot guarantee that the transmission will be secure or error-free.

 

Experience Nuance in the web: http://www.youtube.com/watch?v=32QbXebhiag&list=UUtmZ1Vk2yFJkOe1DYQwLgag&index=1&feature=plcp

Or

 

http://www.youtube.com/watch?v=RkiYr8aw5pE&feature=related

 

Back  Top

6-13(2012-06-11) Speech Output Designer (m/f) Speech Technology Automotive Nuance at Merelbeke Belgium

For our office in Merelbeke / Belgium we are currently looking for a fulltime

 

 

 

Speech Output Designer (m/f)

Speech Technology Automotive

 

 

Key responsibilities.

  • ·         Design and implement voice customizations for Nuance Text-to-Speech Products
  • Gathering and analyzing cutomer   requirements for custom voices
  • Defining strategies for optimized   speech output 
  • Designing text corpora for   recording
  • Developing User Dictionaries and   rules for customer applications
  • Training and supervising native   contractors in prompt tuning
  • Interacting with professional   services teams and contributing to customer project   success

Qualifications.

  • High School diploma or Bachelor   Degree in languages, phonetics, computational linguistics or another relevant   field
  • some years of work experience  
  • Working in Windows (and   Linux/Unix) environment
  • Language skills (grammar,   punctuation, spelling, phonetics, etc.)
  • Ability to appreciate acoustic and   prosodic quality of speech
  • Basic understanding of   Text-to-speech technology
  • Experience in TTS prompt   tuning
  • Ability to work in an   international team and to coordinate contractors
  • Fluent in   English
  • Ability to   write high quality documentation
  • Experience   with speech output applications
  • Experience with acoustic   phonetics
  • Basic experience with scripting   languages like Perl or Python
  • Speaking some other   languages

We offer.

At Nuance Communications we believe our people are our most valuable asset.

We offer competitive compensation packages and we offer you career development opportunities in a challenging technical but casual work environment.

As a Nuance team member you will work within a dynamic international team operating worldwide.

Does Nuance speak to you?

If you are interested in joining our team, please send your English CV including earliest starting date and salary expectations via our Recruiting tool https://jobs-nuance.icims.com/jobs/7801/job .

 

Nuance's Mobility Division builds innovative, intelligent and intuitive touch and speech interfaces to simplify and enhance the way people interact with mobile devices, applications, and services.  Nuance Mobile solutions make mobile devices and in-car systems easier to use, automate customer self-service, and optimize the access and discovery of even the most advanced mobile applications and content - regardless of technical know-how, location, environment, or physical and literacy capabilities.

 

 

ILONA ALEXANDRA HOLTZ

Recruiter - Employment Specialist DACH

Human Resources

Nuance Communications Aachen GmbH

Site Ulm

Soeflingerstr. 100

D-89077 Ulm, Germany

Fon       +49 731 - 379 50 1166

Fax       +49 731 - 379 50 1106 (Zentrale)

Mobil     +49 170 56 15 235

 

WWW.NUANCE.COMThe experience speaks for itself ™

Geschäftsführung/Director: Jan Anthierens

Sitz der Gesellschaft/Registered Office: Aachen

Registergericht/Court of Registration: Aachen

Reg. Nr.: HRB 11872

USt-ID/VAT: DE 813191696

This electronic transmission and any files transmitted with it are confidential. It is transmitted for the sole use of the person(s) to whom it is addressed. Any further distribution or copying is prohibited. If you receive this message in error, please inform the sender immediately, do not use it or disclose its contents and delete it from your system. Please note that Nuance cannot guarantee that the transmission will be secure or error-free.

 

Back  Top

6-14(2012-06-11) Speech Output Designer (m/f) Speech Technology Automotive at Nuance Turin Italy

For our office in Turin / Italy we are currently looking for a fulltime

 

 

Speech Output Designer (m/f)

Speech Technology Automotive

 

Key responsibilities.

  • ·         Design and implement voice customizations for Nuance Text-to-Speech Products
  • Gathering and analyzing cutomer   requirements for custom voices
  • Defining strategies for optimized   speech output 
  • Designing text corpora for   recording
  • Developing User Dictionaries and   rules for customer applications
  • Training and supervising native   contractors in prompt tuning
  • Interacting with professional   services teams and contributing to customer project   success

Qualifications.

  • High School diploma or Bachelor   Degree in languages, phonetics, computational linguistics or another relevant   field
  • some years of work experience  
  • Working in Windows (and   Linux/Unix) environment
  • Language skills (grammar,   punctuation, spelling, phonetics, etc.)
  • Ability to appreciate acoustic and   prosodic quality of speech
  • Basic understanding of   Text-to-speech technology
  • Experience in TTS prompt   tuning
  • Ability to work in an   international team and to coordinate contractors
  • Fluent in   English
  • Ability to   write high quality documentation
  • Experience   with speech output applications
  • Experience with acoustic   phonetics
  • Basic experience with scripting   languages like Perl or Python
  • Speaking some other   languages

We offer.

At Nuance Communications we believe our people are our most valuable asset.

We offer competitive compensation packages and we offer you career development opportunities in a challenging technical but casual work environment.

As a Nuance team member you will work within a dynamic international team operating worldwide.

Does Nuance speak to you?

If you are interested in joining our team, please send your English CV including earliest starting date and salary expectations via our Recruiting tool https://jobs-nuance.icims.com/jobs/7801/job .

 

Nuance's Mobility Division builds innovative, intelligent and intuitive touch and speech interfaces to simplify and enhance the way people interact with mobile devices, applications, and services.  Nuance Mobile solutions make mobile devices and in-car systems easier to use, automate customer self-service, and optimize the access and discovery of even the most advanced mobile applications and content - regardless of technical know-how, location, environment, or physical and literacy capabilities.

 

 

ILONA ALEXANDRA HOLTZ

Recruiter - Employment Specialist DACH

Human Resources

Nuance Communications Aachen GmbH

Site Ulm

Soeflingerstr. 100

D-89077 Ulm, Germany

Fon       +49 731 - 379 50 1166

Fax       +49 731 - 379 50 1106 (Zentrale)

Mobil     +49 170 56 15 235

 

WWW.NUANCE.COMThe experience speaks for itself ™

Geschäftsführung/Director: Jan Anthierens

Sitz der Gesellschaft/Registered Office: Aachen

Registergericht/Court of Registration: Aachen

Reg. Nr.: HRB 11872

USt-ID/VAT: DE 813191696

This electronic transmission and any files transmitted with it are confidential. It is transmitted for the sole use of the person(s) to whom it is addressed. Any further distribution or copying is prohibited. If you receive this message in error, please inform the sender immediately, do not use it or disclose its contents and delete it from your system. Please note that Nuance cannot guarantee that the transmission will be secure or error-free.

 

Back  Top

6-15(2012-06-11) Co Producer (m/f) Speech Technology Automotive at Nuance Turin Italy

For our office in Turin / Italy we are currently looking for a fulltime

 

 

Co Producer (m/f)

Speech Technology Automotive

 

Key responsibilities:

  • Produce audio recordings for   Nuance Text-to-Speech Products
  • Assisting with (coached) Casting   of Voice Talents for TTS
  • Assisting with the analysis of   Casting-recordings
  • Coaching Voice Talents for   TTS-recordings – in a Recording Studio and from   remote
  • Supervise Mother-tongue linguists   in a recording situation
  • Preparation of the scripts (in   collaboration with others)
  • Manage the data-flow of the   recorded output
  • Assistance with overhead   (contracts, negotiations)

Qualifications:

  • Degree in languages or another   relevant field
  • some years of working   experience
  • Experience with TTS-technology,   understanding the technology.
  • Experience with the coaching of   TTS-Voice Talents
  • Ability to understand on the fly   what we need from a VT, and to communicate this to the   VT
  • Excellent communicative and   collaborative skills
  • Very good knowledge of English +   some other languages
  • Experience in a Recording-studio   environment
  • Willing to   travel
  • Knowledge of recording software,   like Pro-Tools, Voxover, …
  • Linguistic   knowledge
  • Creative,   flexible
  • Even more languages (Spanish,   east-European, Asian..)

We offer.

At Nuance Communications we believe our people are our most valuable asset.

We offer competitive compensation packages and we offer you career development opportunities in a challenging technical but casual work environment.

As a Nuance team member you will work within a dynamic international team operating worldwide.

Does Nuance speak to you?

If you are interested in joining our team, please send your English CV including earliest starting date and salary expectations via our Recruiting tool https://jobs-nuance.icims.com/jobs/7799/job .

 

Nuance's Mobility Division builds innovative, intelligent and intuitive touch and speech interfaces to simplify and enhance the way people interact with mobile devices, applications, and services.  Nuance Mobile solutions make mobile devices and in-car systems easier to use, automate customer self-service, and optimize the access and discovery of even the most advanced mobile applications and content - regardless of technical know-how, location, environment, or physical and literacy capabilities.

 

 

ILONA ALEXANDRA HOLTZ

Recruiter - Employment Specialist DACH

Human Resources

Nuance Communications Aachen GmbH

Site Ulm

Soeflingerstr. 100

D-89077 Ulm, Germany

Fon       +49 731 - 379 50 1166

Fax       +49 731 - 379 50 1106 (Zentrale)

Mobil     +49 170 56 15 235

 

WWW.NUANCE.COMThe experience speaks for itself ™

Geschäftsführung/Director: Jan Anthierens

Sitz der Gesellschaft/Registered Office: Aachen

Registergericht/Court of Registration: Aachen

Reg. Nr.: HRB 11872

USt-ID/VAT: DE 813191696

This electronic transmission and any files transmitted with it are confidential. It is transmitted for the sole use of the person(s) to whom it is addressed. Any further distribution or copying is prohibited. If you receive this message in error, please inform the sender immediately, do not use it or disclose its contents and delete it from your system. Please note that Nuance cannot guarantee that the transmission will be secure or error-free.

Back  Top

6-16(2012-06-11) Tools Developer (m/f) Software Engineering C/C++ Speech Technology Automotive at Nuance Turin Italy

For our office in Turin / Italy we are currently looking for a fulltime

 

 

Tools Developer (m/f)

Software Engineering C/C++

Speech Technology Automotive

 

Key responsibilities:

  • Analyze requirements for improving   the Text-to-Speech Voice Building Process
  • Develop methodologies, scripts and   procedures to improve efficiency and quality
  • Develop speech analysis algorithms   to be applied in building text-to-speech voices
  • Adapt and extend existing in-house   voice building technologies in view of large-scale production   
  • Study and experiment automatic   learning and statistical approaches in order to minimize the need for manually   labeled data
  • Design and develop software   components to be used in local, networked or Internet-related tools for voice   building
  • Make sure tools are efficient and   easy to use and provide support to users
  • Document, test, debug and modify   software components of voice building tools

Qualifications:

  • Master degree in Electronic   Engineering / Computer Science / Computer Engineering
  • 2 or more years of medium to   large-scale application development through complete   lifecycle
  • Software Engineering   (C/C++);
  • Experience with process   optimization;
  • Signal Processing, voice   recognition, HMM, neural networks;
  • One or more scripting languages   (PHP, Python, Perl, Awk);
  • Experience working with software   versioning and revision control systems;
  • Comfortable working both   independently and as part of a large international   team;
  • Fluent in   English
  • Data   processing
  • Experience working with   Multithreading / Multithreaded Programming

We offer.

At Nuance Communications we believe our people are our most valuable asset.

We offer competitive compensation packages and we offer you career development opportunities in a challenging technical but casual work environment.

As a Nuance team member you will work within a dynamic international team operating worldwide.

Does Nuance speak to you?

If you are interested in joining our team, please send your English CV including earliest starting date and salary expectations via our Recruiting tool https://jobs-nuance.icims.com/jobs/7797/job .

 

Nuance's Mobility Division builds innovative, intelligent and intuitive touch and speech interfaces to simplify and enhance the way people interact with mobile devices, applications, and services.  Nuance Mobile solutions make mobile devices and in-car systems easier to use, automate customer self-service, and optimize the access and discovery of even the most advanced mobile applications and content - regardless of technical know-how, location, environment, or physical and literacy capabilities.

 

 

ILONA ALEXANDRA HOLTZ

Recruiter - Employment Specialist DACH

Human Resources

Nuance Communications Aachen GmbH

Site Ulm

Soeflingerstr. 100

D-89077 Ulm, Germany

Fon       +49 731 - 379 50 1166

Fax       +49 731 - 379 50 1106 (Zentrale)

Mobil     +49 170 56 15 235

 

WWW.NUANCE.COMThe experience speaks for itself ™

Geschäftsführung/Director: Jan Anthierens

Sitz der Gesellschaft/Registered Office: Aachen

Registergericht/Court of Registration: Aachen

Reg. Nr.: HRB 11872

USt-ID/VAT: DE 813191696

This electronic transmission and any files transmitted with it are confidential. It is transmitted for the sole use of the person(s) to whom it is addressed. Any further distribution or copying is prohibited. If you receive this message in error, please inform the sender immediately, do not use it or disclose its contents and delete it from your system. Please note that Nuance cannot guarantee that the transmission will be secure or error-free.

Back  Top

6-17(2012-06-11) Voice Manager (m/f) Speech Technology Automotive at Nuance Turin Italy

Voice Manager (m/f)

Speech Technology Automotive

 

 

Key responsibilities:

  • ·       Casting and coaching of Voice Talents
  • ·       Designing text corpora for recording and testing
  • ·       Processing speech data for text-to-speech speech data bases
  • ·       Building and testing text-to-speech voices
  • ·       Managing all technical aspects of a TTS voice development
  • ·       Supporting our professional services teams and contributing to customer project success

Qualifications:

  • MSc degree in Languages /   Computational linguistics / Electronic Engineering or another relevant   field
  • some work   experience
  • Working in a Windows and   Unix/Linux environment
  • Basic experience with scripting   languages like Perl or Python
  • Understanding of phonological and   phonetic concepts
  • Basic understanding of   Text-to-speech technology
  • Strong sense of precision and   quality in your daily job
  • Fluent in   English
  • Ability to write high quality   documentation
  • Experience with acoustic phonetics   preferred
  • Experience in text-to-speech voice   development would be an asset
  • Speaking some other languages   preferred

We offer.

At Nuance Communications we believe our people are our most valuable asset.

We offer competitive compensation packages and we offer you career development opportunities in a challenging technical but casual work environment.

As a Nuance team member you will work within a dynamic international team operating worldwide.

Does Nuance speak to you?

If you are interested in joining our team, please send your English CV including earliest starting date and salary expectations via our Recruiting tool https://jobs-nuance.icims.com/jobs/7795/job.

 

Nuance's Mobility Division builds innovative, intelligent and intuitive touch and speech interfaces to simplify and enhance the way people interact with mobile devices, applications, and services.  Nuance Mobile solutions make mobile devices and in-car systems easier to use, automate customer self-service, and optimize the access and discovery of even the most advanced mobile applications and content - regardless of technical know-how, location, environment, or physical and literacy capabilities.

 

 

 

 

ILONA ALEXANDRA HOLTZ

Recruiter - Employment Specialist DACH

Human Resources

Nuance Communications Aachen GmbH

Site Ulm

Soeflingerstr. 100

D-89077 Ulm, Germany

Fon       +49 731 - 379 50 1166

Fax       +49 731 - 379 50 1106 (Zentrale)

Mobil     +49 170 56 15 235

 

WWW.NUANCE.COMThe experience speaks for itself ™

Geschäftsführung/Director: Jan Anthierens

Sitz der Gesellschaft/Registered Office: Aachen

Registergericht/Court of Registration: Aachen

Reg. Nr.: HRB 11872

USt-ID/VAT: DE 813191696

This electronic transmission and any files transmitted with it are confidential. It is transmitted for the sole use of the person(s) to whom it is addressed. Any further distribution or copying is prohibited. If you receive this message in error, please inform the sender immediately, do not use it or disclose its contents and delete it from your system. Please note that Nuance cannot guarantee that the transmission will be secure or error-free.

Back  Top

6-18(2012-06-12) Senior research scientist at Pearson

Pearson has one defining goal: to help people progress in their lives through learning. We champion innovation and we invest in models for education that deliver on our promise for effective, accessible, and personal learning from early literacy, college and career readiness to professional education, through data informed instruction and inventive applications for mobile and digital learning.

Pearson, the world's leading learning company, has global-reach and market leading businesses in education, business, and consumer publishing and is listed on the London and New York stock exchanges (UK: PSON; NYSE: PSO). For more information, visit www.pearson.com.

Pearson is an Equal Opportunity and Affirmative Action Employer, and a member of E-Verify. All qualified applicants, including minorities, women, veterans, and people with disabilities are encouraged to apply.

 

Responsible for advancing the state of the company's automated grading technology while working with the sales, marketing and test development teams to solve practical problems from the field. The position demands a person who has broad interests and is motivated to design and implement improvements to the company’s system. Opportunities for further research in applicable areas are available.

Senior Research Scientist:

 

Responsibilities of this role:
• The whole R&D cycle (building language models, training acoustic models, building statistical models for measuring the performance, etc.) for automated grading of different test system.
• Design new algorithms for different purpose (such as improving the grading performance).
• Do data analysis for different requirements.
• Write different scripts and tools to support the sales, marketing and test development teams.
• Maintain grading system.

Qualifications:

 

Personality attributes/skills required:
• Knowledge of speech recognition, probabilistic systems, statistical models.
• Strong background in statistical modeling and machine learning.
• Extensive experience programming in C/C++.
• Proficiency with SQL, Perl and Matlab.
• Demonstrate a willingness to learn and apply a wide range of technologies
• Experience with linguistic and statistical analysis of natural language would be highly desirable.
• Ph.D. with 3-5 years of applied experience in the field

• Experience in designing and running ASR experiments using HTK a plus
• Strong written and verbal communication skills

 

Please apply online at www.pearsoned.com/careers

 

Back  Top

6-19(2012-06-12) Research scientist at Pearson

Pearson has one defining goal: to help people progress in their lives through learning. We champion innovation and we invest in models for education that deliver on our promise for effective, accessible, and personal learning from early literacy, college and career readiness to professional education, through data informed instruction and inventive applications for mobile and digital learning.

Pearson, the world's leading learning company, has global-reach and market leading businesses in education, business, and consumer publishing and is listed on the London and New York stock exchanges (UK: PSON; NYSE: PSO). For more information, visit www.pearson.com.

Pearson is an Equal Opportunity and Affirmative Action Employer, and a member of E-Verify. All qualified applicants, including minorities, women, veterans, and people with disabilities are encouraged to apply.

 

Research Scientist:

 

Responsible for advancing the state of the company's automated grading technology while working with the sales, marketing and test development teams to solve practical problems from the field. The position demands a person who has broad interests and is motivated to design and implement improvements to the company’s system. Opportunities for further research in applicable areas are available.

Day-to-day responsibilities of this role:
• The whole R&D cycle (building language models, training acoustic models, building statistical models for measuring the performance, etc.) for automated grading of different test system.
• Design new algorithms for different purpose (such as improving the grading performance).
• Do data analysis for different requirements.
• Write different scripts and tools to support the sales, marketing and test development teams.
• Maintain grading system.

Qualifications:

 

Personality attributes/skills required:
• Knowledge of speech recognition, probabilistic systems, statistical models.
• Strong background in statistical modeling and machine learning.
• Extensive experience programming in C/C++.
• Proficiency with SQL, Perl and Matlab.
• Demonstrate a willingness to learn and apply a wide range of technologies
• Experience with linguistic and statistical analysis of natural language would be highly desirable.
• Ph.D. or ABD (with graduation in 2012) in EE, CS or relevant areas with 0-2 yrs working experience
• Experience in designing and running ASR experiments using HTK a plus
• Strong written and verbal communication skills

 

Apply online at www.pearsoned.com/careers

Back  Top

6-20(2012-06-15) INESC-ID Open Positions, Lisbon Partugal

INESC-ID Open Positions
   
    INESC-ID invites applications for researchers, starting 2012. We are     interested in PhD researchers, fluent in English and with autonomous     research abilities. We expect that the candidate develop scientific     research in the topics described below.
   
   
       

The    Institution

Instituto de Engenharia de Sistemas e  Computadores, Investigação e Desenvolvimento em Lisboa (INESC-ID) is one of the         most dynamic research institutes in Portugal in the areas of communication and information technologies. The activity of INESC-ID is focused on the following  area: Interactive Intelligent Systems  (http://www.inesc-id.pt).

 

 

Job           Description

Successful candidates will be integrated in an existing research group at INESC-ID and will conduct research  focusing on:

 

  •           Development    and evaluation of human-systems,
  •           Natural  language understanding,
  •           Agents,
  •           Graphical modeling and visualization and robot relations.

 

By addressing the  creation of intelligent and  affective relations with machines, together with intelligent  visualization and         virtual environments, the candidate will develop techniques that  will advance  the state of the art in building dialogue, multi-modal  interaction and  affective relations with machines.

 

The work will be  carried out in one of the  three groups: Spoken Language Systems, Intelligent Agents and  Synthetic         Characters or Visualization and Multi-modal Interactions.

   

 

   

The position holder is expected to develop scientific research preferably within these topics, and will be encouraged to start their own projects in coordination with colleagues. Excellent candidates in related areas are also strongly encouraged to apply.

 

Qualifications Required          

Applicants should  hold a PhD, be fluent in  English and show evidence of autonomous research abilities. The  should be         willing to work in a team and also have a strong publication  record. The successful  candidate should propose an innovative research project with relevance for the  research area where she/he will be integrated.

 

Contract   Conditions

INESC-ID is an  equal opportunity employer that implements the principle of equal treatment and training  irrespective of         religion or belief, disability, age or sexual orientation in  employment. The contract corresponds to a salary of a Research Assitant  Professor in Portugal.   Successful applicants will be in post from August 2012.

 

Application  Deadline

July 15th  2012

 

Application   details

   

INESC-ID invites eligible individuals to submit their expressions of interest, which must include an application letter, a detailed CV, a 1-page outline of a proposed research program, and is highly recommended to include reference letters.

All documents  should be sent by email to applications@inesc-id.pt and/or snail mail to: Direcção INESC-ID, R. Alves Redol, 9, 1000-029  Lisboa, Portugal.

                           

Back  Top

6-21(2012-06-15) Language Processing Software Engineer at ONMOBILE

Language Processing Software Engineer

At ONMOBILE SA, an IT and Telecom VAS company, we hire at the earliest possible date an experienced natural language processing (NLP) software engineer for new research and development projects on automatic speech recognition, text processing, and multilingual question-answering systems.

 

We are looking for a software development engineer with NLP development or research background on either commercial or academic speech recognition systems. You will be familiar with and have practical experience in the following areas:

State-of-the-art NLP technologies (robust parsing, finite-state transducers FSTs, statistical language modelling etc.)

Semantic Web technologies like RDF, OWL, SPARQL

Ability to develop with Eclipse RCP

Strong programming skill with modern programming languages (C++, Java) and scripting languages

Expertise in speech recognition, acoustic modelling, and audio/video processing is a plus

You should have an engineer school or university degree in computer science or related disciplines. A PhD or an equivalent level of experience would be helpful.

 

An application-oriented perspective and a concern for customers with strong analytical and problem solving skills are required. You should be capable to work independently when needed.

 

Other skills: transparent behaviour, clarity of expression, ability to work in a multi-cultural team.

 

Languages: fluent in English, French

 

The position is based in Paris, France.

 

Contact by email to WenXuan TENG (

teng.wenxuan@onmobile.com)

Back  Top

6-22(2012-06-15) Speech and Audio Processing Software Engineer at ONMOBILE

Speech and Audio Processing Software Engineer

At ONMOBILE SA, an IT and Telecom VAS company, we are hiring at the earliest possible date an experienced Speech and Audio Processing Software Engineer for research and development projects on automatic speech recognition, text processing, and multilingual question-answering systems.

 

We are looking for a software development engineer with speech recognition and audio processing development or research background on either commercial or academic speech recognition systems. You will be familiar with and have practical experience in the following areas:

State-of-the-art speech recognition technologies (decoder, language models, acoustic models, signal processing) and their implementation within efficient recognition and training systems

Audio and signal processing for de-noising, acoustic feature extraction, audio fingerprinting, karaoke scoring etc.

Strong programming skill with modern programming languages (C++, Java) and scripting languages

Expertise in NLP technologies is a plus

You should have an engineer school or university degree in signal processing or computer science or related disciplines. A PhD or an equivalent level of experience would be helpful.

 

An application-oriented perspective and a concern for customers with strong analytical and problem solving skills are required. You should be capable to work independently when needed.

 

Other skills: transparent behaviour, clarity of expression, ability to work in a multi-cultural team.

 

Languages: fluent in English, French

 

The position is based in Paris, France.

Contact by email to WenXuan TENG (

teng.wenxuan@onmobile.com)

Back  Top

6-23(2012-06-17) 2-4 PhD positions in Speech Technology and Communication at KTH Stockholm Sweden

2-4 PhD positions in Speech Technology and Communication   

The goal of the positions is to contribute to the research foundation for speech technology in tomorrow's conversational systems.

Anticipated specializations

  • Data-driven Dialogue Management
  • Incremental Input Fusion and Understanding
  • Avatars that Interact Through Speech, Gesture or Sign Language
  • Novel Methods for Automatic Speech Recognition and Understanding

The positions include free tuition and are salaried 4 year employments, presently starting at 2800 euro/month increasing to 3400 euro/month for the last year.

To get information about how to apply for the positions go to http://www.speech.kth.se/vacancies/

Back  Top

6-24(2012-06-21) Mandarin TTS (Text To Speech) Manager / Expert / Research Engineer Positions at Nuance: 6 Location : Shanghai, China
Nuance Communications is a listed US$1.5B global software company and the world leader in speech, text and imaging solutions for businesses and consumers around the world, with aggressive growth plans in the Asia Pacific region.
 
Job title: Mandarin TTS (Text To Speech) Manager / Expert / Research Engineer Positions: 6 Location : Shanghai, China
 
Job description and requirements Overview: Nuance's Mobility Division builds innovative, intelligent and intuitive touch and speech interfaces to simplify and enhance the way people interact with mobile devices, applications, and services.  Nuance Mobile solutions make mobile devices and in-car systems easier to use, automate customer self-service, and optimize the access and discovery of even the most advanced mobile applications and content - regardless of technical know-how, location, environment, or physical and literacy capabilities. Responsibilities:
Reporting to TTS manager, the research scientist will conduct and lead innovative research and development on speech synthesis technologies for Asian languages. In the role as research scientist, your goal will be to continuously drive improvements and innovation to the Nuance Chinese TTS system, for commercial deployment in all types of markets and platforms. Representative tasks will include: •Design, implementation, evaluation, optimization and testing of new algorithms and tools for text-to-speech synthesis, for both signal generation and text processing/understanding. •Product integration supervision of proven innovation results •Defining the team's innovation and technical agenda, in cooperation with TTS management •Creation of demonstrators and evaluators of new technologies. Required skills: Digital speech processing, strong mathematics knowledge, excellent computer programming skills preferably in C, C++ and scripting languages, familiarity with different OS and computing platforms, excellent English and communication skills, strong team player, proven track record of achievements in Chinese TTS R&D, fluent Mandarin speaking Preferred skills: Hands-on experiences in one or more of these areas, TTS R&D, software engineering, natural language processing and understanding, project management, parametric and/or unit selection TTS development Education: PhD or Master degree in EE or CS from a well known university
 
You can also find the job details via searching job 7812 and 7806, 7808~7811 at http://www.nuance.com/company/careers/index.htm
 
Contact information: Further questions and resumes can be sent to Lily He, Recruiter for Nuance Greater China at lily.he@nuance.com Thank you for your time and consideration, I look forward to hearing from you!

Regards,

Lily He

 

Recruiter, GreaterChina

NUANCE COMMUNICATIONS, INC.

 
Back  Top

6-25(2012-06-30) Thèse CIFRE à Orange Labs (Issy les Moulineaux, Paris).

Thèse CIFRE à Orange Labs (Issy les Moulineaux, Paris).

Sujet de la thèse : Apprentissage par renforcement dans un système de dialogue incrémental

L’objectif général de la thèse est d’implanter, d’adapter, d’optimiser et d’intégrer un algorithme d’apprentissage par renforcement dans un système de dialogue incrémental.

La thèse produira une analyse extensive, proche de l’exhaustivité, de l’apprentissage par renforcement dans un environnement événementiel, domaine très peu exploré par la littérature scientifique actuellement. Cette analyse débouchera sur la réalisation de plusieurs algorithmes qui passeront un banc d’essai. L’algorithme le plus performant/prometteur fera l’objet d’une analyse plus poussée et sera intégrée à une application de dialogue expérimentale, pour démontrer son efficacité dans son environnement naturel.

Les défis scientifiques sont nombreux. La thèse rapproche plusieurs domaines scientifiques à la fois très pointus et hétérogènes : l’apprentissage automatique, les systèmes distribués, et le dialogue, qui est lui-même une discipline par nature multi-compétence, du traitement automatique de la langue naturelle au traitement du signal, en passant par la psychologie cognitive. Un premier défi est donc de prendre suffisamment de recul pour intégrer toutes ces composantes dans une seule image globale. Un second défi, plus mathématique, consiste à adapter les algorithmes d’apprentissage par renforcement à un modèle de décisions événementiel. Un troisième défi, de l’ordre de l’ingénierie, concerne l’intégration dans, et le développement d’une application de dialogue de bout en bout. Et enfin, un quatrième défi, auquel nous serons particulièrement attentifs, concerne le souci d’inscrire ce travail scientifique dans un objectif d’industrialisation, à long terme.

Contacts : Romain Laroche, Orange Labs (romain.laroche...at___orange.com) et Fabrice Lefevre, Université d'Avignon (fabrice.lefevre''''at***univ-avignon.fr).

Back  Top

6-26(2012-07-05) PhD at LIG (Grenoble-France)

PhD proposal : Collaborative Annotation of multi-modal, multi-lingual and multimedia documents
Project objective
This PhD will be proposed and funded in the context of the CHIST-ERA / ANR Camomile Project (Collaborative Annotation of multi-MOdal, MultI-Lingual and multi-mEdia documents) Human activity is constantly generating large volumes of heterogeneous data, in particular via the Web. These data can be collected and explored to gain new insights in social sciences, linguistics, economics, behavioral studies as well as artificial intelligence and computer sciences. In this regard, 3M (multimodal, multimedia, multilingual) data could be seen as a paradigm of sharing an object of study, human data, between many scientific domains. But, to be really useful, these data should be annotated, and available in very large amounts. Annotated data is useful for computer sciences which process human data with statistical-based machine learning methods, but also for social sciences which are more and more using the large corpora available to support new insights, in a way which was not imaginable few years ago. However, annotating data is costly as it involves a large amount of manual work, and in this regard 3M data, for which we need to annotate different modalities with different levels of abstraction is especially costly. Current annotation frameworks involve some local manual annotation, with the help sometimes of some automatic tools. The Camomile Project aims at developing a first prototype of collaborative annotation framework on 3M data, in which the manual annotation will be done remotely on many sites, while the final annotation will be localized on the main site. Furthermore, with the same principle, some systems devoted to automatic processing of the modalities (speech, vision) present in the multimedia data will help the transcription, by producing automatic pre-annotations.
PHD proposal
This PhD is dedicated to the proposal of semi-supervised and unsupervised methods for the annotation of MMM data. Different scenarios of semi-supervised annotations will be experimented, for different type of videos. More precisely, we shall study: ? innovative retraining / adaptation strategies to update the different systems using new annotations. Since we consider a real scenario where new annotations are produced continuously, we will specially focus on iterative learning techniques where models are updated instead of being fully retrained; ? new data selection methods for active learning strategies ; we will focus on active learning for multimodal and heterogeneous systems which makes the data selection task much more difficult. As a case study we shall focus our work on developing technologies in order to answer to the questions ?who is seen??, ?who is speaking?? in videos. Depending on the type of video and the feedback from the supervision group, we may extend our work to the automatic annotation of objects (?what is seen??) or activities (?what is going on??).
Required Skills
The applicant must have a master degree in either computer science or computer engineering and have some knowledge in speech, image or video processing and in machine learning. We also search for a candidate with very good programming skills.
LIG GETALP and MRIM collaboration
PHD work is to be carried out between the GETALP and MRIM teams of LIG. LIG / GETALP website http://getalp.imag.fr LIG / MRIM website http://mrim.imag.fr
Contacts Laurent Besacier Laurent.Besacier@imag.fr Georges Quénot Georges.Quenot@imag.fr
Targeted starting date: fall 2012

Back  Top

6-27(2012-07-08) Faculty position in Phonetic Science and Speech Technology at Nanjing Normal University, China

Faculty position in Phonetic Science and Speech Technology at Nanjing Normal University, China

(Urgent job announcement)

The Institute of Linguistic Science and Technology at Nanjing Normal University, China,

invites applications for a faculty position in the area of Phonetic Science and Speech

Technology. The position can be Lecturer, Associate Professor, or Professor, depending on

the qualifications and experience of the applicant.

Nanjing Normal University (NNU) is situated in Nanjing, a city in China not only

famous for its great history and culture but also pride for excellence in education and

academy. With Chinese-style buildings and garden-like environment, the Suiyuan Campus of

NNU is often entitled the “

Most Beautiful Campus in the Orient.”

Nanjing Normal University is among the top 5 universities of China in the area of

Linguistics. Placing strong emphasis on interdisciplinary research, the Institute of Linguistic

Science and Technology at NNU is unique in that it bridges the studies of theoretical and

applied linguistics, phonetics, cognitive sciences, neural sciences, and information

technologies. The phonetic laboratory is very well equipped, with sound-proof recording

studio, professional audio facilities, physiological instruments (e.g., WAVE system,

PowerLab, EGG, EPG, airflow and pressure module, and nasality sensor), EEG for ERP

studies, eye tracker, etc. The laboratory just organized an international symposium TAL 2012

www.TAL2012.org

very successfully at the end of May.

We welcome interested colleagues to join us. The research can cover any areas in

phonetic sciences and speech technologies, including but not limited to speech production,

speech perception, prosodic modeling, speech synthesis, automatic speech recognition and

understanding, spoken language acquisition, computer-aided language learning, and ERP

study for spoken languages. Outstanding research support will be offered.

Requirements:

* A PhD degree (or an expected one) in related disciplines (e.g., linguistics, psychology,

physics, applied mathematics, computer sciences, and electronic engineering);

* Good publication/patent record in phonetic sciences or speech technologies;

* Good oral and written communication skills in both Chinese and English;

* Team work spirit in a multidisciplinary group.

Interested candidates should submit a CV, a detailed list of publication, the copies of the best

two or three publications, and the contact information of two references to:

Prof. Wentao GU

Email:

wtgu@njnu.edu.cn; wentaogu@gmail.com

Phone: (office) +86-25-8359-8624, (mobile) +86-189-3687-2840

The position will keep open until it is filled. An earlier application is strongly recommended

Back  Top

6-28(2012-07-26) Offre de thèse en correction orthographique par traduction statistique, Univ. Le Mans, France
Offre de thèse financée au sein du laboratoire d'Informatique de l'Université du Maine (LIUM) dans le domaine de la correction orthographique automatique par méthodes de traduction statistique. Lieu : LIUM (Le Mans) Date : 1/10/2012 Durée : 3 ans Cette thèse s'inscrit dans le projet 'investissement d'avenir' PACTE, porté par l'entreprise Diadeis, et dont sont également partenaires l'équipe Alpage (INRIA et Paris 7), et les entreprises A2ia et Isako. PACTE a pour objectif l'amélioration de la qualité orthographique des textes issus de différentes méthodes de capture textuelle. L'accent est mis sur les sorties d'OCR (reconnaissance optique de caractères sur des textes imprimés scannés), mais concerne également des données obtenues par reconnaissance d'écriture manuscrite, par saisie manuelle, et par rédaction directe. Les techniques qui seront utilisées sont à la fois statistiques et hybrides, faisant usage d'outils et de ressources de linguistique computationnelle. Le domaine d'application principal du projet est celui de la numérisation du patrimoine écrit, dans un contexte multilingue. Une deuxième thèse démarrera à Alpage avec un accent sur l'utilisation des connaissances linguistiques pour aider à optimiser automatiquement ou quasi-automatiquement la qualité orthographique des textes. Dans le cadre du projet PACTE, une étroite collaboration aura lieu entre le LIUM, Alpage et la société Diadeis. Dans ce contexte, l'enjeu de la thèse au LIUM est d'analyser comment utiliser les techniques de traduction automatique statistique pour la correction d'erreur. En effet, on peut considérer la correction d'erreur comme un processus de passage d'une langue erronée vers une langue correcte. Une approche similaire a déjà été utilisée avec succès pour corriger les sorties des systèmes de traduction par règles, connue sous le nom 'statistical post-editing (SPE)'. Dans le cadre de cette thèse, il s'agit donc d'étudier comment une approche similaire peut être utilisée pour la correction orthographique. Un aspect important de cette thèse concerne le développement de modèles de langue efficaces, donnant de bons résultats avec une faible empreinte mémoire. Les modèles n-grammes à repli seront privilégiés, mais d'autres méthodes seront également explorées, notamment la modélisation dans l'espace continu (continuous space language models). Nous nous intéresserons aussi à l'intégration de connaissances morphosyntaxiques, en collaboration avec l'équipe Alpage. Les langues étudiées seront prioritairement le français et l'anglais, ainsi que l'allemand. Une application à l'espagnol, l'italien, voire d'autres langues européennes est possible. Profil recherché : - bonnes compétences en informatique (la maîtrise de Linux est indispensable, programmation en C++, utilisation de scripts, Perl, etc); - des connaissances en traduction automatique statistique sont souhaitées, ou, à défaut, en apprentissage automatique; - une expérience avec l'outil Moses est un plus. La thèse se déroulera au sein de l'équipe LST du LIUM. Le LIUM est connu au niveau international pour ses recherches dans le domaine de la traduction statistique, et nous avons de nombreuses collaborations avec des universités et entreprises en Europe et aux États-Unis. Contact : Holger Schwenk Holger.Schwenk@lium.univ-lemans.fr 
Back  Top

6-29(2012-08-03) PhD Studentship in Speaker Diarization at EURECOM, Sophia Antipolis, Alpes Maritimes, France

PhD Studentship in Speaker Diarization at EURECOM

Department: Multimedia Communications
URL:        http://www.eurecom.fr/mm
Start date: 01/10/12
Duration:   Duration of the thesis

Description:

EURECOM’s Multimedia Communications Department invites applications for a PhD studentship in speaker diarization within its Speech and Audio Processing Research Group.  Speaker diarization is commonly referred to as the task of detecting ‘who spoke when’ in a multiple-speaker audio signal.  In its most general form it is performed without any prior
knowledge regarding the number of speakers or speaker identities. applications include speech recognition, speaker recognition (biometrics), multimedia indexing, content structuring and general multimedia document processing.

As with any modelling or statistical pattern recognition task, performance is affected by unwanted nuisance variation and by the amount of data available for any given class.  In the case of speaker diarization performance  can be affected by background noise, varying
linguistic content and differences in speaker floor times.   Our recent work has developed new normalization approaches to marginalise linguistic variation in order to increase speaker discrimination and improve speaker diarization performance.

This fully-funded PhD position aims to extend this work to further improve the robustness of speaker diarization in the case of linguistic  variation and varying speaker floor times.  The work will develop a novel phone adaptive training algorithm and investigate other, new normalisation and marginalization approaches to improve speaker modelling.  The position is an opportunity to make a contribution in an increasingly important field of speech and audio processing.  You will join a small, but dynamic research group which participates in a growing number of European, national and industrially-funded research projects and will have the opportunity for international travel and participation in competitive evaluations.


Requirements:

The successful candidate will have a Master’s degree in engineering, mathematics, computing or a related, relevant discipline.  You will be highly motivated to undertake challenging research, have strong expertise in mathematics and programming and have excellent communication skills.  Knowledge of C/C++ and Matlab is strongly desirable.  Good English language speaking and writing skills are essential.  Knowledge of French is a bonus.  Application  Screening of applications will begin immediately, and the search will continue until the position is filled. Applicants should send, to the address below (i) a one page statement of research interests and motivation, (ii) your CV and (iii) contact details for three referees.

Applications should be submitted by e-mail to secretariat@eurecom.fr

Contact:         Dr. Nicholas Evans
Postal address:  2229 route des Crêtes
                 B.P. 193
                 06904 Sophia Antipolis
                 France
Email:           evans@eurecom.fr
Web page:        http://www.eurecom.fr/mm
Phone number:    +33 4 93 00 81 14
Fax number:      +33 4 93 00 82 00

EURECOM is a graduate school and a Research Centre in Communication Systems, located in Sophia Antipolis technology park, in close proximity with a large number of research units of leading multinational corporations in the telecommunications, semiconductor and biotechnology sectors, as well as other outstanding research and teaching institutions. EURECOM was founded in 1991 by TELECOM ParisTech (Ecole Nationale Supérieure des Télécommunications) and EPFL (Swiss federal institute of Lausanne) in a consortium form, combining academic and industrial partners.

EURECOM deploys its expertise around three major fields: Networking and security, Multimedia Communications and Mobile Communications and has a strong international scope and strategy. EURECOM is particularly active in research in its areas of excellence while also training a large number of doctoral candidates. Its contractual research is recognized across Europe and contributes largely to its budget.

Back  Top

6-30(2012-08-08) Poste de chercheur junior en sciences du langage-Univ Mons-Hainaut Belgique

Offre d’emploi : « Chercheur en sciences de la parole »

_____________________________________________________________________________________

 

Service de Métrologie et Sciences du langage, Laboratoire de phonétique,

 

Université de Mons, Mons, Belgique

_____________________________________________________________________________________

 

Le service de métrologie et sciences du langage  constitue une réserve ouverte de recrutement en vue de l’engagement possible dans des postes de chercheur junior (M/F) boursier en sciences de la parole.

Profil du candidat (M/F) :

  • Niveau à l’entrée : « bac +5» (master 300 crédits) au moins, ou équivalent au 1er octobre 2012.
  • Formation initiale attestant d’intérêt et de compétences dans le domaine des sciences du langage et plus spécialement des sciences de la parole.
  • Formation complémentaire en sciences de la parole soit accomplie soit concomitante à l’engagement.
  • Aptitude au travail en équipe, créativité, autonomie, curiosité scientifique.
  • Des compétences en traitement statistique des données, une maîtrise de l’anglais scientifique, de même que des connaissances des langues étrangères constituent des atouts complémentaires.

Profil de poste :

Le titulaire du poste (M/F) contribue aux efforts de recherche du Service dans le cadre de l’une des thématiques ressortissant à l’un des deux projets décrits en annexe. Il prépare une thèse de doctorat articulée avec l’un de ces projets. Il peut être amené à prendre une part mineure aux activités d’encadrement pédagogique du service.

Certains postes sont d’ores et déjà disponibles au service, d’autres sont susceptibles de lui être attribués moyennant une candidature au sein d’un concours (interne et/ou externe à l’UMONS), avec support du Service.

Bourse de recherche d’une durée de quatre ans, par tranches renouvelables d’un an, avec prise de fonctions au plus tôt le 1er octobre 2012.

Les personnes intéressées sont priées d’adresser au plus vite un dossier comportant :

  • une lettre de motivation,
  • un curriculum vitae,
  • la préférence (argumentée) pour l’un ou l’autre des deux projets décrits en annexe,
  • tout document jugé utile.

au format pdf (exclusivement) à l’adresse : bernard.harmegnies@umons.ac.be

Annexe 1 : Projet PAROLPATHOS

Titre du projet

 

Evaluation acoustique et auditive du signal de parole de locuteurs francophones en situation de handicap. Apports de la phonétique clinique au développement de procédures d’évaluation holistique du sujet communicant situé dans son écosystème.

 

Résumé et objectifs du projet

 

Les phénomènes acoustiques que produit le locuteur dans l’actualisation de son intention de communication ne comportent pas seulement la manifestation matérielle des formes linguistiques prescrites par les systèmes de la langue. Le signal de parole charrie aussi quantité d'éléments sans rapport avec les signes linguistiques, mais causalement liés à divers aspects du fonctionnement ou de l'état du locuteur, et susceptibles, dès lors, d'avoir valeur d'indices.

 

Dans les cas où le sujet se trouve en situation de handicap, des éléments liés à un contexte pathologique peuvent ainsi affleurer dans le signal de parole. Les productions vocales du locuteur peuvent dès lors porter la marque tant de problèmes localisés au niveau de l'une ou l'autre des procédures de traitement du langage que de difficultés non langagières mais ayant sur le langage des répercussions plus ou moins directes.

 

La phonétique clinique, discipline en émergence depuis deux décennies dans le monde anglo-saxon, et depuis quelques années seulement en Francophonie, se centre sur ces phénomènes avec le projet de mettre ses méthodes et techniques de laboratoire au service de la compréhension du fonctionnement du locuteur, là où l'existence d'une pathologie lui fait affronter la situation de communication comme situation de handicap. Aujourd'hui, si les approches de la phonétique clinique apparaissent, au crible de ces travaux, d'un très haut intérêt, force est cependant de reconnaître que l'état des connaissances scientifiques en la matière demeure embryonnaire et inégal. D'une part, les recherches se centrent le plus souvent sur des phénomènes observés dans des échantillons de parole de langue anglaise: la langue française fait, de ce point de vue, figure de parent pauvre. D'autre part, certains secteurs pathologiques ont fait l'objet de bien moins d'efforts que d’autres (sinon aucun). En outre, souvent, des équipes différentes ont recours à des méthodologies différentes, même si elles travaillent sur des cadres pathologiques identiques ; il est donc malaisé d'évaluer la fiabilité et la validité des différentes approches métrologiques. Par ailleurs, dans de nombreux cas, les approches quantitatives à base acoustique, quelque sophistiquées qu’elles soient, ne parviennent pas à une finesse clinique comparable à celles de l'expert humain. Enfin, la plupart des outils aujourd'hui disponibles s'assortissent de contraintes techniques et méthodologiques qui en rendent l'utilisation difficile en contexte écologique de communication.

 

Notre projet vise, en conséquence, à dégager une synthèse générale des moyens d’évaluation disponibles, valable pour les productions des locuteurs francophones s’exprimant dans des situations de handicap comportant une dimension pathologique. Il recourt à une large variété de tableaux cliniques (troubles de l’articulation, troubles de la fluence, laryngopathies acquises, carcinomes des voies aéro-digestives supérieures, pathologies neurologiques non-spécifiquement liées à la sphère du langage, vieillissement langagier physiologique vs. pathologique) et à un panel d'approchesméthodologiques diversifié, afin de pouvoir étudier le croisement des tableaux cliniques et des méthodologies dans un large territoire conceptuel. Il compare ces approches métrologiques à entrée acoustique aux évaluations formées par des auditeurs humains dotés de types et de niveaux d’expertise variables. Ce faisant, il vise non seulement à faire oeuvre d’intervalidation, mais interroge également les processus cognitifs permettant à l’observateur de construire et d’exercer son expertise. Il étudie en outre la généralisabilité des mesures et des évaluations pratiquées en contexte artificiel (laboratoire, hôpital) aux contextes écologiques de vie (famille, services, institutions) et propose une approche intégrée de l’évaluation de la contribution de la qualité communicationnelle à la qualité de vie.


 

 

Annexe 2 : Projet COGNIPHON

Titre du projet

 

 

Contrôle cognitif de la production des sons de parole en phase d’acquisition de la L2

 

Résumé et objectifs du projet

 

 

L’individu qui, ayant acquis la maîtrise du langage au travers de sa seule langue maternelle, souhaite en apprendre une autre, se trouve confronté à la nécessité de traiter, en L2, (en perception comme en production) des sons similaires à ceux de la L1 d’une manière différente (par exemple, certaines réalisations du /e/ de l’espagnol peuvent être acoustiquement fort similaires de celles du /ɛ/ du portugais),  voire de percevoir et produire en L2 des sons inexistants en L1 (par exemple, la production du /ʔ/ en arabe ne correspond à la réalisation d’aucun phonème de l’anglais). L’idée que, dans le cadre de cet apprentissage, le sujet recourt mal à propos aux stratégies qui lui sont habituelles en L1 a, depuis longtemps, inspiré les linguistes, les pédagogues et, singulièrement, les scientifiques intéressés à la cognition humaine, qui peuvent y voir la mise en œuvre inappropriée de stratégies routinisées par le biais de l’usage de la L1.

 

Les pédagogues de l’oral en langue étrangère (particulièrement le courant verbo-tonal), soucieux de prendre en charge cette propension de l’apprenant, ont proposé divers moyens d’intervention dans le cadre de démarches qu’il est de coutume de rassembler sous l’appellation correction phonétique. De cela résulte un corps technique rassemblant des procédés didactiques appuyés essentiellement sur l’expertise des praticiens.

Les enseignants de langue s’accordent certes en général sur l’intérêt de ces techniques. Néanmoins, l’étude objective non seulement de leur efficacité mais aussi, plus profondément, de leur mode de fonctionnement, n’a fait l’objet que de fort peu de développements, hormis les travaux de quelques trop rares équipes.

La situation actuelle est donc paradoxale, car même si les échanges verbaux sont désormais au centre des pratiques de classe en L2, la question du traitement pédagogique de l’acquisition des processus de gestion cognitive de la matière phonique a largement été scotomisée dans le domaine de la recherche empirique et très peu de connaissances fiablement basées sur des évidences expérimentales sont en fait disponibles. C’est d’autant plus regrettable que la demande sociale pour des prestations orales multilingues de qualité s’accroît continûment et que diverses officines vendent aujourd’hui au prix fort un savoir-faire souvent banal, prétendument fondé sur des évidences scientifiques, en vérité non démontrées.

Notre projet vise au développement d’un programme de recherche dont la finalité est précisément de contribuer à combler cette lacune par la mise en œuvre de dispositifs expérimentaux susceptibles d’apprécier le poids des divers éléments causaux impliqués dans l’acquisition de nouvelles compétences de contrôle phonique et susceptibles d’être appliqués à tout sujet, quelles que soient ses caractéristiques intrinsèques.

Si, dans son origine, notre réflexion part de constats opérés dans le cadre de situations concrètes d’enseignement-apprentissage et si, par ailleurs, elle se nourrit de l’expertise des enseignants, nous nous inscrivons cependant résolument dans une perspective de recherche fondamentale. Notre objet d’étude n’est autre que l’ensemble des facteurs extrinsèques susceptibles d’être manipulés en vue de favoriser la maîtrise, par l’apprenant de langue étrangère, de nouvelles possibilités de contrôle phonique, que ces facteurs aient ou non été identifiés et/ou délibérément exploités dans le cadre pédagogique. Nous nous trouvons donc bien ici au cœur de ces processus cognitifs “ mis en jeu au cours de l’acquisition, la perception, la compréhension et la production du langage parlé […] ” que Ferrand & Grainger (2004, p. 11) définissent précisément comme constituant l’objet-même des préoccupations scientifiques de la psycholinguistique cognitive.

 

 

 

 
 
Back  Top

6-31(2012-08-24) Post-docs call for application at Brain and Language Research Institute, Aix en Provence France

Brain and Language Research Institute

               Post-docs call for application

                      http://www.blri.fr

>>> Deadline extension

The 'Brain and Language Research Institute' is a new 'Laboratoire d'Excellence' at Aix-Marseille Université. It federates 6 top-level labs in the domain of language studies, neurosciences, cognitive psychology, medicine and computer science. BLRI is now starting different interdisciplinary research programs investigating language production and perception and its cerebral correlates.

Applications are sought for one-year positions of Postdoctoral Research Fellow. Five research subjects are proposed (see detailed presentation available from http://www.blri.fr), all to be started fall 2012.

1. Introducing humour in vocal human-machine interaction systems
   Domains: linguistics and computer science
   Location: Aix-en-Provence (LPL) or Avignon (LIA)
   Contact: bea.priego-valverde@lpl-aix.fr; fabrice.lefevre@univ-­avignon.fr

2. Handwriting sonification: a tool for early diagnostics and treatment of micrographia in Parkinson Disease
   Domains: neurosciences and medicine
   Location: Marseille (LNC)
   Contact: jean-luc.velay@univ-amu.fr; serge.pinto­@lpl-aix.fr

3. Recording and processing the vocalizations of baboons
   Domains: psychology and linguistics
   Location: Marseille (LPC)
   Contact: arnaud.rey@univ-amu.fr; thierry.legou@lpl-aix.fr; joel.fagot@univ-amu.fr

4. Oculomotor and Visuo-attentional Prerequisites to Reading Development in Preschool Children in 4- and 5-year-olds
   Domains: psychology and linguistics
   Location: Aix-en-Provence (LPL)
   Contact: stephanie.ducrot@lpl-aix.fr; jonathan.grainger@univ-amu.fr

5. Phonetic alignment: Analysis and simulation
   Domains: computer science and linguistics
   Location: Aix-en-Provence (LPL) or Avignon (LIA)
   Contact: brigitte.bigi@lpl-aix.fr; georges.linares@univ-­avignon.fr


Applications
------------
Candidates should send a detailed CV plus a 3-pages research project corresponding to one of the subjects. Elaborating the project should be done in coordination with the project supervisors (see contacts for each project).

  . Position: 1 year
  . Salary: 1,950euros net per month (all taxes deduced, excepted income tax)
  . Deadline: September 15th
  . Starting date: not later than November 2012

Application should be sent to philippe.blache@blri.fr with copy to nadera.bureau@blri.fr.

Back  Top

6-32(2012-08-30) Post-doc position at LIMSI-CNRS in the Spoken Language Processing group, Paris

Post-doc position at LIMSI-CNRS Post-doc positionin the Spoken Language Processing     group    

A post-doc position will be proposed at LIMSI-CNRS, in the       context of the ANR-funded CHIST-ERA CAMOMILE Project       (Collaborative Annotation of multi-MOdal, MultI-Lingual and       multi-mEdia documents).
   

   

Description

    Human activity is constantly generating large volumes of     heterogeneous data, in particular via the Web. These data can be     collected and explored to gain new insights in social sciences,     linguistics, economics, behavioural studies as well as artificial     intelligence and computer sciences.
    In this regard, 3M (multimodal, multimedia, multilingual) data could     be seen as a paradigm of sharing an object of study, human data,     between many scientific domains. But, to be really useful, these     data should be annotated, and available in very large amounts.     Annotated data is useful for computer sciences which process human     data with statistical-based machine learning methods, but also for     social sciences which are more and more using the large corpora     available to support new insights, in a way which was not imaginable     few years ago. However, annotating data is costly as it involves a     large amount of manual work, and in this regard 3M data, for which     we need to annotate different modalities with different levels of     abstraction is especially costly. Current annotation framework     involves some local manual annotation, with the help sometimes of     some automatic tools (mainly pre-segmentation).
    The project aims at developing a first prototype of collaborative     annotation framework on 3M data, in which the manual annotation will     be done remotely on many sites, while the final annotation will be     localized on the main site. Furthermore, with the same principle,     some systems devoted to automatic processing of the modalities     (speech, vision) present in the multimedia data will help the     transcription, by producing automatic annotations. These automatic     annotations are done remotely in each expertise point, which will be     then combined locally to produce a meaningful help to the     annotators.
    In order to develop this new annotation concept, we will test it on     a practical case study: the problem of person annotation (who is     speaking?, who is seen?) in video, which needs collaboration of high     level automatic systems dealing with different media (video, speech,     audio tracks, OCR, ...). The quality of the annotated data will be     evaluated through the task of person retrieval.
    This new way to envision the annotation process, should lead to some     methodologies, tools, instruments and data that are useful for the     whole scientific community who have interest in 3M annotated data.
   

Skills

    A PhD in a field related to the project is required.
   

Contacts

   

   

Agenda

   

         
  • Starting date:  Fall 2012
  •      
  • Duration of the project: 36 months
  •    
Back  Top

6-33(2012-08-30) A Post-doc position at Bruno Kessler Foundation, Center for Information Technology (Trento-Italy)

A Post-doc position is available in the Speech-acoustic scene analysis and interpretation - SHINE Unit at Bruno Kessler Foundation, Center for Information Technology.

The Bruno Kessler Foundation (FBK) conducts research activities in Information Technology, Materials and Microsystems, Theoretical Physics, Mathematics, Italian-Germanic historical studies, Religious studies and International Relations. Through its network, it also develops research in the fields of international relationships, conflict causes and effects, European economic institutions, behavioral economics and evaluative assessment of public policies.

Workplace description

The SHINE unit conducts research on acoustic signal processing and interpretation, mainly concerning speech signals acquired by multi-microphone systems in indoor environment. The research aims to progress in the scientific areas of Acoustic Scene Analysis and Speech Interaction under noisy and reverberant conditions, in particular with a speaker at distance from the microphones.

More information about SHINE unit is available at the following link: http://shine.fbk.eu 

 

Job description

The SHINE Research Unit is looking for a candidate to carry out research activities in the field of Distant Speech Recognition. Applications are invited for a post-doctoral researcher who will work under the DIRHA project funded by the EU (http://dirha.fbk.eu) and other internal research activities. This project aims to study voice-based systems in domestic environments supporting natural speech interaction using distant microphones e.g. for supporting motor-impaired persons. Main field of research are multi- channel acoustic processing, distant speech recognition and understanding, speaker identification and verification, spoken dialogue management.

page1image14568
page1image14840

Job requirements

  • PhD degree in computer science or engineering, involving speech processing;

  • background in one or more of the following areas: speech enhancement; noise robust speech recognition; adaptation techniques for acoustic modelling; experience of the design and construction of speech recognition systems; familiarity with software tools such as HTK, Kaldi or Praat; 

  • strong research track record with significant publications at leading international conferences or in journals;

  • skills in experimental work and development of algorithms;

  • highly motivated to undertake challenging applied research;

  • oral and written proficiency in English.

    In adherence to FBK's policy to promote equal opportunity and gender balance, in case of equal applications, female candidates will be given preference.

Employment

Type of contract: 30-month contract Number of position: 1 Gross salary: from 33.000 to 41.000 € per year (depending on the candidate’s experience)

Benefit: company subsidized cafeteria or meal vouchers, internal car park, welcome office support for visa formalities, accommodation, social security, etc., reductions on bank accounts, public transportation, sport, accommodation and language courses fees.

Start date: Autumn 2012 Place: Povo, Trento (Italy)

Application process

To apply online, please send your detailed CV (.pdf format) including a list of publications, a statement of research interests and contact information for at least 2 references. Please include in your CV your authorization for the handling of your personal information as per the Italian Personal data Protection Code, Legislative Decree no. 196/2003 June 2003.

Applications must be sent to jobs@fbk.eu

Emails should have the following reference code: SHINE_PostDoc2012_DSR

Application deadline: September 25th 2012

For more information, please contact: Maurizio Omologo (e-mail: omologo@fbk.eu)

Those candidates who will pass the preliminary curricula screening will be contacted shortly for an interview. Those applicants who will not be selected, will be notified of the exclusion at the end of the selection process.

Please note that FBK may contact short-listed candidates who were not selected for the current openings within a period of 6 months for any selection process for similar positions.

For transparency purposes, the name of the selected candidate, upon his/her acceptance of the position, will be published on the FBK website at the bottom of the selection notice. 

Back  Top

6-34(2012-09-07) 4 positions as Google's Dublin office as Speech Linguistic Project Managers
There are four temporary positions opening at Google's Dublin office as Speech Linguistic Project Managers for French, Italian, German and Spanish (see description below). The role would suit someone with an advanced degree in (Computational) Linguistics (Master's degree or Ph.D.) and a native speaker of one of these languages.
 
These positions were recently advertised on the Linguist List (http://linguistlist.org/jobs/get-jobs.cfm?JobID=98660&SubID=4551801) where all the relevant information can be found. A description can be found below as well.
Job title:
Speech Linguistic Project Manager (French, German, Italian, Iberian Spanish)
 
Job description:
As a Linguistic Project Manager and a native speaker of one of the target languages, you will oversee and manage all work related to achieving high data quality for speech projects in your own language.
 
You will be based in the Dublin office, managing a team of Data Evaluators and working on a number of projects towards Speech research: ASR, TTS, and NLP
 
This includes:
- managing and overseeing the work of your team
- creating verbalisation rules, such as expanding URLs, email addresses, numbers
- providing expertise on pronunciation and phonotactics
- building and maintaining a database of speech recognition patterns
- creating pronunciations for new lexicon entries, maintaining the lexicon
- working with QA tools according to given guidelines and using in-house tools
 
Job requirements:
- native-level speaker of one of the target languages (with good command of the standard dialect) and fluent in English
- keen ear for phonetic nuances and attention to detail; knowledge of the language's phonology 
- must have attended elementary school in the country where the language is spoken 
- ability to quickly grasp technical concepts
- excellent oral and written communication skills
- good organizational skills, previous experience in managing external resources
- previous experience with speech/NLP-related projects a plus
- advanced degree in Linguistics, Computational Linguistics preferred
- also a plus: proficiency with HTML, XML, and some programming language; previous experience working in a Linux environment
 
Project duration: 6-9 months (with potential for extension)
 
For immediate consideration, please email your CV and cover letter in English (PDF format preferred) with 'Speech Linguistic Project Manager [language]' in the subject line.
 
Email Address for Applications: DataOpsMan@gmail.com 
Contact Person: Linne Ha
Closing date: open until filled
Back  Top

6-35(2012-09-26) Speech Recognition/Machine Learning Engineers ar Cantab Research, Cambridge,UK

Speech Recognition/Machine Learning Engineers

Cantab Research was founded in 2006 and exists to supply automatic speech recognition products to a wide variety of customers.   It has previously
supplied SpinVox (now Nuance) to create a voicemail to text system that handles over a million calls a day.  It currently supplies speech recognition to a medical transcription company and to several other companies with high potential novel applications.

We are expanding and seeks a graduate/post-graduate with the following skills:

* Good degree in a numerate discipline (ideally MSc/MPhil/PhD)
* Excellent analytical ability, ability to conduct research experiments as well as develop code
* Practical experience of machine learning on large datasets
* C/C++, perl/python, matlab/octave, Linux
* Previous experience in speech recognition or large scale web information mining an advantage
* Keen to learn automatic speech recognition operation and applications

This role offers an interesting combination of the application of research, experimentation, and product development. The post involves extending Cantab's existing speech recognition software and applying it to new tasks. You will be joining a small but rapidly expanding team and enjoy the challenges and rewards of a startup culture.

Location: Cambridge, UK.  Full/part time and employment/contracting may be negotiated depending on experience.

Contact: Dr Tony Robinson (tonyr@cantabResearch.com)

Back  Top

6-36(2012-10-05) ASSOCIATE RESEARCH SCIENTIST POSITION at ETS Princeton, NJ, USA

ASSOCIATE RESEARCH SCIENTIST POSITION

Speech

Educational Testing Service

Headquartered in Princeton, NJ, ETS is the world’s premier educational measurement institution and a leader in

educational research. As a nonprofit corporation and an innovator in developing tests for clients in education,

government, and business we are dedicated to advancing educational excellence for the communities we serve.

ETSs Research & Development division has an opening for a researcher in the NLP & Speech Group. The Group

currently consists of about 15 Ph.D. level research scientists in areas related to NLP and speech. Its main focus is on

foundational research as well as on development of new capabilities to automatically score written and spoken test

responses in a wide range of ETS test programs including TOEFL(R)iBT and GRE(R).

PRIMARY RESPONSIBILITIES

Provide scientific and technical skills in conceptualizing, designing, obtaining support for, conducting,

managing, and disseminating results of research projects in the field of speech technology, or portions of largescale

research studies or programs in the same field.

Develop and/or modify speech theories to conceptualize and implement new capabilities in automated scoring

and speech-based analysis and evaluation systems which are used to improve assessments, learning tools and test

development practices.

Apply scientific, technical and software engineering skills in designing and conducting research studies and

capability development in support of educational products and services.

Develop and oversee the conduct of selected portions of research proposals and project budgets.

Design and conduct complex scientific studies functioning as an expert in major facets of the projects.

Assist in the conduct of research projects by accomplishing directed tasks according to schedule and within

budget.

Participate in dissemination activities through the publications of research papers, progress and technical reports,

the presentation of seminars or other appropriate communication vehicles.

Develop professional relationships as a representative, consultant or advisor to external advisory and policy

boards and councils, research organizations, educational institutions and educators.

REQUIREMENTS

A Ph.D. in Language Technologies, Natural Language Processing, Computer Science or Electrical Engineering, with

strong emphasis on speech technology and preferably some education in linguistics is required.

Evidence of substantive research experience and/or experience in developing and deploying speech capabilities is

required. Demonstrable contributions to new and/or modified theories of speech processing and their implementation in

automated systems. Demonstrable expertise in the application of speech recognition systems and fluency in at least one

major programming language (e.g., Java, Perl, C/C++, Python).

HOW TO APPLY

Please apply online at

www.ets.org/careers – position #124337. ETS offers competitive salaries, outstanding benefits, a

stimulating work environment, and attractive growth potential. ETS is an Equal Opportunity, Affirmative Action Employer.

Back  Top

6-37(2012-10-05) Researcher in Speech Technology at Vicomtech-IK4, San Sebastian, Spain

Researcher in Speech Technology
Vicomtech-IK4, an international applied research centre in Visual Interaction and Communication Technologies located in San Sebastian (Spain) is looking for a Researcher in Speech Technology. We are looking for someone who combines experience in speech processing and software engineering, with research interests in multilingual Speech Recognition, Speech Synthesis, Voice Transformation and Conversion, and also motivated by the transfer of this knowledge into real world applications by building advanced research prototypes developed to solve real needs. The selected candidate will have an important role in the Human Speech and Language Technologies Department, including advanced research, project management responsibilities and technical leadership in high profile projects.
Requirements: - Masters/Ph.D. degree (or equivalent) in Speech Technology or related field - Experience in Speech Recognition, Speech Synthesis, Voice Transformation or Voice Conversion is desired. - Good written and spoken Spanish and English. Any other languages will be valued.
If you are: - An accomplished researcher with management abilities and interests. - A team player with an ambitious and creative personality.
We offer: - A multi-cultural research environment. - A multidisciplinary research team. - A group active in the international Human Speech and Language Technologies research field. - The opportunity to develop management as well as research skills.
To apply, please submit your CV and a cover letter describing your experience and interest in the position to:
Dr.-Ing. Jorge Posada (Associate Director) E-mail: jposada@vicomtech.org Telephone: +34 943 30 92 30 http://www.vicomtech.org
Deadline for submission: Open until filled

Back  Top

6-38(2012-10-10) Dolby Research Beijing looking for world-class talent!

Dolby Research Beijing looking for world-class talent!

 

Be part of the exciting future of entertainment and add your talents to those of an amazing team. For more than 40 years, Dolby has led the way in developing innovative entertainment products and technologies used by consumers and professionals worldwide. Innovations from Dolby can be heard in consumer audio and video products, entertainment software, and professional sound applications, including music recording, broadcasting, and sound for motion pictures.


Our company philosophy encourages creativity, collaboration and a strong focus on creation, development and delivery of innovative technology solutions that enhance the entertainment experience. Our team-oriented research environment offers the opportunity for market-savvy innovators to apply their theoretical knowledge, awareness of technology trends and alertness to emerging market opportunities to help create technology solutions that are broadly applied in the marketplace through Dolby’s global market reach. We offer great benefits, including an assortment of life insurance and health coverage options, and the opportunity for innovators to make a difference and to experience the satisfaction of seeing technology solutions to which they contributed, in the marketplace!

 

 

Senior Research Engineer

(Audio/speech algorithm architecture and design)

Dolby Sound Technology Research, Beijing

 

 

 

Position Summary

 

This position is in the Research Organization of Dolby Laboratories (www.dolby.com) and is located inBeijing, China. The senior staff research engineer position focuses on the creation of audio signal processing technologies including the whole range of research from the underlying theoretical concepts to the development of prototypes that provide a proof of concept. As a part of an international team, the senior staff research engineer will work on ideas exploring new horizons in the audio processing, analysis, replay and organization. The researcher is responsible for performing fundamental new research, transfer technology to product groups, and draft patent applications. The position includes project leaderships for projects being part of Dolby’s global technology initiatives. This requires efficient interactions which different functional divisions within the company. The position also requires the mentoring for more junior staff.

 

Dolby Laboratories is looking for a self-motivated, highly talented individual interested in applying his or her skills in technologies involving a fundamental understanding of the way that sound sources (audio and voice) are captured, manipulated, coded, delivered, enhanced and played back using digital signal processing techniques. Applications include pre-processing, coding and post-processing solutions in market areas such as consumer electronics, mobile, broadcast, PC and digital cinema applications and other technologies crucial to Dolby Laboratories’ success. The position involves working in cooperation with other technology developers/researchers within Dolby’s global research network, and the opportunity to propose new ideas for further investigation.

 

 

Education, Skills, Abilities, and Experience Required

 

  • M.S.E.E. (required) plus 3 years of applicable, hands-on commercial experience (strongly preferred), or Ph.D. in Electrical Engineering (desired) plus 3 years of closely relevant academic post-doc Research and Development experience
  • Demonstrated ability to create fundamentally new, novel (patentable) signal processing technologies and to envision applications those technologies in the form of innovative product solutions
  • Strong innovator
  • Project leadership skills
  • Mentoring skills
  • Experienced in global project and collaboration work
  • Proficient in advanced theory and application of audio signal processing techniques
  • Highly skilled in C/C++ language and Matlab programming
  • Team-oriented work ethic and interest to work in cross-continental teams
  • Strong personal interest sound technologies and in learning, researching, and creating relevant new technologies with high commercial impact
  • Independent, self-motivated worker requiring minimal supervision
  • Fluent in Chinese and English. Excellent communication skills
  • Good understanding of general acoustics

 

Strongly Desired

 

  • Experience working in a software development team, including software version control tools
  • Real-time windows programming
  • Real-time audio processing
  • Willing to do occasional international travel
  • Personal interest in audio in entertainment applications

 

Please send your English and Chinese resume to cb@dolby.com

 

Back  Top

6-39(2012-10-11) Faculty Position at the Center for Spoken Language Understanding, Portland, Oregon


    Job title: Assistant, Associate, or Full Professor
    Institution: Oregon Health & Science University, Portland, Oregon
    Department: Center for Spoken Language Understanding
           

The Institute on Development & Disability/Center for Spoken Language Understanding invites applications at all ranks for a faculty position in Natural Language Processing, to include technologies for analysis of speech, language, or both. Special interest in applications to behavioral  manifestations of neurological disorders is essential.

       

The primary interest is to extend our existing program in developing behavioral technologies that allow early detection and remediation of a wide range of neurological disorders, in  including Autism and Parkinson’s.

       

The Institute on Development & Disability/ Center for Spoken Language Understanding is at the forefront of this new, exciting area of research. The faculty member will be expected to teach courses supporting the research program and appropriate background areas such Machine Learning and Computational Linguistics. We seek a researcher with a well-developed program in Natural Language Processing, to collaborate with the CSLU team and with clinicians throughout OHSU. The appointee will be expected to maintain an independent, extramurally funded research program.

       

Requirements:          

  •  Ph.D.         
  •      
  • Experience with Computational Linguistics, Machine Learning and Natural Language Processing.         
  •    

   

Please contact: Jan van Santen, vansantj@ohsu.edu

    
   

Back  Top

6-40(2012-10-11) Research Programmer at the College of Pharmacy at the University of Minnesota
 
 
Brief Description: The College of Pharmacy at the University of Minnesota is seeking a talented, pro-active and innovative individual for a Research Programmer position to work on several projects in Center for Clinical and Cognitive Neuropharmacology (C3N). C3N is engaged in conducting interdisciplinary research focused on cognitive effects of medications and neurodegenerative disorders such as Alzheimer's disease. Computerized assessment is used to measure these cognitive effects. The successful candidate for this position will be responsible for a variety of computer-related tasks including creating and maintaining innovative computer-based neuropsychological testing applications that involve spontaneous speech and language collection and analysis. The successful candidate will also be responsible for creating and maintaining databases used to store and organize experimental samples and web-enabled interfaces to the databases and data analysis tools. The successful candidate will also be expected to work with graduate and undergraduate students on specific programming and research projects to meet the needs of the Center.  
 
 
Full Description is available here on the official University of Minnesota job posting site:
Back  Top



 Organisation  Events   Membership   Help 
 > Board  > Interspeech  > Join - renew  > Sitemap
 > Legal documents  > Workshops  > Membership directory  > Contact
 > Logos      > FAQ
       > Privacy policy

© Copyright 2024 - ISCA International Speech Communication Association - All right reserved.

Powered by ISCA