ISCApad #216 |
Tuesday, June 14, 2016 by Chris Wellekens |
6-1 | (2015-12-02) Master2 position at Multispeech Team, LORIA (Nancy, France)
Master2 position at Multispeech Team, LORIA (Nancy, France) Automatic speech recognition: contextualisation of the language model based on neural networks by dynamic adjustment Framework of ANR project ContNomina The technologies involved in information retrieval in large audio/video databases are often based on the analysis of large, but closed, corpora, and on machine learning techniques and statistical modeling of the written and spoken language. The effectiveness of these approaches is now widely acknowledged, but they nevertheless have major flaws, particularly for what concern proper names, that are crucial for the interpretation of the content. In the context of diachronic data (data which change over time) new proper names appear constantly requiring dynamic updates of the lexicons and language models used by the speech recognition system. As a result, the ANR project ContNomina (2013-2017) focuses on the problem of proper names in automatic audio processing systems by exploiting in the most efficient way the context of the processed documents. To do this, the student will address the contextualization of the recognition module through the dynamic adjustment of the language model in order to make it more accurate. Subject Current systems for automatic speech recognition are based on statistical approaches. They require three components: an acoustic model, a lexicon and a language model. This stage will focus on the language model. The language model of our recognition system is based on a neural network learned from a large corpus of text. The problem is to re-estimate the language model parameters for a new proper name depending on its context and a small amount of adaptation data. Several tracks can be explored: adapting the language model, using a class model or studying the notion of analogy. Our team has developed a fully automatic system for speech recognition to transcribe a radio broadcast from the corresponding audio file. The student will develop a new module whose function is to integrate new proper names in the language model. Required skills Background in statistics and object-oriented programming. Localization and contacts Loria laboratory, Multispeech team, Nancy, France Irina.illina@loria.frdominique.fohr@loria.fr Candidates should email a detailed CV and diploma References [1] J. Gao, X. He, L. Deng Deep Learning for Web Search and Natural Language Processing , Microsoft slides, 2015 [2] X. Liu, Y. Wang, X. Chen, M. J. F. Gales, and P. C. Woodland. Efficient lattice rescoring using recurrent neural network langage models, in Proc. ICASSP, 2014, pp. 4941?4945. [3] M. Sundermeyer, H. Ney, and R. Schlüter. From Feedforward to Recurrent LSTM Neural Networks for Language Modeling. IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 23, number 3, pages 517-529, March 2015.
| |||||||
6-2 | (2015-12-03) ,PostDoc position in the field of automatic speaker recognition (ASR) at Idiap, Martigny, Switzerland The Idiap Research Institute (http://www.idiap.ch ) seeks qualified candidates for one
| |||||||
6-3 | (2015-12-12) Ph.D. Position in Speech Recognition at Saarland University, Germany Ph.D. Position in Speech Recognition at Saarland University
The spoken Language Systems group from Saarland University in Germany anticipates the availability of a Ph.D. position in the area of speech recognition. This position is part of the Horizon 2020 project MALORCA, a research project on long term unsupervised adaptation of the acoustic and the language models of a speech recognition system. The research will be carried out together with a European consortium of high-profile research institutes and companies. Requirements:
Salary:
Research at Saarland University: Saarland University is one of the leading European research sites in computational linguistics and offers an active, stimulating research environment. Close working relationships are maintained between the Departments of Computational Linguistics and Computer Science. Both are part of the Cluster of Excellence, which also includes the Max Planck Institutes for Informatics (MPI-INF) and Software Systems (MPI-SWS) and the German Research Center for Artificial Intelligence (DFKI).
Each application should include: Curriculum Vitae including a list of relevant research experience in addition to a list of publications (if applies).
Applications (documents in PDF format in a single file) should be sent no later than, Sunday, January 10th to: sekretariat@LSV.Uni-Saarland.De
Further inquiries regarding the project should be directed to: Youssef.Oualil@LSV.Uni-Saarland.De or Dietrich.Klakow@LSV.Uni-Saarland.De
| |||||||
6-4 | (2015-12-12) PostDoc Position in Speech Recognition at Saarland University, Germany PostDoc Position in Speech Recognition at Saarland University
The spoken Language Systems group from Saarland University in Germany anticipates the availability of a PostDoc position in the area of speech recognition. This position is part of the Horizon 2020 project MALORCA, a research project on long term unsupervised adaptation of the acoustic and the language models of a speech recognition system. The research will be carried out together with a European consortium of high-profile research institutes and companies. Requirements:
Salary:
Research at Saarland University: Saarland University is one of the leading European research sites in computational linguistics and offers an active, stimulating research environment. Close working relationships are maintained between the Departments of Computational Linguistics and Computer Science. Both are part of the Cluster of Excellence, which also includes the Max Planck Institutes for Informatics (MPI-INF) and Software Systems (MPI-SWS) and the German Research Center for Artificial Intelligence (DFKI).
Each application should include: Curriculum Vitae including a list of relevant research experience in addition to a list of publications (if applies).
Applications (documents in PDF format in a single file) should be sent no later than, Sunday, January 10th to: sekretariat@LSV.Uni-Saarland.De
Further inquiries regarding the project should be directed to: Youssef.Oualil@LSV.Uni-Saarland.De or Dietrich.Klakow@LSV.Uni-Saarland.De
| |||||||
6-5 | (2015-12-14) Ussher Assistant Professor in Irish Speech and Language Technology (IRL) School of Linguistic, Speech and Communication Sciences
Ussher Assistant Professor in Irish Speech and Language Technology
The Ussher Assistant Professor in Irish Speech and Language Technology in the School of Linguistic, Speech and Communication Sciences will lead the development of the Irish Speech and Language Technology Research Centre (ITUT). This research will embed innovative speech and language technology resources in pedagogically sound language learning applications and in assistive technologies, enhancing teaching and learning of Irish nationally and globally. The appointee will have a strong background in speech-language technology, a specialisation in technology-assisted language learning, and a track record in linguistics research. Outreach and dissemination are key features of this post. The appointee will contribute to teaching and research supervision. A high level of competence in Irish language is desirable.
Appointment will be made at a maximum of the 8th point of the New Assistant Professor Merged Salary Scale.
Candidates wishing to discuss the post informally should contact: Professor Elaine Uí Dhonnchadha, E-mail: uidhonne@tcd.ie
Applications will only be accepted through e-recruitment
Further information and application details can be found at: https://jobs.tcd.ie
Closing date for receipt of completed applications is: no later than 12 Noon GMT on Thursday 14th January 2016
| |||||||
6-6 | (2015-12-16) Stage à Avignon, France: Synchronisation automatique de sous-titrages pour le spectacle vivant Sujet de stage Master « Synchronisation automatique de sous-titrages pour le spectacle vivant » Encadrant : Jean-François Bonastre (jean-francois.bonastre@univ-avignon.fr) Ce stage a pour objet de réaliser une étude de faisabilité pour un dispositif de synchronisation automatique de sous-titrages pour le spectacle vivant et, plus particulièrement, pour le théâtre. Les textes intégraux initiaux correspondant à la pièce jouée sont ramenés à des sous-titres de longueur plus réduite, pour permettre un affichage et une lecture en temps réel sur une tablette ou des lunettes de réalité virtuelle. Plusieurs versions des sous-titres sont réalisées dans plusieurs langues. Ces sous-titres sont manuellement synchronisés avec le texte de référence. L’objectif principal visé par le projet est de permettre aux théâtres français de pouvoir proposer leurs programmations à des publics non francophones et d’ouvrir ainsi l’ensemble du répertoire théâtral en langue française aux 80 millions de visiteurs étrangers qui viennent chaque année en France. Le travail demandé consiste à réaliser une maquette fonctionnelle de l’application de synchronisation automatique. Basée sur la plateforme ALIZE de reconnaissance du locuteur, cette application cherchera à reconnaitre en temps réel l’acteur en train de parler et utilisera la succession des prises de parole pour synchroniser le live avec les sous-titres. L’application sera développée en C++ et en python. Des compétences solides en développement logiciel sont demandées. Des connaissances en traitement automatique de la parole, en traitement du signal et en apprentissage automatique sont également souhaitées, sans que cette liste soit impérative ou exclusive. Ce projet est présenté en partenariat étroit avec un centre spécialisé dans les applications numériques pour le monde du théâtre. Ce partenaire apportera ses compétences, les données servant à réaliser et à évaluer la maquette ainsi que les théâtres partenaires (déjà identifiés) volontaires pour tester l’application « in situ ». Une poursuite en thèse de Doctorat est souhaitée. Le stage est prévu pour le premier semestre 2016.
| |||||||
6-7 | (2015-12-17) Faculty position at the Associate,Professor level in machine learning, Telecom Paristech, France, Faculty position at the Associate,Professor level in machine learning
| |||||||
6-8 | (2015-12-19) GENERAL EDUCATION INSTRUCTORS/Speech Communications , SC-Columbia, USA
Job Overview JOB TITLE: GENERAL EDUCATION INSTRUCTORS/Speech Communications JOB TYPE: Part-Time LOCATION: US-SC-COLUMBIA DEPARTMENT: Academics SUPERVISORY: No TRAVEL REQ’D: No Job Description If you’re a dedicated, enthusiastic, experienced speech communications professional, preferably with teaching experience, who believes in the power of sharing your knowledge, motivating others, and putting students first, we want to hear from you! We’re looking for talented general education instructors to join the academic team at our Columbia Campus for our day and evening class sessions. These individuals will report to the Campus’s Degree Program Department Chairperson. Essential Duties/Responsibilities: Educates and trains students in his or her field of expertise using accepted and approved instructional methodology. Prepares lesson plans using industry-standard approaches (e.g., multimedia, adult learning methodology). Teaches courses as assigned, instructs and evaluates students, develops students’ skills and encourages growth, and tracks their attendance, performance, and grades. Participates in various administrative activities (e.g., attends faculty/staff meetings or in-service meetings). Participates in graduation ceremonies, as assigned. Participates regularly in continuing professional development activities. Performs other duties or special projects as assigned. Education/Experience Needed: A related master's degree from a regionally accredited institution, with 18 or more graduate hours in speech, speech communications, or mass communications. A minimum of four (4) years of experience in a related field. Registration, license, or certification as required by the state or accrediting agencies. Excellent interpersonal, organizational, and communications skills a must. Computer literacy and teaching experience desired. Learn more about us at Remington College – Columbia Campus. We offer a competitive salary, along with a comprehensive benefits package that includes health, dental, disability, life, vision, 401K, and flexible spending accounts, for full-time employees. How to Apply Help us train tomorrow’s work force! Qualified candidates: Please click the APPLY NOW button. Or, you may email your résumé and cover letter for consideration to audrey.breland@remingtoncollege.edu. We provide reasonable accommodation where appropriate to applicants with disabilities.
| |||||||
6-9 | (2015-12-26) POST-DOC OPENING IN SPEECH INTELLIGIBILITY AT IRIT-TOULOUSE, FRANCE POST-DOC OPENING **************************************************************** Title: Postdoctoral position in speech intelligibility Application deadline: 1/31/2016 Description: The decreasing mortality of Head and Neck Cancers highlights the importance to reduce the impact on Quality of Life (QoL). But, the usual tools for assessing QoL are not relevant for measuring the impact of the treatment on the main functions involved by the sequelae. Validated tools for measuring the functional outcomes of carcinologic treatment are missing, in particular for speech disorders. Some assessments are available for voice disorders in laryngeal cancer but there are based on very poor tools for oral and pharyngeal cancers involving more the articulation of speech than voice. In this context, the C2SI (Carcinologic Speech Severity Index) project proposes to develop a severity index of speech disorders describing the outcomes of therapeutic protocols completing the survival rates. There is a strong collaboration between linguists, phoneticians, speech therapists and computer science researchers, in particular those from the Toulouse Institute of Computer Science Research (IRIT), within the SAMoVA team (http://www.irit.fr/recherches/SAMOVA/). Intelligibility of speech is the usual way to quantify the severity of neurologic speech disorders. But this measure is not valid in clinical practice because of several difficulties as the familiarity effect of this kind of speech and the poor inter-judge reproducibility. Moreover, the transcription intelligibility scores do not accurately reflect listener comprehension. Therefore, our hypothesis is that an automatic assessment technic can measure the impact of the speech disorders on the communication abilities giving a severity index of speech in patients treated for head and neck and particularly for oral and pharyngeal cancer. The main objective is then to demonstrate that the C2SI, obtained by an automatic speech processing tool, produces equivalent or superior outcomes than a score of speech intelligibility obtained by human listeners, in terms of QoL foreseeing the speech handicap, after the treatment of oral and/or pharyngeal cancer. The database is actually recorded at the Institut Universitaire du Cancer in Toulouse with CVC pseudo-words, readings, short sentences focusing on prosody and spontaneous descriptions of pictures. Roadmap to develop an automatic system that will evaluate the intelligibility of impaired speech: - Study existing SAMoVA technologies and evaluate them with the C2SI protocol, - Find relevant features with the audio signal that support intelligibility, - Merge those features to obtain the C2SI, - Correlate it with the speech intelligibility scores obtained by human listeners, - Study in which way the features support understandability as well. Skills: For this project, we are looking for one candidate with a PhD degree in the areas of machine learning, signal processing, and also with: programming skills, scientific rigour, creativity, good publication record, excellent communication skills, enjoying teamwork... Salary and other conditions of employments will follow CNRS (French National Center for Scientific Research) standard rules for non-permanent researchers, according to the experience of the candidate. Location: the work will be conducted in the SAMoVA team of the IRIT, Toulouse (France). Contact: Jérôme Farinas jerome.farinas@irit.fr , Julie Mauclair julie.mauclair@irit.fr Duration: 12 to 24 months Candidates should email a letter of application, a detailed CV including a complete list of publications, and source code showcasing programming skills if available. Julie Mauclair
Assistant Professor
IRIT
Toulouse, France
| |||||||
6-10 | (2016-01-08) Software/research engineer at LIMSI, Orsay, France LIMSI (www.limsi.fr) is looking for a software/research engineer to work on the
| |||||||
6-11 | (2016-01-09) Ingénieur R&D en CDI pour le département de la recherche de l'Ina, Paris
| |||||||
6-12 | (2016-01-10) TTS research engineer at Nuance Shanghai, China TTS Research Engineer – Nuance Shanghai, China Reporting to TTS manager, the research engineer will conduct innovative research and development with focus on TTS Front-end or Back-end technologies. Responsibilities: -As part of the TTS R&D organization, you will be contributing to the development of text-to-speech technology for all types of markets and platforms with focus on Asian languages. Representative tasks will include: - Improve TTS front-end or back-end with algorithm innovations. - Develop product and tools, - Maintenance and support (PS / Bug fixes) - Active contribution to the improvement of all QA processes Qualification: Native Mandarin, Good English, additional Asia languages is a plus - Experience with TTS research and development - Experience with NLP research and development - Excellent scripting / programming skills - Experience with SCM tools - Self-starter, team player - Passion for quality - Innovative and curious - 'free thinker' - Master degree in EE/CS/Comp. Ling (or similar) Please send your resume application to eva.li@nuance.com
| |||||||
6-13 | (2016-01-11) Stage Master 2 de Recherche en Traitement automatique des langues/Extraction d'information, LIMSI Orsay, France Stage Master 2 de Recherche en Traitement automatique des langues/Extraction d'information
| |||||||
6-14 | (2016-01-14) Technical Engineer / Scientist at ELDA
| |||||||
6-15 | (2016-01-20) Language Engineer at the TTS team of Google The TTS team at Google is looking for a Language Engineer to help improve synthesis in English and French. Based in Google London offices, you will be helping with technical tasks involved in creating a speech synthesizer. This includes: 1. Developing rules for a text normalization system; 2. Large scale data mining; 3. Customizing language building tools for English and French. 4. Text-to-Speech quality evaluation and testing Requirements: 1. Recent Computer Science graduate or closely related discipline 2. Native-level speaker in French/English and fluent in English. 3. Proficiency in Unix/Version Control System and a modern programming language (Python/C++ preferred) 4. Ability to build and understand regular expressions 5. Interest in data mining and natural language processing a plus This is an opportunity to work on cutting edge technology in a dynamic team of world-class experts. Project duration: 6-11 months (with potential for extension) **This is not a permanent position but a contract position through an employment agency. Applicants must be currently authorized to work in the UK.** For immediate consideration, please email your CV and cover letter in English (PDF format preferred) with 'Language Engineer English or French' in the subject line. Application Deadline: (Open until filled)
Email Address for Applications: tts_jobs@google.com
| |||||||
6-16 | (2016-01-21) Researcher in machine learning at Reykjavik University, Iceland Machine learning and language technology
| |||||||
6-17 | (2016-01-25) Poste de MCF au LIG (équipe GETALP) pour la recherche et au département I3L (informatique pour les lettres, langues et langage) de l'Univ. Grenoble Alpes (France) Poste de MCF au LIG (équipe GETALP) pour la recherche et au département I3L (informatique pour les lettres, langues et langage) de l'Univ. Grenoble Alpes (UGA= université grenobloise fusionnée depuis 1/1/2016 regroupant les 3 établissements U. Joseph Fourier, U. Pierre Mendes-France et U. Stendhal) pour l'enseignement.
Short description of the position in English
'The associate professor position concerns informatics and speech processing. The selected candidate will join the GETALP group of LIG laboratory and reinforce the speech / spoken language processing axis.
Teaching will be given in student programs covering human and social sciences. Intitulé du poste pour la publication : Informatique et traitement de l?oral
Composante ou service : département I3L (informatique pour les lettres, langues et langage)
Numéro du poste* : 0209 Section CNU : 27-07 Unité de recherche ou unité mixte de recherche de rattachement (nom et n°) : LIG UMR 5217
Localisation du poste : Grenoble Mots-clés renseignés pour la recherche du poste dans Galaxie par les candidats
Traitement automatique des langues Traitement de la Parole Plurilinguisme Interaction non-verbale Communication homme-machine Profil Enseignement Les objectifs pédagogiques sont de préparer l?ensemble des étudiants de Lettres-Langues-Langage à l?intégration du numérique dans leurs filières afin de faire face à la diversité, la synergie et l?évolution des services numériques et dispositifs d'interaction centrés-humains et des contextes d'usage. Les besoins en enseignement du département I3L se situent dans les domaines suivants : TAL, web dynamique, corpus électroniques, ingénierie linguistique, évaluation des outils du TAL, bureautique pour Lettres, Langues, Langage. Plus spécifiquement dans les masters, les enseignements devront être au plus près des activités de recherche et répondre aussi bien aux besoins de l?insertion professionnelle qu?à l?innovation en R&D. Filières de formation concernées : ? Licence : Globalement dans le cadre des cours d'informatique (toutes les mentions sont concernées), plus spécifiquement dans le cadre du module « Métiers des humanités numériques »
? Master : Globalement dans les enseignements liés à l?informatique des masters des deux UFR LLASIC ET LE et plus spécifiquement dans le master Sciences du langage, spécialité Industries de la Langue (IDL) ;
La personne recrutée devra avoir une connaissance du tissu industriel du secteur et de son devenir, lui permettant de travailler à une bonne identification de compétences en adéquation avec les évolutions rapides des entreprises du domaine. Profil Recherche La personne recrutée intégrera l?équipe GETALP du LIG qui s'intéresse à tous les aspects théoriques, méthodologiques et pratiques de la communication et du traitement de l'information multilingue (écrite ou orale). GETALP porte par ailleurs un intérêt spécifique aux situations d?interactions et aux contextes atypiques (langues peu-dotées, locuteurs atypiques, relation sociale endommagée, etc.) en prenant en compte la diversité des langues, des locuteurs, des cultures et des relations socio-affectives. La pluridisciplinarité de GETALP (informaticiens, linguistes, phonéticiens, traducteurs, roboticiens, etc.) croise les approches expertes vs. empiriques et s?appuie sur des corpus langagiers de grande taille tout en développant des corpus annotés sur des hypothèses théoriques riches (« beautiful data »). Les aspects méthodologiques (évaluation, expérimentation écologique dans les plateformes Domus, le FabMSTIC ou in situ, éthique) sont centraux, en particulier dans le transfert vers les partenaires industriels. La personne recrutée devra permettre de renforcer les aspects interdisciplinaires de la recherche et participera à l?élaboration des méthodes d?évaluations dans une démarche éthique, concernant les processus d?expérimentation et les conséquences sociétales des innovations potentielles. Un point essentiel sera d?entretenir et développer les collaborations avec les autres équipes du LIG, autant sur les aspects informatiques et méthodologiques que sur les situations d?observation ou les applications. Plus largement la personne recrutée sera incitée à enrichir les collaborations avec les autres laboratoires impliqués dans le pôle de recherche ALLSHS de l?UGA.
Activités administratives La personne recrutée pourra avoir en charge la responsabilité des enseignements I3L en licence, et participer aux responsabilités administratives dans l?UFR LLASIC. Contact pour la recherche : BESACIER, Laurent Laurent.Besacier@imag.fr Contact pour l?enseignement AUBERGE, Véronique Veronique.Auberge@u-grenoble3.fr dates audition: 23 Mai 2016
| |||||||
6-18 | (2016-01-30) PhD position in AVSR, Trinity College Dublin, Ireland PhD position in AVSR at Trinity College Dublin, Ireland With a link to this advert: http://adaptcentre.ie/careers/tcd_phd_IG_PhD5_D.pdf
| |||||||
6-19 | (2016-02-06) 2 post-doc positions at the Italian Institute of Technology. 1 post-doc in Brain- and biosignal-based speech recognition - 1 post-doc in New techniques for vision-assisted speech processing with event-drive cameras (within the EU H2020 Ecomode project)
| |||||||
6-20 | (2016-02-27) Maître de Conférences en informatique au LIA , Avignon France Un poste de Maître de Conférences en informatique (section 27) est mis au concours pour
| |||||||
6-21 | (2016-02-27) 3 Teaching and Research positions at Lium, Université du Maine, France Trois postes d'enseignants-chercheurs en informatique sont à pourvoir à l'Université du Maine lors de la campagne de recrutement 2016 : un poste de Professeur et deux postes de Maîtres de conférences.
| |||||||
6-22 | (2016-03-01) Professeur , Grenoble, France Intitulé du poste pour la publication : Informatique et traitement de l?oral
Composante ou service : département I3L (informatique pour les lettres, langues et langage)
Numéro du poste* : 0209 Section CNU : 27-07 Unité de recherche ou unité mixte de recherche de rattachement (nom et n°) : LIG UMR 5217
Localisation du poste : Grenoble Mots-clés renseignés pour la recherche du poste dans Galaxie par les candidats
Traitement automatique des langues Traitement de la Parole Plurilinguisme Interaction non-verbale Communication homme-machine Short description of the position in English
'The associate professor position concerns informatics and speech processing. The selected candidate will join the GETALP group of LIG laboratory and reinforce the speech / spoken language processing axis.
Teaching will be given in student programs covering human and social sciences. Profil Enseignement Les objectifs pédagogiques sont de préparer l?ensemble des étudiants de Lettres-Langues-Langage à l?intégration du numérique dans leurs filières afin de faire face à la diversité, la synergie et l?évolution des services numériques et dispositifs d'interaction centrés-humains et des contextes d'usage. Les besoins en enseignement du département I3L se situent dans les domaines suivants : TAL, web dynamique, corpus électroniques, ingénierie linguistique, évaluation des outils du TAL, bureautique pour Lettres, Langues, Langage. Plus spécifiquement dans les masters, les enseignements devront être au plus près des activités de recherche et répondre aussi bien aux besoins de l?insertion professionnelle qu?à l?innovation en R&D. Filières de formation concernées : ? Licence : Globalement dans le cadre des cours d'informatique (toutes les mentions sont concernées), plus spécifiquement dans le cadre du module « Métiers des humanités numériques »
? Master : Globalement dans les enseignements liés à l?informatique des masters des deux UFR LLASIC ET LE et plus spécifiquement dans le master Sciences du langage, spécialité Industries de la Langue (IDL) ;
La personne recrutée devra avoir une connaissance du tissu industriel du secteur et de son devenir, lui permettant de travailler à une bonne identification de compétences en adéquation avec les évolutions rapides des entreprises du domaine. Profil Recherche La personne recrutée intégrera l?équipe GETALP du LIG qui s'intéresse à tous les aspects théoriques, méthodologiques et pratiques de la communication et du traitement de l'information multilingue (écrite ou orale). GETALP porte par ailleurs un intérêt spécifique aux situations d?interactions et aux contextes atypiques (langues peu-dotées, locuteurs atypiques, relation sociale endommagée, etc.) en prenant en compte la diversité des langues, des locuteurs, des cultures et des relations socio-affectives. La pluridisciplinarité de GETALP (informaticiens, linguistes, phonéticiens, traducteurs, roboticiens, etc.) croise les approches expertes vs. empiriques et s?appuie sur des corpus langagiers de grande taille tout en développant des corpus annotés sur des hypothèses théoriques riches (« beautiful data »). Les aspects méthodologiques (évaluation, expérimentation écologique dans les plateformes Domus, le FabMSTIC ou in situ, éthique) sont centraux, en particulier dans le transfert vers les partenaires industriels. La personne recrutée devra permettre de renforcer les aspects interdisciplinaires de la recherche et participera à l?élaboration des méthodes d?évaluations dans une démarche éthique, concernant les processus d?expérimentation et les conséquences sociétales des innovations potentielles. Un point essentiel sera d?entretenir et développer les collaborations avec les autres équipes du LIG, autant sur les aspects informatiques et méthodologiques que sur les situations d?observation ou les applications. Plus largement la personne recrutée sera incitée à enrichir les collaborations avec les autres laboratoires impliqués dans le pôle de recherche ALLSHS de l?UGA.
Activités administratives La personne recrutée pourra avoir en charge la responsabilité des enseignements I3L en licence, et participer aux responsabilités administratives dans l?UFR LLASIC. a) la procédure pour candidater est désormais en ligne sur
http://www.univ-grenoble-alpes.fr/fr/grandes-missions/universite/recrutement/enseignants-chercheurs/
b) le profil du poste concerné est accessible sur
(référence galaxie n°7 'informatique et traitement de l'oral')
Contact pour la recherche : BESACIER, Laurent Laurent.Besacier@imag.fr Contact pour l?enseignement AUBERGE, Véronique Veronique.Auberge@u-grenoble3.fr dates audition: 23 Mai 2016
| |||||||
6-23 | (2016-03-15) Thèse ministérielle à l'IRISA intitulée 'Transformation de registres par extraction de motifs langagiers' Vous trouverez à l'URL suivante et en pièce jointe une offre de thèse ministérielle à l'IRISA intitulée 'Transformation de registres par extraction de motifs langagiers' : https://www-expression.irisa.fr/files/2011/07/sujet1.pdf . Merci de diffuser auprès de personnes éventuellement intéressées.
Pour plus de précisions ou pour postuler, merci de contacter : - Pascale Sébillot : pascale.sebillot@irisa.fr - Nicolas Béchet : nicolas.bechet@irisa.fr - Gwénolé Lecorvé : gwenole.lecorve@irisa.fr
Date limite de candidature : 30 avril 2016.
| |||||||
6-24 | (2016-04-03) Post Doctoral Position in Machine Learning at Univ. Lille, France We are happy to advertise for a Post Doctoral Position in Machine Learning at Univ. Lille, France
| |||||||
6-25 | (2016-04-11) Research Associate Position at York University, Toronto, Canada Research Associate Position at York University, Toronto, Canada
| |||||||
6-26 | (2016-04-20) PhD position at Idiap Research Institute, Martigny, Switzerland A new PhD position is open in 'Multilingual Affective Speech Synthesis' at Idiap Research
| |||||||
6-27 | (2016-04-20) Speech Recognition Researcher position (full-time) at Sony, in Stuttgart, Germany. Speech Recognition Researcher position (full-time) at Sony, in Stuttgart, Germany. For details, please check out http://www.stuttgart.sony.de/home/career-jobs/open-positions.html. We welcome your e-mail to denise.biermann@eu.sony.com.
| |||||||
6-28 | (2016-04-21) RESEARCH SCIENTIST, EMR.AI Inc., San Francisco, CA, USA
EMR.AI Inc., San Francisco, CA RESEARCH SCIENTIST, NATURAL LANGUAGE PROCESSING Headquartered in San Francisco, CA, EMR.AI Inc. is a leading provider of AI solutions to the medical sector. EMR.AI transforms unstructured information, in form of written, spoken, or typed reports, clinical test results, and radiographs into international standard codes saved in common EMR systems. The wealth of discrete medical data provided through this transformation in conjunction with EMR.AI's suite of medical analytics solutions enables stakeholders, practitioners, researchers, health providers, and policy makers to obtain a comprehensive picture of the available medical data in their organization.
SUMMARY EMR.AI Research & Development has openings for Research Scientists in the field of Natural Language Processing in our Downtown San Francisco offices. Scientists will work on projects spanning a variety of tasks including the semantic interpretation of written and spoken medical reports, the design of language models for a variety of NLP tasks and speech recognition, the summarization of written and spoken language in the medical domain, the incorporation of lexica, ontologies, relational databases, and other sources of structured and unstructured knowledge sources into EMR.AI’s medical NLP tool set, and others. This is a unique opportunity to be part of a cutting-edge R&D team in the epicenter of the world’s AI tech industry with true impact on medical research.
RESPONSIBILITIES Process huge corpora of medical textual documents to perform syntactic and semantic analyses and train, tune, and test probabilistic and other data-driven models, using both existing tool benches, proprietary and open-source, as well as self-developed algorithms and techniques. Produce high-quality programs and scripts to embed scientific algorithms into effective prototypes and demos to be shared with EMR.AI’s leadership team, its customers, partners, and vendors. Create and document technological innovations by means of patent disclosures, scientific publications, media alerts, and other channels.
Work closely with EMR.AI’s speech processing team and its software engineering division to produce innovative and effective solutions for a range of AI products and services in the medical domain. Represent the R&D division in communications with EMR.AI’s leadership team, its customers, partners, and vendors at meetings, conventions, and other venues as well as in written statements.
SKILLS PhD in computer science, computational linguistics, electrical engineering, or a related field. Experience in the state of the art of NLP and its standard tools is required. Candidates must be very skilled in programming and must have a proven scientific track record. They must be excellent team players, including with distributed teams, and strong in oral and written English communication. Knowledge of the US medical sector is desirable, so are experience with start-ups and strong scientific connections throughout the Bay Area and beyond.
BENEFITS EMR.AI offers competitive salaries, an excellent benefit package, and a stimulating work environment in the heart of San Francisco with manifold local, domestic, and international commercial and academic partnerships.
HOW TO APPLY Please send your application documents to jobs@emr.ai
CONTACT EMR.AI Inc. 90 New Montgomery St San Francisco, CA 94105, USA phone: +1-415-200-8535 e-mail: info@emr.ai www: http://emr.ai
| |||||||
6-29 | (2016-04-28) 3-year postdoctoral research position in speech processing, Trinity College, Dublin, Ireland A 3-year postdoctoral research position in speech processing is available in the Sigmedia Group in Trinity College Dublin, Ireland.
Full details of the position are given in the advert available at: http://adaptcentre.ie/careers/TCD_postdoc_themeD-2.pdf
| |||||||
6-30 | (2016-05-02) Postdoctoral Position in Linguistics (Prosodic Phrasing and Discourse Comprehension), Université catholique de Louvain, Belgique Postdoctoral Position in Linguistics (Prosodic Phrasing and Discourse Comprehension), Université catholique de Louvain The Institute for Language & Communication of the Université catholique de Louvain (Belgium) is currently seeking a postdoctoral researcher to perform corpus analysis and experimental research on the role of prosody in discourse comprehension. The successful candidate will work under the supervision of Prof. Anne Catherine Simon and will join the project “PhraDiCo: Prosodic phrasing for discourse comprehension. Perception of prosodic boundaries and their contribution to the construal of discourse coherence”. He/she will be employed on a fixed-term contract for 24 months. Project Description [PhraDiCo] Prosodic phrasing for discourse comprehension. Perception of prosodic boundaries and their contribution to the construal of discourse coherence. Prosodic phrasing is central to language comprehension and discourse coherence. Spoken language is highly transitory and dynamic. Speech is processed in real-time, as the hearer allocates his attention to each incoming unit and integrates it into a coherent whole. Speech is chunked into prosodic units which are presumably used as basic units of discourse processing. During its first phase, this project aims to explore how ordinary listeners (as opposed to trained experts) perceive prosodic boundaries in spoken language. What is the relationship between the prosodic cues (pauses, lengthening, melodic discontinuity etc.) and the perceived strength of prosodic boundaries? To what extent does the syntactic structure contribute to the perception of prosodic boundaries? Are naïve listeners consistent in their appreciation of boundary strength on word boundaries? Once those questions have been sufficiently tackled, we will focus on analysing the contribution of prosodic boundaries to discourse coherence: relational coherence, which seeks to establish discourse relations between discourse “building blocks”, and referential coherence, i.e. the identification of large-scale discourse topics. Four perceptual experiments have been designed to investigate (i) the acoustic and linguistic factors involved in the perception of prosodic boundary strength; (ii) the influence of speaking styles variability (speech rate, pitch register) on prosodic boundary perception; (iii) the contribution of prosodic units to the construal of discourse relations and (iv) to the identification of large-scale discourse topics. The project is undertaken by the research centre “Valibel – Discours & Variation” (Discourse & Variation, www.uclouvain.be/valibel) and is funded by F.S.R. – FNRS (www.frs-fnrs.be). Job Description
The appointed researcher will work in close collaboration with the project’s supervisor and another postdoctoral researcher. He/she will participate in research leading to scientific publications.
Qualifications
Place of employment: Louvain-la-Neuve, Belgium (30 km south of Brussels). With 21000 students and 5000-member staff, the University of Louvain is the largest university of the Brussels and Wallonia regions of Belgium. Contract length: 24 months. Contract term: Full-time Starting date: July 2016 Deadline for applications: 5 June 2016
Applications (motivation letter, detailed CV, 1-3 recommendation letters and 3 relevant publications), should be sent in electronic format to Prof. Anne Catherine Simon (anne-catherine.simon@uclouvain.be).
| |||||||
6-31 | (2016-05-10) PhD positions at Inria, France PhD positions are available in our group at Inria, in France, on deep learning for:
| |||||||
6-32 | (2016-05-12) PhD Position at LORIA, Nancy, France PhD Position: Merging acquisition and processing of cineMRI of the vocal tract
SubjectTracking the position of speech articulators along time is crucial to understand speech production better. For a long time X-ray imaging was the only technology able to acquire images at a sufficiently high sampling frequency (around 50 images per second) for visualizing articulatory gestures. However, this technique has been abandoned at the end of the eighties because of the health hazard implied by ionizing radiations. Furthermore, since the whole vocal tract is projected on the image plane contours of organs (especially the mandible, teeth and tongue) overlap on the images making the processing of images very difficult.
The interest of Magnetic Resonance Imaging (MRI) is to provide an excellent contrast of soft tissues for a slice placed in any orientation and dynamic MRI is acknowledged as a powerful tool for imaging speech production [3]. However, current performance of cineMRI remains inadequate in terms of sampling rate and spatial resolution, and the objective of this thesis is to develop more efficient acquisition protocols and algorithms.
The objective is to develop protocols by exploiting the latest advances in MRI, particularly parallel imaging and reconstruction under parsimony constraints called ?compressed sensing? [1]. IADI laboratory developed MRI reconstruction techniques with movement compensation [2] and multi-slice dynamic reconstruction enabling super resolution. These techniques have been already applied to cardio-respiratory movements by using physiological signals (ECG and respiratory) as constraints for the reconstruction algorithms.
A first preliminary work consisting of applying these techniques to the domain of speech production has been carried out [4]. A second work was dedicated to the development of an acquisition protocol based on ?compressed sensing?. The idea is to exploit the parsimony of the image Fourier transform coefficients in order to acquire only a small number of them, and then to reconstruct the image in an optimal manner.
However, it is possible to do better since the speech signal is acquired simultaneously then denoised before being segmented into speech sounds. Therefore, the contribution of each line acquired in the image Fourier space can be related to the speech sound it corresponds to, and one can take advantage of this information to improve the resolution of reconstructed images. This idea will be exploited with the objective of realizing a proof of concept of automatic acquisition/reconstruction of MRI images of vocal tract during speech production. We would like to go further by utilizing the knowledge of the speech sound and the approximative vocal tract shape predicted for the sound by an articulatory model to pilot acquisition.
Keywords: MRI acquisition, compressed sensing, speech processing, articulatory modeling, vocal tract
Some references[1]Michael Lustig, David Donoho, and John M. Pauly. Sparse MRI: The application of compressed sensing for rapid MR imaging. MAGNETIC RESONANCE IN MEDICINE, 58(6):1182?1195, December 2007. [2]F. Odille, P. A. Vuissoz, P. Y. Marie, and J. Felblinger. Generalized reconstruction by inversion of coupled systems (GRICS) applied to free-breathing MRI. Magn Reson Med, 60(1):146?57, July 2008. [3]Andrew D Scott, Marzena Wylezinska, Malcolm J Birch, and Marc E Miquel. Speech MRI: Morphology and function. Physica Medica, 30(6):604?618, 2014. [4]P.A. Vuissoz, F. Odille, Y. Laprie, E. Vincent, G. Hossu, and J. Felblinger. Speech Cine SSFP with optical microphone synchronization and motion compensated reconstruction. In ISMRM Workshop on Motion Correction in MRI, Tromso, Norvège, May 2014. EnvironmentBoth laboratories IADI and LORIA have developed a narrow and fruitful collaboration for years which in particular resulted in the development of a ?compress sensing? acquisition algorithm and in a research contract on articulatory synthesis. A working environment covering articulatory modeling and MRI data acquisition domains is now available and will offer very favorable conditions for this work. Supervisors· Pierre-André Vuissoz (IADI ? Imagerie Adaptative Diagnostique et Interventionnelle, unité INSERM U947) pa.vuissoz@chu-nancy.fr ApplicationWe are looking for a highly motivated person with a master degree in computer sciences, applied mathematics or computer sciences. The applicant should have a solid background in signal processing (and Matlab software) and computer sciences. Knowledge in speech processing will be also appreciated.
a) Motivation letter, b) CV, c) academic transcripts (with explanation of the grade scale adopted), d) 2 references (letters or names)
Expected start date: 1st October 2016
| |||||||
6-33 | (2016-05-14) Senior Speech Analytics Engineer at SRI
Title: Senior Speech Analytics Engineer
Job Description SRI’s Speech Technology and Research (STAR) Lab seeks a self-motivated, team-oriented, experienced leader and innovator to work in the area of speaker state analysis (for example, emotion and health states). The work will include advancing technology capabilities, determining client needs and understanding client data, winning new funding, interacting with clients, both government and commercial, and leading current or new projects in this area. The successful candidate must be proactive and entrepreneurial. This is an opportunity to have a significant impact in an emerging research area. STAR Lab engages in leading-edge research in speech recognition, speaker characterization, speaker and language identification, machine translation, natural language processing and other areas of speech/language technology, offering opportunities from basic research to prototyping, productization, and widespread deployment. The multidisciplinary research team consists of excellent speech researchers, linguists and software engineers. Characteristics of STAR staff are enthusiasm, self-motivation, initiative, passion for learning, taking ownership, thriving in a flat organizational hierarchy, and a desire to control their own career paths. SRI International is a premier nonprofit research organization based in Menlo Park, California with a mission to create world-changing solutions making people safer, healthier, and more productive. Located in the heart of Silicon Valley, the world’s center for innovation and technology, SRI’s focus spans everything from basic research to delivered systems, government and commercial work and spinning off successful companies like SIRI. Flexible cross-laboratory teams often form dynamically to solve challenging problems spanning multiple disciplines. Requirements A PhD degree or equivalent with at least 10 years of experience in speech or NLP technology. Prior interest in the area of speaker characterization is an advantage. Experience of leading teams of projects in either academia or industry along with a record of either successful proposals (government or industry) or managing a start-up in the speech area is required. Strong oral and written communication skills in English are expected. US citizenship is a plus but not required. Must be flexible to traveling. Apply online at www.sri.com/careers Job ID: 1020BR SRI is an Equal Employment Opportunity/Affirmative Action Employer
| |||||||
6-34 | (2016-05-14) Research Associate Position at York University, Toronto, Canada Research Associate Position at York University, Toronto, Canada
| |||||||
6-35 | (2016-05-14) Speech Technologist @ Voice Tech Startup!, Belmont, MA, USA Speech Technologist @ Voice Tech Startup! Location: Belmont, MA Available: Immediately VocaliD is a voice technology start-up that is bringing speaking machines to life by creating personalized digital voices. Our award-winning technology is grounded in over a decade of research at Northeastern University and has enabled us to attract Small Business Innovation awards from the National Science Foundation and the National Institutes of Health. We successfully delivered a set of beta voices in 2015 and have a waiting list of pre-orders though our oversubscribed crowdfunding campaign. We are now seeking an experienced research-based speech technologist to join our team as we launch our personalized voice service in mid 2016. Communications applications and devices are getting smarter and using more sophisticated AI, yet the voices remain robotic and unnatural. Until now, the cost and time to build a custom voice has been prohibitive. VocaliD creates personalized digital voices at scale for use on any device that speaks. Our proprietary voice search and speech blending algorithms leverage our “Human Voicebank”, a collection of 3.5 million of recordings crowdsourced from around the world. Our breakthrough is that we only need 2-3 seconds of someone’s voice to (re)create a voice when combined with recordings of a matched speaker from the Voicebank. Our initial market addresses the 10M+ people worldwide who use a machine to speak. VocaliD enables these individuals to speak in their own voice instead of the familiar “Stephen Hawking” voice. Join our dynamic team as we push the envelop on innovation while having real social impact! Responsibilities: - Create tools for filtering and parsing recorded voices - Implement machine learning algorithms to match donor and recipient voices - Enhance source-filter separation and voice conversion algorithms - Design and implement novel algorithms for tracking and monitoring speech changes - Create a framework for algorithmic synthesis of voices from donor samples - Implement a SaaS framework for voice deployment Required Qualifications: - MS or PhD in Electrical or Computer Engineering, Computer Science or related field - Experience programming in C/C++, Python, Java - Experience productizing algorithms from prototypes in Matlab (or other) - Experience with Digital Signal Processing and ideally speech/voice technologies - Experience with AWS services EC2, S3, RDS (Postgres), Route 53 - Experience with Windows and Linux If you are interested in joining a thriving and passionate team, please submit a cover letter and resume to rupal@vocaliD.co Visit us at www.vocaliD.co for more information about VocaliD.
| |||||||
6-36 | (2016-05-18) Researcher at laboratoire Parole et Langage, Aix-Marseille University, France
Call for Postdoctoral Positions in Cognitive Science/Psychology/Linguistics
Institution: Laboratoire Parole et Langage (CNRS, Aix-Marseille Université)
Location: Aix-en-Provence, France
No. of positions: 1
Duration: 2 years (see below)
Application deadline: July 20, 2016
The Laboratoire Parole et Langage (CNRS, Aix-Marseille Université, Aix-en-Provence, FR) invites applications for one postdoctoral fellowship supported by a grant from the French National Research Agency (ANR). The funded project aims at exploring the role of prosody in speech planning and how planning strategies are affected both by the pragmatic situation and by individual cognitive-emotional abilities. Cognitive-emotional constraints are investigated by comparing healthy adults and clinically impaired individuals (patients affected by multiple sclerosis). The project involves an interdisciplinary team from across the Laboratoire Parole et Langage (which includes experts in linguistics, speech analysis, psychology) including collaboration with the 'Pôle de Neurosciences Cliniques' of the Hospital la Timone (Marseille) and the Neurology Department of the Centre Hospitalier du Pays d'Aix (Aix-en-Provence).
Applicants must have a Ph.D. in a relevant field of study (e.g., Cognitive Science, Linguistics, Psychology), and a strong background in experimental design and statistics. Special attention will be given to candidates with expertise in cognitive aspects of social coordination and in emotion in spoken language. Experience with psychophysiological methodologies and conducting studies with neurological patients is also highly desirable.
The postdoctoral appointment is for a duration of up to two years depending on the starting date, and subject to review after the first year. Applicants should hold a PhD or expect to complete a PhD by the starting date. The starting date is flexible, though the position should be filled by the beginning of the fall of 2016.
Applications should include (i) a cover letter that relates the applicants’ experience and interests to the project, (ii) a comprehensive CV, (iii) the names and contact information of two references, and (iv) a PDF copy of all publications or a list of links where these can be accessed.
Applications in French or English may be sent by email to Elisa Sneed German at elisa.sneed@lpl-aix.fr.
For further inquiries regarding the position or the project, please contact Caterina Petrone (Principal Investigator) at caterina.petrone@lpl-aix.fr.
| |||||||
6-37 | (2016-05-25) PhD Positions in Signal Processing for Speech Communication, Graz University, Austria =============================================
| |||||||
6-38 | (2016-06-01) Ph.D. grant at IRISA, Rennes
Merci de diffuser cette information auprès de personnes qui pourraient être intéressées. Peuvent candidater à cette thèse des étudiants ayant un master recherche en linguistique (spécialité phonétique et phonologie, prosodie, discours), en linguistique informatique ou en informatique. Ils doivent être intéressés par un travail collaboratif et par le traitement automatique de la parole. Pour plus de précisions ou pour postuler, merci de contacter : - Elisabeth Delais-Roussarie : elisabeth.roussarie@wanadoo.fr - Damien Lolive : damien.lolive@irisa.fr Date limite de candidature : 4 juillet 2016
|