ISCApad #170 |
Monday, August 06, 2012 by Chris Wellekens |
6-1 | (2012-02-02) Research and Development Opportunities for Next Generation Technology at Microsoft
Research and Development Opportunities for Next Generation Technology at Microsoft Do you want to impact billions of people all over the world with speech technology that you create?
We are looking for PhD level scientists and senior scientists, who will work on research problems in spoken language understanding, statistical dialog modeling, natural language generation, machine learning, statistical language modeling, and acoustic modeling.
Microsoft is all-in on the Natural User Interface to bring computing to larger audiences in more applications. To drive this mission we are bringing together scientists and engineers in the areas of speech recognition, natural language understanding, dialog modeling, machine learning and synthesis to develop and deliver robust, natural and scalable solutions across a rich set of scenarios and languages.
Join the excitement to be part of the newly formed team of scientists within Microsoft and to impact the lives of billions of people all over the world. We’re talking about Bing, Windows, XBOX, Mobile, Exchange Server and Tellme, just to name a few. Microsoft is dedicated to improving everyday life using speech. And not just in a few countries - but around the world.
How to apply: MICROSOFT CORPORATION Attention: Recruiting, One Microsoft Way, STE 303, Redmond WA 98052-8303
Or email resume to: Tom Swanson toswanso@microsoft.com Please reference Speech in the subject line.
| ||
6-2 | (2012-02-05) NSF-Supported Summer Research for Undergraduates NSF-Supported Summer Research for Undergraduates
| ||
6-3 | (2012-02-02) Research and Development Opportunities for Next Generation Technology at Microsoft Research and Development Opportunities for Next Generation Technology at Microsoft Do you want to impact billions of people all over the world with speech technology that you create?
We are looking for PhD level scientists and senior scientists, who will work on research problems in spoken language understanding, statistical dialog modeling, natural language generation, machine learning, statistical language modeling, and acoustic modeling.
Microsoft is all-in on the Natural User Interface to bring computing to larger audiences in more applications. To drive this mission we are bringing together scientists and engineers in the areas of speech recognition, natural language understanding, dialog modeling, machine learning and synthesis to develop and deliver robust, natural and scalable solutions across a rich set of scenarios and languages.
Join the excitement to be part of the newly formed team of scientists within Microsoft and to impact the lives of billions of people all over the world. We’re talking about Bing, Windows, XBOX, Mobile, Exchange Server and Tellme, just to name a few. Microsoft is dedicated to improving everyday life using speech. And not just in a few countries - but around the world.
How to apply: MICROSOFT CORPORATION Attention: Recruiting, One Microsoft Way, STE 303, Redmond WA 98052-8303
Or email resume to: Tom Swanson toswanso@microsoft.com Please reference Speech in the subject line.
| ||
6-4 | (2012-02-15) Maître de Conférences contractuel, ESPCI ParisTechUn poste de Maître de Conférences contractuel (1 an renouvelable) est disponible au laboratoire SIGMA (SIGnaux, Modèles, Apprentissage statistique) de l'ESPCI ParisTech à partir d'avril 2012.
| ||
6-5 | (2012-02-10) Poste MCF Informatique [27MCF519 Paris-Sorbonne] Le poste requiert une double compétence : un haut niveau d’excellence scientifique en Informatique et en applications de l’Informatique aux sciences humaines et sociales (notamment le traitement paralinguistique de la parole et du langage, la sociologie computationnelle, …). L’intérêt porté aux applications de la théorie informatique aux sciences humaines et sociales constitue une des spécificités de l’enseignement de l’Informatique à l’Université Paris-Sorbonne. Le candidat enseignera l’Informatique dans différentes formations de licence (LFTI) et de master (ILGII, IILGI). Il s’impliquera également dans l’encadrement de nouvelles licences bi-cursus (licence Sciences-Sciences du langage, …) en projet au sein du PRES Sorbonne Universités.
| ||
6-6 | (2012-02-15)PhDs at Tilburg Center for Cognition and Communication (TiCC) research program 'Language, Communication and Cognition' (LCC), The Netherlands For the Tilburg Center for Cognition and Communication (TiCC) research program 'Language, Communication and Cognition' (LCC), we are looking for two new, enthusiastic and competent PhD colleagues.
If you are interested in one of these positions, you will need to identify a potential research topic related to one of the research themes of the LCC program. Current themes include:
- Social media and interpersonal communication. - Professional communication (medical, business, etc.). - Alignment and adaptation in communication. - Social exclusion and other social aspects of interaction. - Emotion and speech. - Language acquisition and learning. - Multimodality and communication. - Language and speech production. - Visual communication (diagrams, metaphors, etc.). - Gesture and other forms of non-verbal behavior
For the positions we seek candidates with a background in a relevant discipline, including Psycholinguistics, Communication & Information Sciences, Linguistics, Cognitive Science, Psychology or some related area, with experience in doing experiments and analyzing data.
The PhD candidates have a good (research) master degree in one of the aforementioned areas, a strong interest in doing research, excellent writing skills and a good command of English. Developing and defending a research plan is part of the procedure.
Tilburg University is rated among the top Dutch employers, offering excellent terms of employment. The collective labour agreement of Tilburg University applies. The selected candidates will start with a contract for one year, concluded by an evaluation. Upon a positive outcome of the first-year evaluation, the candidate will be offered an employment contract for the remaining years. Candidates with a Research Master (MPhil) will be offered a 1+2 years-contract. Master students might be offered a 1+3 years-contract. It is also possible to work 80% instead of fulltime. The PhD candidates will be ranked in the Dutch university job ranking system (UFO) as a PhD-student (promovendus) with a starting salary of € 2.042,-- gross per month in the first year, up to € 2.612,-- in the fourth year (amounts fulltime). The selected candidate is expected to have written a PhD thesis by the end of the contract (which may be based on articles).
Research in the Department of Communication and Information Sciences is located in the Tilburg Center for Cognition and Communication (TiCC). TiCC consists of two research programs: Language, Communication and Cognition (LCC) and Creative Computing (CC). There is a strong emphasis on experimental research and interdisciplinary cooperation. More information about the research programs can be found at http://www.tilburguniversity.edu/research/institutes-and-research-groups/ticc/. There is a strong emphasis on experimental work and interdisciplinary cooperation. The department DCI is responsible for a flourishing academic programme Communication and Information Sciences (CIW), that annually attracts about 120 Bachelor students, 130 Pre-master and 200 Master students. The department is also co-responsible for the Research Master Language and Communication. More information about the DCI department can be found at www.tilburguniversity.nl/faculties/humanities/dci/.
For more information on the positions, please contact one of LCC program leaders prof.dr. Emiel Krahmer (E.J.Krahmer@uvt.nl, +311346630700) or prof.dr. Marc Swerts (M.G.J.Swerts@uvt.nl, +31134662922).
Applications should include.
- a cover letter. - a Curriculum Vitae. - a 2-page research proposal on a selected theme, plus names of potential supervisor and promotor. - names of two references.
The only way to apply is via the online link at the bottom of this vacancy: 'apply direct'. If you receive this vacancy via eg. E-mail, please look at the vacancy located at: http://www.tilburguniversity.edu/about-tilburg-university/working-at/wp/. Applications should be sent before the application deadline of March 24, 2012. Interviews are expected to be held in April 2012. Starting dates are flexible, so applicants who expect to graduate in the summer of 2012 are also invited to apply.
| ||
6-7 | (2012-02-15) Maitre de Conférences, l'Université Sorbonne Nouvelle, Paris Un poste de Maitre de Conférences est ouvert au recrutement pour la rentrée 2012 à l'Université Sorbonne Nouvelle Paris 3. Voici le descriptif ci-dessous. (plus de détails sur http://lpp.univ-paris3.fr/postes/offres.htm)
| ||
6-8 | (2012-02-15) Postdoc at University of Trento, Italy - Machine Translation/Social Computing Postdoc at University of Trento, Italy - Machine Translation/Social Computing
| ||
6-9 | (2012-02-21) Two Postdoctoral Research Associates in Speech Technology,University of Edinburgh Two Postdoctoral Research Associates in Speech Technology -- The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336.
| ||
6-10 | (2012-02-21) Senior Researcher in Speech Technology, University of Edinburgh Senior Researcher in Speech Technology -- The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336.
| ||
6-11 | (2012-02-16) 4 PhD positions in spoken dialogue systems research / Charles University in Prague 4 PhD positions in spoken dialogue systems research
| ||
6-12 | (2012-03-03) 3 Post-doctoral positions at the Bruno Kessler Foundation, Center for Information Technology, Trento Italy
3 Post-doctoral positions available in the 'Human Language Technologies - HLT' Research Unit at the Bruno Kessler Foundation, Center for Information Technology. Workplace description: The Human Language Technology is a multi-disciplinary research unit that addresses the automatic processing of human language for a range of tasks. In particular, the research unit focuses on: automatic speech recognition, machine translation and content processing. The HLT unit has been developing state-of-the-art technology in all the main research areas it operates in. The group has consistently performed well in several international evaluations, and is currently engaged in international projects for open source software development (e.g. the Moses platform for statistical machine translation). The unit also provides technological support and high-level services in order to optimize the internal research activities, namely a shared and efficient computing environment, software tools, up to the creation and management of large scale linguistic resources. The HLT group is part of the larger network of research labs focusing on Human Language Technologies and related domains in the Trento region, that is quickly becoming one of the areas with the highest concentration of researchers in HLT and related fields anywhere in Europe. More information about the HLT Unit is available at http://hlt.fbk.eu The HLT Research Unit, is looking for 3 candidates to carry out research activities in the field of Textual Inferences, Machine Translation and Speech Recognition. Each research position will be funded through the following European research projects: MateCat: http://www.matecat.com EU-Bridge: http://www.eu-bridge.eu EXCITEMENT: website in progress Open positions: A Postdoctoral position in Textual Inferences (Ref.Code HLT_PostDoc2012_TI) The candidate is expected to carry out research activities in the context of the EU-funded project EXCITEMENT on multilingual semantic processing. The goal of the EXCITEMENT project is to develop generic semantic 'engines' or platforms for robust textual inference that are applicable across languages and linguistic frameworks. These inference platforms will be leveraged for unsupervised text exploration on customer interaction data. Concrete systems will be developed for English, German, and Italian. Project partners are Bar-Ilan University, DFKI Saarbrücken, University of Heidelberg, Almawave S.r.l, NICE Systems, and OMQ GmbH. The selected candidate will join the FBK research group with the aim of advancing the state of the art on component-based textual entailment. A Postdoctoral position in Machine Translation (Ref.Code HLT_PostDoc2012_MT) The candidate is expected to contribute original research results inside leading edge international projects. The aim is to advance the state of the art in the integration of statistical MT in computer assisted translation and in adaptive MT, by drawing ideas and contributions from different areas, such as machine learning, statistical language processing, high performance computing, etc. A Postdoctoral position in Speech Recognition (Ref.Code HLT_PostDoc2012_SR) The candidate is expected to contribute original research results inside a leading edge international project. The aim is to advance the state-of-the-art in multilingual speech processing by improving acoustic modelling, language modelling, and adaptation to different domains, conditions and genres. The contribution will be evaluated on application scenarios that include both efficient annotation of audiovisual archives and live processing of audio streams. Job requirements: Applicants should have a PhD degree related to any of the specific research areas mentioned (computational linguistics, speech processing or related fields) Experience in statistical modelling, speech processing or machine learning (preferable on approaches applied to NLP tasks) Experience in distributed software development (open source) Skills in experimental work and development of algorithms Ability to work and deliver in funded research projects Oral and written proficiency in English In adherence to FBK's policy to promote equal opportunity and gender balance, in case of equal applications, female candidates will be given preference. Employment: Contract type: Full time, 30-month contract (may be extended up to 6 months). Number of positions: 3 Gross salary: from 37,500 to 41,500 €per year (depending on the candidate’s experience) Benefits: 28 vacation days per year, flexi-time, company subsidized cafeteria or meal vouchers, internal car park, welcome office support for visa formalities, accommodation, social security, etc., reductions on bank accounts, public transportation, sport, accommodation and language courses fees. Start date: Spring 2012 Location: Povo, Trento (Italy) Application process: To apply online, please send your detailed CV (.pdf format) including a list of publications, a statement of research interests and contact information for at least 2 references. Please include in your CV your authorization for the handling of the your personal information as per the Personal data Protection Code, Legislative Decree no. 196/2003 June 2003. Applications must be sent to jobs@fbk.eu Emails should have the reference code related to the position of interest ( HLT_PostDoc2012_TI, HLT_PostDoc2012_MT or HLT_PostDoc2012_SR) Application deadline: 9 April 2012 Short-listed candidates will be contacted for an interview. Non-selected applicants will be notified of their exclusion at the end of the selection process. Please note that FBK may contact short-listed candidates who were not selected for the current openings within a period of 6 months for any selection process for similar positions. For transparency purposes, the name of the selected candidate, upon his/her acceptance of the position, will be published on the FBK website at the bottom of the selection notice
| ||
6-13 | (2012-03-08) Post-docs at the Speech Processing and Transmission Lab ,Universidad de Chile,Santiago,Chile The Speech Processing and Transmission Lab (LPTV, Laboratorio de Procesamiento y Transmisión de Voz) at Universidad de Chile,Santiago,Chile, is looking for post-doc researchers in the following fields:
Robust speech recognition Robust speaker verification Second language learning assessment
The grants are funded by Conicyt (Chilean funding Agency): http://www.conicyt.cl
The applicant are required to present a brief research proposal prepared in collaboration with the director of the LPTV. For further information, contact:
Néstor Becerra Yoma, Ph.D. Professor Speech Processing and Transmission Laboratory Department of Electrical Engineering Universidad de Chile Av. Tupper 2007,POBox412-3 Santiago,Chile
Tel. +56 2 978 4205 Fax. +56 2 695 3881 E-mail: nbecerra@ing.uchile.cl http://www.cec.uchile.cl/~labptvoz/
| ||
6-14 | (2012-03-10) Senior Researcher/Research Associate in Statistical Dialogue Systems at Cambridge UK Senior Researcher/Research Associate in Statistical Dialogue Systems
Applications are invited at either the Senior Research Associate or Research Associate
level to work on an EU-funded project called Parlance which aims to build mobile voice-
driven systems for interactive hyper-local search.
Candidates should have a PhD or comparable research experience in spoken dialogue
systems and noise robust automatic speech recognition and understanding. Good
programming skills are essential and familiarity with HTK would be an advantage.
Appointment at the senior level will require at least 3 years post-doctoral experience
and evidence of independent standing. Salary range is from £27578 to £46846.
This is an exciting opportunity to join one of the leading groups in statistical speech and
language processing. Cambridge provides excellent research facilities and there are
extensive opportunities for collaboration, visits and attending conferences.
Contact Prof Steve Young (sjy@eng.cam.ac.uk) for further information.
Application details can be found at: http://www.jobs.cam.ac.uk/job/-14472
| ||
6-15 | (2012-03-12) Postdoc position: Acoustic to articulatory mapping of fricative sounds LORIA Nancy France Postdoc position: Acoustic to articulatory mapping of fricative sounds
15 months, start between September and December 2012 at LORIA (Nancy, France).
Contact : Yves.Laprie@loria.fr
Context This subject deals with acoustic to articulatory mapping [Maeda et al. 2006], i.e. the recovery of the vocal tract shape from the speech signal possibly supplemented by images of the speaker’s face. This is one of the great challenges in the domain of automatic speech processing which did not receive satisfactory answer yet. The development of efficient algorithms would open new directions of research in the domain of second language learning, language acquisition and automatic speech recognition.
The objective is to develop inversion algorithms for fricative sounds. Indeed, there exist now numerical simulation models for fricatives. Their acoustics and dynamics are better known than those of stops and it will be the first category of sounds to be inverted after vowels for which the Speech group has already developed efficient algorithms. The production of fricatives differs from that of vowels about two points:
The approach proposed is analysis-by-synthesis. This means that the signal, or the speech spectrum, is compared to a signal or a spectrum synthesized by means of a speech production model which incorporates two components: an articulatory model intended to approximate the geometry of the vocal tract and an acoustical simulation intended to generate a spectrum or a signal from the vocal tract geometry and the noise source. The articulatory model is geometrically adapted to a speaker from MRI images and is used to build a table made up of couples associating one articulatory vector and the corresponding acoustic image vector. During inversion, all the articulatory shapes whose acoustic parameters are close to those observed in the speech signal are recovered. Inversion is thus an advanced table lookup method which we used successfully for vowels [Ouni & Laprie 2005] [Potard et al. 2008].
Activities The success of an analysis by synthesis method relies on the implicit assumption that synthesis can correctly approximate the speech production process of the speaker whose speech is inverted. There exist fairly realistic acoustic simulations of fricative sounds but they strongly depend on the precision of the geometrical approximation of the vocal tract used as an input. There also exist articulatory models of the vocal tract which yield very good results for vowels. On the other hand, these models are inadequate for those consonants which often require a very accurate articulation at the front part of the vocal tract. The first part of the work will be about the elaboration of articulatory models that are adapted to the production of consonants and vowels. The validation will consist of piloting the acoustic simulation from the geometry and of assessing the quality of the synthetic speech signal with respect to the natural one. This work will be carried out for some X-ray films, whose the acoustic signal recorded during the acquisition of them is sufficiently good.
The second part of the work will be about several aspects of the inversion strategy. Firstly, it is now accepted that spectral parameters implying a fairly marked smoothing and frequency integration have to be used, which is the case of MFCC (Mel Frequency Cepstral Coefficients) vectors. However, the most adapted spectral distance to compare natural and synthetic spectra has to be investigated. Another solution consists in modeling the source so as to limit its impact on the computation of the spectral distance.
The second point is about the construction of the articulatory table which has to be revisited for two reasons: (i) only the cavity downstream the constriction plays an acoustic role, (ii) the location of the noise source is an additional parameter but it depends on the other articulatory parameters. The third point concerns the way of taking into account the vocal context. Indeed, the context is likely to provide important information about the vocal tract deformations before and after the fricative sound, and thus constraints for inversion.
A very complete software environment already exists in the Speech group for acoustic-to-articulatory inversion, which can be exploited by the post-doctoral student.
References - [S. Ouni and Y. Laprie 2005] Modeling the articulatory space using a hypercube codebook for acoustic-to-articulatory inversion, Journal of the acoustical Society of America, Vol. 118, pp. 444-460 - [B. Potard, Y. Laprie and S. Ouni], Incorporation of phonetic constraints in acoustic-to-articulatory inversion, JASA, 123(4), 2008 (pp.2310-2323). - [Maeda et al. 2006] Technology inventory of audiovisual-to-articulatory inversion http://aspi.loria.fr/Save/survey-1.pdf
Expected skills Knowledge of speech processing and articulatory modeling, Acoustics, Computer sciences, Applied mathematics
| ||
6-16 | (2012-03-15) Ingénieur de recherche à IRISA Lannion France Un poste d'ingénieur de recherche (CDD 24 mois) est ouvert dans l'équipe de recherche Cordial de l'Irisa à Lannion. Ce recrutement, dont le profil de recherche se situe en traitement de la parole et du signal de parole, est effectué dans le cadre du projet ANR Phorevox. Le poste est à pourvoir dès que possible. Le profil détaillé est disponible en suivant le lien :
| ||
6-17 | (2012-03-15) INGENIEUR D’ETUDES ET RECHERCHE au Labo Nat. Métrologie et essais. DIRECTION DES ESSAIS Pôle Essais en Environnement
INGENIEUR D’ETUDES ET RECHERCHE EN TRAITEMENT AUTOMATIQUE DES LANGUES H/F Réf : CL/TAL/DE
Contexte :
Le Laboratoire National de Métrologie et d’Essais propose des prestations d’évaluation de la performance des systèmes de traitement automatiques des langues et de la parole pour une tâche donnée (transcription, traduction, extraction d’informations,…). Au sein du Département CEM, Sécurité Electrique et Technologies de l’information, l’équipe de traitement de l’information multimedia travaille sur les différentes étapes qui définissent une évaluation. Ses principales missions sont :
- De définir des tâches pertinentes à évaluer en fonction des besoins applicatifs et/ou théoriques, - De déterminer des caractéristiques des données à utiliser pour répondre à la tâche considérée, - D’établir des mesures qui permettent de rendre compte de la pertinence d’un système pour une tâche donnée.
Missions :
Dans le cadre de programmes d’études et recherche, vous aurez pour mission de contribuer au développement de l’activité, notamment au travers des éléments suivants :
- Le montage et la gestion de projets de recherche et développement dans le domaine du multimédia,
- L’élaboration des protocoles pour répondre aux problématiques de l’évaluation en Traitement Automatique du Langage :
- Le développement des partenariats au niveau international afin de renforcer la position du LNE dans le domaine.
Profil :
Docteur Ingénieur en Informatique, spécialisé en Traitement Automatique des Langues (TAL). Vous possédez une première expérience professionnelle (3 à 5 ans en plus de la thèse), durant laquelle vous avez travaillé sur l’évaluation des systèmes automatiques. Vous maîtrisez la gestion de projet et êtes à l’aise dans l’approche clients et l’organisation et l’animation de réunions et/ou séminaires. Vous avez des connaissances solides en programmation et analyse de données (fouille de données). Rigoureux, dynamique, déterminé et d’un relationnel facile, vous saurez rapidement vous intégrer au sein des équipes et démontrer le leadership et l’expertise nécessaires à la réussite de votre mission. Anglais courant impératif. Déplacements à prévoir (une dizaine par an de 1 à 3 jours, majoritairement en France). Poste en CDI basé à Trappes (78).
Contact :
Postuler sous la référence CL/TAL/DE A l’attention de Mlle Christelle LEBRAULT - Par mail : recrut@lne.fr
| ||
6-18 | (2012-03-25) Audio Indexing Researcher W/M position at IRCAM – 3DTV project If you already applied for this position, please just send us a quick email telling us you are still interrested and we get back to you. Audio Indexing Researcher W/M position at IRCAM – 3DTV project Starting : April 2012 (as soon as possible) Duration : 18 months Introduction to IRCAM IRCAM is a leading non-profit organization associated to Centre Pompidou, dedicated to music production, R&D and education in acoustics and music. It hosts composers, researchers and students from many countries cooperating in contemporary music production, scientific and applied research. The main topics addressed in its R&D department include acoustics, audio signal processing, computer music, interaction technologies, musicology. Ircam is located in the centre of Paris near the Centre Pompidou, at 1, Place Igor Stravinsky 75004 Paris. Introduction to 3DTVs project The goal of the 3DTVS project is to devise scalable 3DTV AV content description, indexing, search and browsing methods across open platforms, by using mobile and desktop user interfaces and to incorporate such functionalities in 3D audiovisual content archives. 3D multichannel audio analysis targets audio event detection based on fusion techniques that combine the feature analysis performed in the individual channels as well as source localization and separation algorithms for the detection of moving audio sources. The results will be used in 3D audio/cross-modal indexing and retrieval. Multimodal 3D audiovisual content analysis will built on the results of 3D video and audio analysis. 3DTV content description and search mechanisms will be developed to enable fast reply to semantic queries. Role of IRCAM in the 3DTV Project In the 3DTVs project, IRCAM is in charge of the research and development of technologies related to - Audio event detection using multi-channel audio scenes - Speaker diarization - Segmentation into Movie scene from the audio signal - Sound source separation, localization and identification Position description Hired Researcher will be in charge of the development of technologies related to:
The Researchers will also collaborate with the development team and participate in the project activities (evaluation, meetings, specifications). Required profiles
· High productivity, methodical works, excellent programming style. Salary According to background and experience Applications Please send an application letter together with your resume and any suitable information addressing the above issues preferably by email to: peeters_a_t_ircam dot fr with cc to vinet_a_t_ircam dot fr, roebel_at_ircam_dot_fr
L’Ircam recrute un Chercheur H/F – en CDD de 18 mois et à temps plein – Projet 3DTVs Poste disponible à partir d'avril 2012
Présentation de l’Ircam L'Ircam est une association à but non lucratif, associée au Centre National d'Art et de Culture Georges Pompidou, dont les missions comprennent des activités de recherche, de création et de pédagogie autour de la musique du XXème siècle et de ses relations avec les sciences et technologies. Au sein de son département R&D, des équipes spécialisées mènent des travaux de recherche et de développement informatique dans les domaines de l'acoustique, du traitement des signaux sonores, des technologies d’interaction, de l’informatique musicale et de la musicologie. L'Ircam est situé au centre de Paris à proximité du Centre Georges Pompidou au 1, Place Stravinsky 75004 Paris.
Introduction au projet 3DTVs L'objectif du projet 3DTVs est de concevoir des descriptions évolutives des contenus 3DTV, leur indexation, leur recherche ainsi que la conception de méthodes de navigation sur toutes des plateformes ouvertes, en utilisant des interfaces utilisateurs mobiles et fixes et d'intégrer de telles fonctionnalités 3D dans les archives de contenus audiovisuels. L’analyse multi canal audio 3D vise la détection d’événements audio basés sur des techniques de fusion combinant l'analyse audio effectuée dans les canaux individuels ainsi que des algorithmes de localisation et de séparation de source pour la détection des mouvements des sources audio. Les résultats seront utilisés pour l’indexation 3D audio et cross modale ainsi que pour la recherche. L’indexation audio/ video multimodale 3D des contenus audiovisuels s’appuiera sur les résultats de l’indexation vidéo 3D et audio 3D. Des méthodes de description de contenu et de recherche seront développées afin de permettre des réponses rapides aux recherches sémantiques.
Rôle de l’Ircam dans le projet Quaero Dans le projet 3DTVs, l'Ircam est en charge de la recherche et du développement des technologies relatives à la - Détection des événements audio en utilisant les scènes audio multi canal - Segmentation en tours de parole - Séparation, localisation et identification des sources sonores
Missions Le Chercheur sera en charge du développement des technologies liées à: - Détection des événements audio en utilisant les scènes audio multi canal Le chercheur collaborera également avec l'équipe de développement et participera aux activités du projet (évaluation, réunions, spécification).
Profil recherché
Salaire Selon formation et expérience professionnelle
Candidatures Prière d'envoyer une lettre de motivation et un CV détaillant le niveau d'expérience/expertise dans les domaines mentionnés ci-dessus (ainsi que tout autre information pertinente) à peeters _a_t_ ircam dot fr avec copie à vinet _a_t_ ircam dot fr, roebel _at_ ircam _dot_ fr If you already applied for this position, please just send us a quick email telling us you are still interrested and we get back to you.
| ||
6-19 | (2012-03-27) Post Doctoral Fellow or Research Associate, Toronto, Canada Position: Post Doctoral Fellow or Research Associate (scientific) Site: Toronto Rehabilitation Institute, University Centre, Toronto, Canada
KEY RESPONSIBILITIES:
KEY REQUIREMENTS:
ASSET REQUIREMENTS:
Alshaer dot Hisham at torontorehab dot on dot ca
| ||
6-20 | (2012-04-02) Research position in Spoken Language Dialogue Systems Development for Serious Games ; University of Ulm Germany Research Position with perspective of a PhD degree in Spoken Language -- Wolfgang Minker Ulm University Communications Engineering - Dialogue Systems Albert-Einstein-Allee 43 D-89081 Ulm Phone: +49 731 502 6254/-6251 Fax: +49 731 501 226254 http://dialogue-systems.org
| ||
6-21 | (2012-04-04) PhD fellowship- Fondazione Bruno Kessler (FBK), Trento, Italy
| ||
6-22 | (2012-04-04) Post-Doctoral Research Position, Aalto University Post-Doctoral Research Position, Aalto University
Title: Statistical speech synthesis Department: Department of Signal Processing and Acoustics
URL: http://spa.aalto.fi/en/ Start date: August-October 2012 Duration: 12-18 months contract
Department of Signal Processing and Acoustics, Aalto University (Espoo, Finland), invites applications for a post-doctoral researcher position in speech technology. The position is funded by the Simple4all project (http://simple4all.org/), which is a collaboration between Aalto University, University of Edinburgh (coordinator), University of Helsinki, Universidad Politécnica de Madrid, and Universitatea Tehnica Cluj-Napoca. Simple4All is a 3 year project, funded by EC’s FP7 ICT Programme, whose general aim is to create speech synthesis technology that learns from data with little or no expert supervision and continually improves itself, simply by being used.
The work at the Department of Signal Processing and Acoustics focuses on novel vocoding technologies in statistical parametric speech synthesis. More specifically, we are interested in utilizing such speech models in statistical speech synthesis that are closer to the human speech production mechanism and are inherently able to produce many voice qualities. Applicants for the post-doctoral researcher position must have a PhD (or equivalent experience) in speech processing, digital signal processing or computer science. They must have background in statistical speech synthesis, experience in the development of vocoders is particularly appreciated. In addition, experience of project development and project leadership in a research context, together with excellent communication, presentation, and organisational skills are highly desirable.
To apply, please send your CV (.pdf format) including a list of publications and your contact information, a statement of research interests and contact information for at least 2 references. Applications must be sent to paavo.alku@aalto.fi using the subject line: Post-doc position in statistical speech synthesis Application deadline: 30 June 2012
| ||
6-23 | (2012-04-15) Full Time Research Programmer, Dialog Research Center, CMU Pittsburgh Full Time Research Programmer, Dialog Research Center
| ||
6-24 | (2012-04-20) PhD grant: Prosodic markers at IRIT Toulouse Modélisation de trajectoires de marqueurs prosodiques et linguistiques ; application à la caractérisation des intentions des intervenants dans les discours audiovisuels
Contact Jérôme Farinas, jfarinas@irit.fr équipe SAMOVA http://www.irit.fr/recherches/SAMOVA/
Description du sujet Dans le domaine du traitement automatique de l'audio, les systèmes actuels sont parvenus à une assez grande maturité pour extraire de façon plutôt fiable des informations sur les locuteurs présents, la langue utilisée et la transcription de la parole. Un des objectifs de la recherche actuelle consiste à utiliser ces informations afin de structurer les interventions des locuteurs et plus largement le contenu radiophonique et télévisuel.
Dans ce contexte, l'équipe SAMOVA de l'IRIT a acquis ces dernières années de fortes compétences en modélisation et segmentation automatique en locuteurs [Louradour 2007, El Khoury 2010], en identification automatique de langues [Pellegrino 1998, Farinas 2002, Rouas 2005], en segmentation parole/musique/chant [Pinquier 2004, Lachambre 2009], en extraction de jingle [Pinquier 2004], en transcription de la parole [Campagne ESTER 2004], en recherche de zones de parole conversationnelle [Projet EPAC 2010] et de mots-clés [Le Blouch 2009]. En s'appuyant sur ces travaux, l'équipe travaille sur la structuration des émissions en se basant sur les interventions des locuteurs et leurs interactions [Bigot 2011] ainsi que sur la vidéo [Ercolessi 2011].
A partir d'une caractérisation du rôle des intervenants (présentateur, locuteur dominant...) notre objectif est d'étudier plus précisément les interactions entre locuteurs afin de distinguer ce qui dans le message relève de l'interaction (ouverture, clôture, présentation d'un invité, gestion des tours de parole) et des échanges d'opinion. Plus largement, le sujet de thèse proposé vise à étudier l'intention dans les interventions audiovisuelles de personnes. La modélisation des intentions est principalement basée sur la modélisation de la prosodie, qui a travers l'intonation et le rythme permet d'influer sur la forme du discours. Cette modélisation devra prendre en compte la prosodie à court ou long terme [Farinas2002,Rouas2004]. Deux niveaux de modélisations seront donc mis en œuvre afin de caractériser la modalité de la phrase et la modification de la prosodie des mots. Cela passera par la choix de paramètres prosodiques appropriée (F0, energie) et la modélisation statistique de ces paramètres. L'évolution temporelle pourra être prise en compte en utilisant des modélisations stochastiques, des modélisations de trajectoires. Cette étude se déroulera en deux phases :
Les applications de cette recherche concernent la structuration de contenus audiovisuels pour aider à l'archivage documentaire et la recherche d'information dans ces contenus. Cette structuration et caractérisation de zones d'interaction présente également un intérêt pour la constitution de résumés audio-visuels.
Le candidat devra posséder un Master avec de fortes compétences en informatique. Des connaissances en traitement du signal, en reconnaissance de la parole seraient souhaitables (reconnaissance de la parole et prosodie).
Références [Louradour 2007] Noyaux de séquences pour la vérification du locuteur par Machines à Vecteurs de Support. Thèse de doctorat, Université Paul Sabatier, janvier 2007 [El Khoury 2010] Unsupervised Video Indexing based on Audiovisual Characterization of Persons. Thèse de doctorat, Université de Toulouse, juin 2010 [Pellegrino 1998] Une approche phonétique en identification automatique des langues : la modélisation acoustique des systèmes vocaliques. Thèse de doctorat, Université Paul Sabatier, décembre / december 1998. [Farinas 2002] Une modélisation automatique du rythme pour l'identification des langues. Thèse de doctorat, Université Paul Sabatier, novembre 2002. [Rouas 2005] Caractérisation et identification automatique des langues. Thèse de doctorat, Université Paul Sabatier, mars 2005. [Pinquier 2004] Indexation sonore : recherche de composantes primaires pour une structuration audiovisuelle. Thèse de doctorat, Université Paul Sabatier, décembre 2004. [Lachambre 2009] Caractérisation de l'environnement musical dans les documents audiovisuels. Thèse de doctorat, Université de Toulouse, décembre 2009. [Campagne ESTER 2004] G. Gravier, J.F. Bonastre, S. Galliano, E. Geoffrois, K. Mc Tait and K. Choukri. ESTER, une campagne d'évaluation des systèmes d'indexation d'émissions radiophoniques, Proc. Journées d'Etude sur la Parole, Avril 2004. [projet EPAC 2010] Yannick Estève, Thierry Bazillon, Jean-Yves Antoine, Frédéric Béchet, Jérôme Farinas. The EPAC corpus: manual and automatic annotations of conversational speech in French broadcast news (regular paper). Dans : Language Resources and Evaluation Conference (LREC 2010), Valletta, Malte, 19/05/2010-21/05/2010, Nicoletta Calzolari, Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk (Eds.), European Language Resources Association (ELRA), p. 1686-1689, 2011. [Le Blouch 2009] Décodage acoustico-phonétique et applications à l'indexation audio automatique. Thèse de doctorat, Université Paul Sabatier, juin 2009. [Bigot 2011] Benjamin Bigot, Isabelle Ferrané, Julien Pinquier, Régine André-Obrecht. Speaker Role Recognition to help Spontaneous Conversational Speech Detection (regular paper). Dans : International workshop on Searching Spontaneous Conversational Speech SCSS (SCSS 2010), Firenze, Italy, 25/10/2010-29/10/2010, ACM, p. 5-10, octobre 2010. [Ercolessi 2011] Philippe Ercolessi, Hervé Bredin, Christine Sénac and Philippe Joly, Segmenting TV series into scenes using speaker diarization, WIAMIS 12th International Workshop on Image Analysis for Multimedia Interactive Services, Delft, Pays-Bas,13-15 avril 2011.
Mots clés Traitement automatique de la parole, décodage phonétique, recherche de mots clés, prosodie, acoustique, structuration en émissions, vidéo
Kewords Automatic Speech Processing, Phonetic Decoding, Keyword Spotting, Prosody, Acoustic, Structuring Programs, Video
| ||
6-25 | (2012-04-20) Ingénieur at INRIA France Inria recherche un ingénieur jeune diplômé pour développer sa boîte à outils de séparation de sources audio FASST (http://bass-db.gforge.inria.fr/fasst/) et effectuer un travail de recherche sur la reconnaissance de la parole robuste au bruit.
| ||
6-26 | (2012-05-01) PhD Reconnaissance automatique de la parole continue : parole spontanée LORIA Nancy France Sujet de thèse :Reconnaissance automatique de la parole continue : parole spontanée
Encadrants pour ce sujet : Lieu : Inria-LORIA Nancy
Le sujet est affiché sur le site de l'école doctorale IAEM http://www.iaem.uhp-nancy.fr/ , rubrique 'propositions contrats doctoraux'. Date limite du depot de candidature : le 1-er juin
Conetxte : La reconnaissance de la parole est un processus par lequel un ordinateur transforme le signal acoustique de la parole prononcée en texte. Pendant ce processus, le système de reconnaissance utilise des modèles acoustiques, des modèles de langage et un lexique de prononciations.
L’objet de cette thèse est d’apporter des éléments de solution à ce problème en proposant de nouvelles méthodes qui permettent de mieux prendre en compte les caractéristiques de la prononciation spontanée dans le cadre de la reconnaissance automatique de la parole.
Références : [Brun et al.2005] A. Brun, C. Cerisara, D. Fohr et I. Illina. ANTS : le système de transcription automatique du LORIA. WorkShop ESTER, 2005.
| ||
6-27 | (2012-05-13) PhD position: Caractérisation de l'ambiance sonore dans des enregistrements ethnomusicologiques IRIT Toulouse France Titre : Caractérisation de l?ambiance sonore dans des enregistrements ethnomusicologiques
Responsables : Régine André-Obrecht et Julien Pinquier (IRIT, équipe SAMoVA) obrecht@irit.fr et pinquier@irit.fr
Cette thèse concerne le traitement de données ethnomusicologiques issues des archives du CNRS-Musée de l?Homme, gérées par le Centre de Recherche en EthnoMusicologie (CREM) du Laboratoire d'Ethnologie et de Sociologie Comparative (LESC). Il s?agit de documents en cours de numérisation et d?informatisation (3500 heures d?enregistrements inédits, de 1900 à nos jours, de musiques traditionnelles et d?enquêtes ethnographiques du monde entier et 3500 heures de documents anciens et rares). Cette collection est d?une grande importance historique et est unique au monde. Dans ce contexte applicatif, il est nécessaire de mettre au point un ensemble d'outils de traitement automatique de l'audio (parole, musique, chant, bruits?) afin de produire une indexation (semi)automatique pour un accès intelligent à la collection d'enregistrements sonores. Ce travail est principalement à destination de chercheurs (experts) en ethnomusicologie.
L?étude envisagée a pour objectif une caractérisation plus fine des composantes Parole, Musique, Chant, Bruits afin de définir l?environnement sonore générique. De plus, l?introduction d?une approche semi-supervisée (prise en compte de métadonnées disponibles ou de l?utilisateur) doit permettre la caractérisation d?environnements sonores spécifiques.
Après s?être approprié les différents systèmes précédemment développées à l?IRIT, concernant la détection de parole et de musique, le doctorant aura en charge leur adaptation au corpus du projet. L?analyse des zones de parole et de voix chantée détectées doit conduire à une segmentation en tours de parole et en tours de chant, suivie du regroupement de ces segments par recherche de similarité des voix. Dès lors que les enregistrements sonores sont effectués dans des conditions naturelles et lorsque les zones de parole, de musique et de chant sont identifiées, restent des zones sonores digne d?un intérêt pour un ethnomusicologique car leur écoute permet de préciser le contexte sonore de la session de l?enregistrement, ce que l?on appelle « l?ambiance sonore ». Il est proposé de localiser ces zones de bruit d?intérêt et de spécifier un étiquetage. Pour ce faire, deux stratégies sont envisagées : - un mode supervisé en utilisant les attributs acoustiques classiques (approche générique), - un mode non-supervisé en introduisant des connaissances issues des ethnomusicologues (approche spécifique) via la plateforme Telemeta (http://crem.telemeta.org/).
Ce doctorat sera financé par le projet ANR DIADEMS qui démarrera en octobre 2012. Il serait appréciable que le candidat ait des connaissances en reconnaissance de formes et en traitements de la parole et de la musique.
| ||
6-28 | (2012-06-01) Two positions at Nuance Belgium Nuance is the leading provider of speech and imaging solutions for businesses and consumers around the world. Our technologies, applications and services make the user experience more compelling by transforming the way people interact with information and how they create, share and use documents. Every day, millions of users and thousands of businesses, experience Nuance by calling directory assistance, getting account information, dictating patient records, telling a navigation system their destination, or digitally reproducing documents that can be shared and searched. Making each of those experiences productive and compelling is what Nuance is all about. Speech Recognition Specialist Merelbeke, Belgium Permanent role Response to Craig.Robertson@Nuance.com Nuance Mobile builds innovative, intelligent and intuitive touch and speech interfaces to simplify and enhance the way people interact with mobile devices, applications, and services. Nuance Mobile solutions make mobile devices and in-car systems easier to use, automate customer self-service, and optimize the access and discovery of even the most advanced mobile applications and content - regardless of technical know-how, location, environment, or physical and literacy capabilities. As a contributing member of Nuance, you will work within a dynamic team environment to develop, support, market and sell our award-winning software applications. We offer competitive compensation packages and a challenging technical but casual work environment. Join our dynamic, entrepreneurial team that operates worldwide (Europe, US, APAC). Be a part of our fast growing track of continuing success. For more information, please see www.nuance.com. Nuance is an equal opportunity employer. Responsibilities As a Speech Recognition specialist at Nuiance you will work with peers from other teams arround the world to investigate new & best usage of speech recognition for music and/or POI vertical domains. You will be closely working together with our R&D dpt to understand what is and what is not doable with the current limitation of the technology, and help customer's and Nuance internal integration teams to include Nuance technologies into successful products in an efficient way. Representative tasks will include:
Qualifications
Preferred: • Experience with embedded hardware platforms, embedded operating systems, and embedded software development is desirable • Experience with Python and SQLite is highly desirable • Windows CE or Linux or QNX OS
Nuance is the leading provider of speech and imaging solutions for businesses and consumers around the world. Our technologies, applications and services make the user experience more compelling by transforming the way people interact with information and how they create, share and use documents. Every day, millions of users and thousands of businesses, experience Nuance by calling directory assistance, getting account information, dictating patient records, telling a navigation system their destination, or digitally reproducing documents that can be shared and searched. Making each of those experiences productive and compelling is what Nuance is all about. NLP Processing Engineer Merelbeke, Belgium Permanent role Response to Craig.Robertson@Nuance.com Qualifications Excellent background in statistics, pattern recognition, and/or signal processing • Expertise in natural language processing, computational linguistics, statistical language modeling, search, and/or machine translation • Strong programming skills, ideally in Python, Java, and/or C. • Skills related to text processing, scripting languages, regular expressions • Excellent oral and written communications skills in English. • Ability to carry out focused and goal-oriented research and development, ability to assume responsibility for one’s work • Ability to work in an international team as well as independently in fast-paced environment • Ability to creatively solve problems while leveraging existing technology with an eye for efficiency.
PhD or equivalent research experience are a strong asset
• Good knowledge of speech recognition theory, acoustics, and/or psychoacoustics
• User interface, human—machine interaction, and dialogue system development experience
• Operational knowledge of languages other than English
MSc, ideally PhD in computer science, engineering, physics, mathematics, or other technical field
Craig Robertson Recruitment Manager EMEA
| ||
6-29 | (2012-06-11) PhD Student 'Increasing Robustness of Speech Recognition' Radbout University Nijmegen NL
PhD Student 'Increasing Robustness of Speech Recognition' (1,0 fte) Renewed Job opening ! Faculty of Arts Vacancy number: 23.02.12 Closing date: 7 July 2012 Responsibilities As a PhD student you will participate in the FP7 Marie Curie Initial Training Network Investigating Speech Processing In Realistic Environments (INSPIRE). This network provides research opportunities for 13 PhD students and 3 postdocs. You will become a member of an international team of researchers whose aim is to gain a better understanding of how listeners recognize speech, even under non-ideal circumstances. You will contribute to urgently needed solutions that help alleviate the serious communication problems that arise, especially for older and hearing-impaired persons, when different combinations of 'adverse' conditions affect the speech processing system. You will conduct your research in the framework of of a project called ’Increasing robustness of speech recognition by using multiple signal representations’. Speech processing in the human brain presumably involves competition between multiple, intermediate signal representations. The redundancy of these different representations are assumed to help improve the robustness of recognition. In some cases, however, they may lead to conflicting interpretations resulting in intelligibility problems. The goal of this PhD project is to investigate to what extent human recognition errors with regard to speech in ’adverse’ conditions can be replicated by machines that were trained on multiple input representations which are partially redundant. Work environment The project will be carried out at the Centre for Language and Speech Technology (CLST). CLST is a research unit within the Faculty of Arts of Radboud University Nijmegen and hosts a large international group of senior researchers and PhD students who conduct research at the frontier of science and develop innovative applications. What we expect from you You should: - hold a Master's degree in engineering or science; - have a strong background in machine learning (experience with dynamic Bayesian networks would be an advantage), mathematical and/or statistical modelling, and signal processing; - have excellent programming skills; - be willing to spend several months at the Technical University of Denmark. Prior exposure to courses in linguistics or speech- or hearing-related fields would be an advantage. Furthermore, you should comply with the rules set forward by the FP7 Marie Curie ITNs, i.e. you should: - not have resided of performed your main research activity in the Netherlands for more than 12 months in the last three years; - be willing to work in at least one other country in the INSPIRE network; - have less than 4 years of research experience since you obtained your Master’s degree, and not hold a PhD. What we have to offer We offer you: - employment: 1,0 fte; - in addition to the salary: an 8% holiday allowance and an 8.3% end-of-year bonus; - the starting salary is €2,042 per month on a full-time basis; the salary will increase to €2,492 per month in the third year; - in addition to the salary, you will receive travel and training allowances on the basis of generous Marie Curie ITN provisions; - duration of the contract: 18 months with the possibility of extension by another 18 months. Are you interested in our excellent employment conditions (http://www.ru.nl/newstaff/working_at_radboud/conditions_of/)? The Radboud University is an equal opportunity employer. Female researchers are strongly encouraged to apply for this vacancy. Would you like to know more? Further information on: Investigating Speech Processing In Realistic Environments (http://www.inspire-itn.eu/) Dr. Bert Cranen, assistant professor Speech science Telephone: +31 24 3612904 E-mail: B.Cranen@let.ru.nl Applications Are you interested? Please include with your application: - a CV; - a 2-page description of your research interests explaining why the INSPIRE goals appeal to you, how the INSPIRE team may benefit from your participation, and your career perspectives as expected from INSPIRE; - university transcripts; - names and email addresses of two potential referees (or alternatively letters of recommendation). It is Radboud University Nijmegen's policy to only accept applications by e-mail. Please send your application, stating vacancy number 23.02.12, to vacatures@let.ru.nl, for the attention of drs. M.J.M. van Nijnatten, before 7 July 2012. No commercial propositions ple
| ||
6-30 | (2012-06-11) Speech Recognition Scientist at Sunnyvale Job Description Title: Speech Recognition Scientist Location: Sunnyvale Status: Regular, Full-time, Exempt We are a well-funded start-up with cutting-edge speech recognition with a disruptive technology platform applicable to a variety of markets and environments for spoken dialog interaction. With an exciting mix of evolving speech-enabled products, we offer excellent opportunities for 'rock star' scientists to grow and share in our success. We offer competitive compensation, excellent benefits and an ultra-creative work environment. We are currently seeking a talented speech recognition scientist to join our hands-on team developing our platform spoken dialog interactions. The ideal candidate has a proven track record for optimizing speech recognition performance. This work includes creating the necessary scripts and tools to experiment with novel algorithms to optimize recognition and natural language understanding throughout all stages of a multi-modal dialog system. Among others, you will be asked to work on statistical language modeling, as well as language model and acoustic model adaptation.
Responsibilities Develop tools and enhance algorithms that facilitate deployment and tuning of spoken dialog systems Analyze speech recognition performance andimplement solutions to provide optimum accuracy Use, improve and create research tools to create, update and optimize language models and speech recognition systems for multiple domains Evaluate and develop different language modeling and rescoring based on machine learning algorithms Document language model development and adaptation process Work with the team to design future product releases
Required Skills & Experience Ph.D. or Master’s degree in computer science, electrical engineering, comp. linguistics, or equivalent Speech and/or language processing background (in research and/or industry) In-depth scripting experience with Python, Perl or similar Ability to own and drive experimental definition, investigations and ultimately be responsible for the speech recognition performance Passion for solving difficult problems Strong planning and communication skills Strong analytical and problem solving skills and ability to troubleshoot issues
Pluses: Background in natural language processing, machine learning and/or computational linguistics Programming experience in C/C++ Qualified candidates are encouraged to send your resume and cover letter to swright@fluentialinc.com. Fluential, LLC, is an Equal Opportunity Employer. To learn more, please visit us online at http://www.fluentialinc.com
| ||
6-31 | (2012-06-11) Voice Developer (m/f) Speech Technology Automotive, Nuance Turin (Italy) Nuance is a leading provider of speech and imaging solutions for businesses and consumers around the world. Our technologies, applications and services make the user experience more compelling by transforming the way people interact with information and how they create, share and use documents. Every day, millions of users and thousands of businesses, experience Nuance by calling directory assistance, getting account information, dictating patient records, telling a navigation system their destination, or digitally reproducing documents that can be shared and searched. Making each of those experiences productive and compelling is what Nuance is all about.
As a contributing member of Nuance, you will work within a dynamic team environment to develop, support, market and sell our award-winning software applications. We offer competitive compensation packages and a casual work environment. Join our dynamic, entrepreneurial team that operates worldwide (Europe, US, APAC). Be a part of our fast growing track of continuing success.
For more information, please see www.nuance.com. Nuance is an equal opportunity employer.
For our office in Turin / Italy we are currently looking for a fulltime
Voice Developer (m/f) Speech Technology Automotive
Key responsibilities.
Qualifications.
We offer. At Nuance Communications we believe our people are our most valuable asset. We offer competitive compensation packages and we offer you career development opportunities in a challenging technical but casual work environment. As a Nuance team member you will work within a dynamic international team operating worldwide. Does Nuance speak to you? If you are interested in joining our team, please send your English CV including earliest starting date and salary expectations via our Recruiting tool https://jobs-nuance.icims.com/jobs/7804/job.
Nuance's Mobility Division builds innovative, intelligent and intuitive touch and speech interfaces to simplify and enhance the way people interact with mobile devices, applications, and services. Nuance Mobile solutions make mobile devices and in-car systems easier to use, automate customer self-service, and optimize the access and discovery of even the most advanced mobile applications and content - regardless of technical know-how, location, environment, or physical and literacy capabilities.
ILONA ALEXANDRA HOLTZ Recruiter - Employment Specialist DACH Human Resources Nuance Communications Aachen GmbH Site Ulm Soeflingerstr. 100 D-89077 Ulm, Germany Fon +49 731 - 379 50 1166 Fax +49 731 - 379 50 1106 (Zentrale) Mobil +49 170 56 15 235
WWW.NUANCE.COMThe experience speaks for itself ™ Geschäftsführung/Director: Jan Anthierens Sitz der Gesellschaft/Registered Office: Aachen Registergericht/Court of Registration: Aachen Reg. Nr.: HRB 11872 USt-ID/VAT: DE 813191696 This electronic transmission and any files transmitted with it are confidential. It is transmitted for the sole use of the person(s) to whom it is addressed. Any further distribution or copying is prohibited. If you receive this message in error, please inform the sender immediately, do not use it or disclose its contents and delete it from your system. Please note that Nuance cannot guarantee that the transmission will be secure or error-free.
Experience Nuance in the web: http://www.youtube.com/watch?v=32QbXebhiag&list=UUtmZ1Vk2yFJkOe1DYQwLgag&index=1&feature=plcp Or
http://www.youtube.com/watch?v=RkiYr8aw5pE&feature=related
| ||
6-32 | (2012-06-11) Speech Output Designer (m/f) Speech Technology Automotive Nuance at Merelbeke Belgium For our office in Merelbeke / Belgium we are currently looking for a fulltime
Speech Output Designer (m/f) Speech Technology Automotive
Key responsibilities.
Qualifications.
We offer. At Nuance Communications we believe our people are our most valuable asset. We offer competitive compensation packages and we offer you career development opportunities in a challenging technical but casual work environment. As a Nuance team member you will work within a dynamic international team operating worldwide. Does Nuance speak to you? If you are interested in joining our team, please send your English CV including earliest starting date and salary expectations via our Recruiting tool https://jobs-nuance.icims.com/jobs/7801/job .
Nuance's Mobility Division builds innovative, intelligent and intuitive touch and speech interfaces to simplify and enhance the way people interact with mobile devices, applications, and services. Nuance Mobile solutions make mobile devices and in-car systems easier to use, automate customer self-service, and optimize the access and discovery of even the most advanced mobile applications and content - regardless of technical know-how, location, environment, or physical and literacy capabilities.
ILONA ALEXANDRA HOLTZ Recruiter - Employment Specialist DACH Human Resources Nuance Communications Aachen GmbH Site Ulm Soeflingerstr. 100 D-89077 Ulm, Germany Fon +49 731 - 379 50 1166 Fax +49 731 - 379 50 1106 (Zentrale) Mobil +49 170 56 15 235
WWW.NUANCE.COMThe experience speaks for itself ™ Geschäftsführung/Director: Jan Anthierens Sitz der Gesellschaft/Registered Office: Aachen Registergericht/Court of Registration: Aachen Reg. Nr.: HRB 11872 USt-ID/VAT: DE 813191696 This electronic transmission and any files transmitted with it are confidential. It is transmitted for the sole use of the person(s) to whom it is addressed. Any further distribution or copying is prohibited. If you receive this message in error, please inform the sender immediately, do not use it or disclose its contents and delete it from your system. Please note that Nuance cannot guarantee that the transmission will be secure or error-free.
| ||
6-33 | (2012-06-11) Speech Output Designer (m/f) Speech Technology Automotive at Nuance Turin Italy For our office in Turin / Italy we are currently looking for a fulltime
Speech Output Designer (m/f) Speech Technology Automotive
Key responsibilities.
Qualifications.
We offer. At Nuance Communications we believe our people are our most valuable asset. We offer competitive compensation packages and we offer you career development opportunities in a challenging technical but casual work environment. As a Nuance team member you will work within a dynamic international team operating worldwide. Does Nuance speak to you? If you are interested in joining our team, please send your English CV including earliest starting date and salary expectations via our Recruiting tool https://jobs-nuance.icims.com/jobs/7801/job .
Nuance's Mobility Division builds innovative, intelligent and intuitive touch and speech interfaces to simplify and enhance the way people interact with mobile devices, applications, and services. Nuance Mobile solutions make mobile devices and in-car systems easier to use, automate customer self-service, and optimize the access and discovery of even the most advanced mobile applications and content - regardless of technical know-how, location, environment, or physical and literacy capabilities.
ILONA ALEXANDRA HOLTZ Recruiter - Employment Specialist DACH Human Resources Nuance Communications Aachen GmbH Site Ulm Soeflingerstr. 100 D-89077 Ulm, Germany Fon +49 731 - 379 50 1166 Fax +49 731 - 379 50 1106 (Zentrale) Mobil +49 170 56 15 235
WWW.NUANCE.COMThe experience speaks for itself ™ Geschäftsführung/Director: Jan Anthierens Sitz der Gesellschaft/Registered Office: Aachen Registergericht/Court of Registration: Aachen Reg. Nr.: HRB 11872 USt-ID/VAT: DE 813191696 This electronic transmission and any files transmitted with it are confidential. It is transmitted for the sole use of the person(s) to whom it is addressed. Any further distribution or copying is prohibited. If you receive this message in error, please inform the sender immediately, do not use it or disclose its contents and delete it from your system. Please note that Nuance cannot guarantee that the transmission will be secure or error-free.
| ||
6-34 | (2012-06-11) Co Producer (m/f) Speech Technology Automotive at Nuance Turin Italy For our office in Turin / Italy we are currently looking for a fulltime
Co Producer (m/f) Speech Technology Automotive
Key responsibilities:
Qualifications:
We offer. At Nuance Communications we believe our people are our most valuable asset. We offer competitive compensation packages and we offer you career development opportunities in a challenging technical but casual work environment. As a Nuance team member you will work within a dynamic international team operating worldwide. Does Nuance speak to you? If you are interested in joining our team, please send your English CV including earliest starting date and salary expectations via our Recruiting tool https://jobs-nuance.icims.com/jobs/7799/job .
Nuance's Mobility Division builds innovative, intelligent and intuitive touch and speech interfaces to simplify and enhance the way people interact with mobile devices, applications, and services. Nuance Mobile solutions make mobile devices and in-car systems easier to use, automate customer self-service, and optimize the access and discovery of even the most advanced mobile applications and content - regardless of technical know-how, location, environment, or physical and literacy capabilities.
ILONA ALEXANDRA HOLTZ Recruiter - Employment Specialist DACH Human Resources Nuance Communications Aachen GmbH Site Ulm Soeflingerstr. 100 D-89077 Ulm, Germany Fon +49 731 - 379 50 1166 Fax +49 731 - 379 50 1106 (Zentrale) Mobil +49 170 56 15 235
WWW.NUANCE.COMThe experience speaks for itself ™ Geschäftsführung/Director: Jan Anthierens Sitz der Gesellschaft/Registered Office: Aachen Registergericht/Court of Registration: Aachen Reg. Nr.: HRB 11872 USt-ID/VAT: DE 813191696 This electronic transmission and any files transmitted with it are confidential. It is transmitted for the sole use of the person(s) to whom it is addressed. Any further distribution or copying is prohibited. If you receive this message in error, please inform the sender immediately, do not use it or disclose its contents and delete it from your system. Please note that Nuance cannot guarantee that the transmission will be secure or error-free.
| ||
6-35 | (2012-06-11) Tools Developer (m/f) Software Engineering C/C++ Speech Technology Automotive at Nuance Turin Italy For our office in Turin / Italy we are currently looking for a fulltime
Tools Developer (m/f) Software Engineering C/C++ Speech Technology Automotive
Key responsibilities:
Qualifications:
We offer. At Nuance Communications we believe our people are our most valuable asset. We offer competitive compensation packages and we offer you career development opportunities in a challenging technical but casual work environment. As a Nuance team member you will work within a dynamic international team operating worldwide. Does Nuance speak to you? If you are interested in joining our team, please send your English CV including earliest starting date and salary expectations via our Recruiting tool https://jobs-nuance.icims.com/jobs/7797/job .
Nuance's Mobility Division builds innovative, intelligent and intuitive touch and speech interfaces to simplify and enhance the way people interact with mobile devices, applications, and services. Nuance Mobile solutions make mobile devices and in-car systems easier to use, automate customer self-service, and optimize the access and discovery of even the most advanced mobile applications and content - regardless of technical know-how, location, environment, or physical and literacy capabilities.
ILONA ALEXANDRA HOLTZ Recruiter - Employment Specialist DACH Human Resources Nuance Communications Aachen GmbH Site Ulm Soeflingerstr. 100 D-89077 Ulm, Germany Fon +49 731 - 379 50 1166 Fax +49 731 - 379 50 1106 (Zentrale) Mobil +49 170 56 15 235
WWW.NUANCE.COMThe experience speaks for itself ™ Geschäftsführung/Director: Jan Anthierens Sitz der Gesellschaft/Registered Office: Aachen Registergericht/Court of Registration: Aachen Reg. Nr.: HRB 11872 USt-ID/VAT: DE 813191696 This electronic transmission and any files transmitted with it are confidential. It is transmitted for the sole use of the person(s) to whom it is addressed. Any further distribution or copying is prohibited. If you receive this message in error, please inform the sender immediately, do not use it or disclose its contents and delete it from your system. Please note that Nuance cannot guarantee that the transmission will be secure or error-free.
| ||
6-36 | (2012-06-11) Voice Manager (m/f) Speech Technology Automotive at Nuance Turin Italy Voice Manager (m/f) Speech Technology Automotive
Key responsibilities:
Qualifications:
We offer. At Nuance Communications we believe our people are our most valuable asset. We offer competitive compensation packages and we offer you career development opportunities in a challenging technical but casual work environment. As a Nuance team member you will work within a dynamic international team operating worldwide. Does Nuance speak to you? If you are interested in joining our team, please send your English CV including earliest starting date and salary expectations via our Recruiting tool https://jobs-nuance.icims.com/jobs/7795/job.
Nuance's Mobility Division builds innovative, intelligent and intuitive touch and speech interfaces to simplify and enhance the way people interact with mobile devices, applications, and services. Nuance Mobile solutions make mobile devices and in-car systems easier to use, automate customer self-service, and optimize the access and discovery of even the most advanced mobile applications and content - regardless of technical know-how, location, environment, or physical and literacy capabilities.
ILONA ALEXANDRA HOLTZ Recruiter - Employment Specialist DACH Human Resources Nuance Communications Aachen GmbH Site Ulm Soeflingerstr. 100 D-89077 Ulm, Germany Fon +49 731 - 379 50 1166 Fax +49 731 - 379 50 1106 (Zentrale) Mobil +49 170 56 15 235
WWW.NUANCE.COMThe experience speaks for itself ™ Geschäftsführung/Director: Jan Anthierens Sitz der Gesellschaft/Registered Office: Aachen Registergericht/Court of Registration: Aachen Reg. Nr.: HRB 11872 USt-ID/VAT: DE 813191696 This electronic transmission and any files transmitted with it are confidential. It is transmitted for the sole use of the person(s) to whom it is addressed. Any further distribution or copying is prohibited. If you receive this message in error, please inform the sender immediately, do not use it or disclose its contents and delete it from your system. Please note that Nuance cannot guarantee that the transmission will be secure or error-free.
| ||
6-37 | (2012-06-15) INESC-ID Open Positions, Lisbon Partugal INESC-ID Open Positions The Institution Instituto de Engenharia de Sistemas e Computadores, Investigação e Desenvolvimento em Lisboa (INESC-ID) is one of the most dynamic research institutes in Portugal in the areas of communication and information technologies. The activity of INESC-ID is focused on the following area: Interactive Intelligent Systems (http://www.inesc-id.pt).
Job Description Successful candidates will be integrated in an existing research group at INESC-ID and will conduct research focusing on:
By addressing the creation of intelligent and affective relations with machines, together with intelligent visualization and virtual environments, the candidate will develop techniques that will advance the state of the art in building dialogue, multi-modal interaction and affective relations with machines.
The work will be carried out in one of the three groups: Spoken Language Systems, Intelligent Agents and Synthetic Characters or Visualization and Multi-modal Interactions.
The position holder is expected to develop scientific research preferably within these topics, and will be encouraged to start their own projects in coordination with colleagues. Excellent candidates in related areas are also strongly encouraged to apply.
Qualifications Required Applicants should hold a PhD, be fluent in English and show evidence of autonomous research abilities. The should be willing to work in a team and also have a strong publication record. The successful candidate should propose an innovative research project with relevance for the research area where she/he will be integrated.
Contract Conditions INESC-ID is an equal opportunity employer that implements the principle of equal treatment and training irrespective of religion or belief, disability, age or sexual orientation in employment. The contract corresponds to a salary of a Research Assitant Professor in Portugal. Successful applicants will be in post from August 2012.
Application Deadline July 15th 2012
Application details
INESC-ID invites eligible individuals to submit their expressions of interest, which must include an application letter, a detailed CV, a 1-page outline of a proposed research program, and is highly recommended to include reference letters. All documents should be sent by email to applications@inesc-id.pt and/or snail mail to: Direcção INESC-ID, R. Alves Redol, 9, 1000-029 Lisboa, Portugal.
| ||
6-38 | (2012-06-15) Language Processing Software Engineer at ONMOBILE Language Processing Software Engineer At ONMOBILE SA, an IT and Telecom VAS company, we hire at the earliest possible date an experienced natural language processing (NLP) software engineer for new research and development projects on automatic speech recognition, text processing, and multilingual question-answering systems.
We are looking for a software development engineer with NLP development or research background on either commercial or academic speech recognition systems. You will be familiar with and have practical experience in the following areas: State-of-the-art NLP technologies (robust parsing, finite-state transducers FSTs, statistical language modelling etc.) Semantic Web technologies like RDF, OWL, SPARQL Ability to develop with Eclipse RCP Strong programming skill with modern programming languages (C++, Java) and scripting languages Expertise in speech recognition, acoustic modelling, and audio/video processing is a plus You should have an engineer school or university degree in computer science or related disciplines. A PhD or an equivalent level of experience would be helpful.
An application-oriented perspective and a concern for customers with strong analytical and problem solving skills are required. You should be capable to work independently when needed.
Other skills: transparent behaviour, clarity of expression, ability to work in a multi-cultural team.
Languages: fluent in English, French
The position is based in Paris, France.
Contact by email to WenXuan TENG (
| ||
6-39 | (2012-06-15) Speech and Audio Processing Software Engineer at ONMOBILE Speech and Audio Processing Software Engineer At ONMOBILE SA, an IT and Telecom VAS company, we are hiring at the earliest possible date an experienced Speech and Audio Processing Software Engineer for research and development projects on automatic speech recognition, text processing, and multilingual question-answering systems.
We are looking for a software development engineer with speech recognition and audio processing development or research background on either commercial or academic speech recognition systems. You will be familiar with and have practical experience in the following areas: State-of-the-art speech recognition technologies (decoder, language models, acoustic models, signal processing) and their implementation within efficient recognition and training systems Audio and signal processing for de-noising, acoustic feature extraction, audio fingerprinting, karaoke scoring etc. Strong programming skill with modern programming languages (C++, Java) and scripting languages Expertise in NLP technologies is a plus You should have an engineer school or university degree in signal processing or computer science or related disciplines. A PhD or an equivalent level of experience would be helpful.
An application-oriented perspective and a concern for customers with strong analytical and problem solving skills are required. You should be capable to work independently when needed.
Other skills: transparent behaviour, clarity of expression, ability to work in a multi-cultural team.
Languages: fluent in English, French
The position is based in Paris, France. Contact by email to WenXuan TENG (
| ||
6-40 | (2012-06-17) 2-4 PhD positions in Speech Technology and Communication at KTH Stockholm Sweden 2-4 PhD positions in Speech Technology and Communication The goal of the positions is to contribute to the research foundation for speech technology in tomorrow's conversational systems. Anticipated specializations
The positions include free tuition and are salaried 4 year employments, presently starting at 2800 euro/month increasing to 3400 euro/month for the last year. To get information about how to apply for the positions go to http://www.speech.kth.se/vacancies/
| ||
6-41 | (2012-06-21) Mandarin TTS (Text To Speech) Manager / Expert / Research Engineer Positions at Nuance: 6 Location : Shanghai, China Nuance Communications is a listed US$1.5B global software company and the world leader in speech, text and imaging solutions for businesses and consumers around the world, with aggressive growth plans in the Asia Pacific region.
Job title: Mandarin TTS (Text To Speech) Manager / Expert / Research Engineer Positions: 6 Location : Shanghai, China
Job description and requirements Overview: Nuance's Mobility Division builds innovative, intelligent and intuitive touch and speech interfaces to simplify and enhance the way people interact with mobile devices, applications, and services. Nuance Mobile solutions make mobile devices and in-car systems easier to use, automate customer self-service, and optimize the access and discovery of even the most advanced mobile applications and content - regardless of technical know-how, location, environment, or physical and literacy capabilities. Responsibilities:
Reporting to TTS manager, the research scientist will conduct and lead innovative research and development on speech synthesis technologies for Asian languages. In the role as research scientist, your goal will be to continuously drive improvements and innovation to the Nuance Chinese TTS system, for commercial deployment in all types of markets and platforms. Representative tasks will include: •Design, implementation, evaluation, optimization and testing of new algorithms and tools for text-to-speech synthesis, for both signal generation and text processing/understanding. •Product integration supervision of proven innovation results •Defining the team's innovation and technical agenda, in cooperation with TTS management •Creation of demonstrators and evaluators of new technologies. Required skills: Digital speech processing, strong mathematics knowledge, excellent computer programming skills preferably in C, C++ and scripting languages, familiarity with different OS and computing platforms, excellent English and communication skills, strong team player, proven track record of achievements in Chinese TTS R&D, fluent Mandarin speaking Preferred skills: Hands-on experiences in one or more of these areas, TTS R&D, software engineering, natural language processing and understanding, project management, parametric and/or unit selection TTS development Education: PhD or Master degree in EE or CS from a well known university You can also find the job details via searching job 7812 and 7806, 7808~7811 at http://www.nuance.com/company/careers/index.htm
Contact information: Further questions and resumes can be sent to Lily He, Recruiter for Nuance Greater China at lily.he@nuance.com Thank you for your time and consideration, I look forward to hearing from you!
Regards, Lily He
Recruiter, GreaterChina NUANCE COMMUNICATIONS, INC.
| ||
6-42 | (2012-06-30) Thèse CIFRE à Orange Labs (Issy les Moulineaux, Paris). Thèse CIFRE à Orange Labs (Issy les Moulineaux, Paris).
| ||
6-43 | (2012-06-12) Senior research scientist at Pearson Pearson has one defining goal: to help people progress in their lives through learning. We champion innovation and we invest in models for education that deliver on our promise for effective, accessible, and personal learning from early literacy, college and career readiness to professional education, through data informed instruction and inventive applications for mobile and digital learning.
Responsible for advancing the state of the company's automated grading technology while working with the sales, marketing and test development teams to solve practical problems from the field. The position demands a person who has broad interests and is motivated to design and implement improvements to the company’s system. Opportunities for further research in applicable areas are available.
Responsibilities of this role:
Personality attributes/skills required: • Experience in designing and running ASR experiments using HTK a plus
Please apply online at www.pearsoned.com/careers
| ||
6-44 | (2012-06-12) Research scientist at Pearson Pearson has one defining goal: to help people progress in their lives through learning. We champion innovation and we invest in models for education that deliver on our promise for effective, accessible, and personal learning from early literacy, college and career readiness to professional education, through data informed instruction and inventive applications for mobile and digital learning.
Research Scientist:
Responsible for advancing the state of the company's automated grading technology while working with the sales, marketing and test development teams to solve practical problems from the field. The position demands a person who has broad interests and is motivated to design and implement improvements to the company’s system. Opportunities for further research in applicable areas are available. Qualifications:
Personality attributes/skills required:
Apply online at www.pearsoned.com/careers
| ||
6-45 | (2012-07-05) PhD at LIG (Grenoble-France) PhD proposal : Collaborative Annotation of multi-modal, multi-lingual and multimedia documents
| ||
6-46 | (2012-07-08) Faculty position in Phonetic Science and Speech Technology at Nanjing Normal University, China Faculty position in Phonetic Science and Speech Technology at Nanjing Normal University, China (Urgent job announcement) The Institute of Linguistic Science and Technology at Nanjing Normal University, China, invites applications for a faculty position in the area of Phonetic Science and Speech Technology. The position can be Lecturer, Associate Professor, or Professor, depending on the qualifications and experience of the applicant. Nanjing Normal University (NNU) is situated in Nanjing, a city in China not only famous for its great history and culture but also pride for excellence in education and academy. With Chinese-style buildings and garden-like environment, the Suiyuan Campus of NNU is often entitled the “ Most Beautiful Campus in the Orient.” Nanjing Normal University is among the top 5 universities of China in the area of Linguistics. Placing strong emphasis on interdisciplinary research, the Institute of Linguistic Science and Technology at NNU is unique in that it bridges the studies of theoretical and applied linguistics, phonetics, cognitive sciences, neural sciences, and information technologies. The phonetic laboratory is very well equipped, with sound-proof recording studio, professional audio facilities, physiological instruments (e.g., WAVE system, PowerLab, EGG, EPG, airflow and pressure module, and nasality sensor), EEG for ERP studies, eye tracker, etc. The laboratory just organized an international symposium TAL 2012 www.TAL2012.org very successfully at the end of May. We welcome interested colleagues to join us. The research can cover any areas in phonetic sciences and speech technologies, including but not limited to speech production, speech perception, prosodic modeling, speech synthesis, automatic speech recognition and understanding, spoken language acquisition, computer-aided language learning, and ERP study for spoken languages. Outstanding research support will be offered. Requirements: * A PhD degree (or an expected one) in related disciplines (e.g., linguistics, psychology, physics, applied mathematics, computer sciences, and electronic engineering); * Good publication/patent record in phonetic sciences or speech technologies; * Good oral and written communication skills in both Chinese and English; * Team work spirit in a multidisciplinary group. Interested candidates should submit a CV, a detailed list of publication, the copies of the best two or three publications, and the contact information of two references to: Prof. Wentao GU Email: wtgu@njnu.edu.cn; wentaogu@gmail.com Phone: (office) +86-25-8359-8624, (mobile) +86-189-3687-2840 The position will keep open until it is filled. An earlier application is strongly recommended
| ||
6-47 | (2012-07-26) Offre de thèse en correction orthographique par traduction statistique, Univ. Le Mans, FranceOffre de thèse financée au sein du laboratoire d'Informatique de l'Université du Maine (LIUM) dans le domaine de la correction orthographique automatique par méthodes de traduction statistique. Lieu : LIUM (Le Mans) Date : 1/10/2012 Durée : 3 ans Cette thèse s'inscrit dans le projet 'investissement d'avenir' PACTE, porté par l'entreprise Diadeis, et dont sont également partenaires l'équipe Alpage (INRIA et Paris 7), et les entreprises A2ia et Isako. PACTE a pour objectif l'amélioration de la qualité orthographique des textes issus de différentes méthodes de capture textuelle. L'accent est mis sur les sorties d'OCR (reconnaissance optique de caractères sur des textes imprimés scannés), mais concerne également des données obtenues par reconnaissance d'écriture manuscrite, par saisie manuelle, et par rédaction directe. Les techniques qui seront utilisées sont à la fois statistiques et hybrides, faisant usage d'outils et de ressources de linguistique computationnelle. Le domaine d'application principal du projet est celui de la numérisation du patrimoine écrit, dans un contexte multilingue. Une deuxième thèse démarrera à Alpage avec un accent sur l'utilisation des connaissances linguistiques pour aider à optimiser automatiquement ou quasi-automatiquement la qualité orthographique des textes. Dans le cadre du projet PACTE, une étroite collaboration aura lieu entre le LIUM, Alpage et la société Diadeis. Dans ce contexte, l'enjeu de la thèse au LIUM est d'analyser comment utiliser les techniques de traduction automatique statistique pour la correction d'erreur. En effet, on peut considérer la correction d'erreur comme un processus de passage d'une langue erronée vers une langue correcte. Une approche similaire a déjà été utilisée avec succès pour corriger les sorties des systèmes de traduction par règles, connue sous le nom 'statistical post-editing (SPE)'. Dans le cadre de cette thèse, il s'agit donc d'étudier comment une approche similaire peut être utilisée pour la correction orthographique. Un aspect important de cette thèse concerne le développement de modèles de langue efficaces, donnant de bons résultats avec une faible empreinte mémoire. Les modèles n-grammes à repli seront privilégiés, mais d'autres méthodes seront également explorées, notamment la modélisation dans l'espace continu (continuous space language models). Nous nous intéresserons aussi à l'intégration de connaissances morphosyntaxiques, en collaboration avec l'équipe Alpage. Les langues étudiées seront prioritairement le français et l'anglais, ainsi que l'allemand. Une application à l'espagnol, l'italien, voire d'autres langues européennes est possible. Profil recherché : - bonnes compétences en informatique (la maîtrise de Linux est indispensable, programmation en C++, utilisation de scripts, Perl, etc); - des connaissances en traduction automatique statistique sont souhaitées, ou, à défaut, en apprentissage automatique; - une expérience avec l'outil Moses est un plus. La thèse se déroulera au sein de l'équipe LST du LIUM. Le LIUM est connu au niveau international pour ses recherches dans le domaine de la traduction statistique, et nous avons de nombreuses collaborations avec des universités et entreprises en Europe et aux États-Unis. Contact : Holger Schwenk Holger.Schwenk@lium.univ-lemans.fr
| ||
6-48 | (2012-08-03) PhD Studentship in Speaker Diarization at EURECOM, Sophia Antipolis, Alpes Maritimes, France PhD Studentship in Speaker Diarization at EURECOM
|