ISCApad #150 |
Tuesday, December 07, 2010 by Chris Wellekens |
6-1 | (2010-07-07) Two positions at ELDA Two positions are currently available at ELDA.
Engineer in HLT Evaluation Department
He/she will be in charge of managing the evaluation activities in relation with the collection of Language Resources for evaluation, the evaluation of technology components, and in general, the setting up of an HLT evaluation infrastructure. As part of the HLT Evaluation Department, he/she will be working on European projects and will be involved in the evaluation of technology components related to information retrieval, information extraction, machine translation, etc.
ProgrammerELDA offers a position for its Language Resource Production and Evaluation activities working in the framework of European projects. The position is related to a number of NLP activities within ELDA, with a focus on the development of web-service architectures for the automatic production and distribution of language resources. The candidate may also be involved in the creation of LR repositories, NLP applications development and/or evaluation, etc. Profile :
Applicants should send (preferably via email) a cover letter addressing the points listed above together with a curriculum vitae to :
Khalid Choukri
ELRA / ELDA
55-57, rue Brillat-Savarin
75013 Paris
FRANCE
Fax : 01 43 13 33 30
Courriel : job@elda.org
| |||||||||
6-2 | (2010-07-07) Doctorat au LORIA Nancy France (fluency in french required)) Sujet de thèse
Motivations
Dans le cadre d'une collaboration avec une entreprise qui commercialise des morceaux de
documentai res vidéo (rushes), nous nous intéres sons à la reconnaissance automatique
des dialogues de ces rushes afin de pouvoir les indexer.
L'équipe parole a développé un système de transcription automa tique de bulletins
d'information : ANTS [2,3]. Si les performances des systèmes de transcription
automatique actuels sont satisfaisantes dans le cas de la parole lue ou
« préparée » (bulletins d'informations, discours), elles se dégradent fortement dans le cas
de la parole spontanée [1,4,5]. Par rappor t à la parole préparée, la parole spontanée se
caractérise par:
• des insertions (hésitations, pauses, faux dépar t s de mots, reprises),
• des variations de prononciations comme la contraction de mots ou de syllabes
(/monsieur / => /m' sieu / ),
• des variations de la vitesse d'élocution (réduction de l'articulation de certains
phonèmes et allongement s d'aut res phonèmes),
• des environnement s sonores difficiles (parole superposée, rires, bruits
d'ambiance...).
Ces spécificités sont peu ou pas prises en compte par les systèmes de reconnaissance
actuels. Tous ces phénomène s provoquent des erreur s de reconnais sance et peuvent
entraîner une indexation erronée.
Sujet
Le but du sujet de thèse est de prendre en compte un ou plusieurs des phénomènes
spécifiques décrits ci- dessus, afin d'améliorer le taux de reconnaissance [4,6,7]. Les
phénomène s seront choisis et traités au niveau acoustique ou linguistique en fonction du
profil du candidat. Le travail consistera à :
• comprendre l'architecture de ANTS,
• pour les phénomène s choisis, faire un état de l'art et proposer de nouveaux
algorithmes,
• réaliser un prototype de reconnaissance de parole spontanée et le valider sur un
corpus de parole spontanée étiqueté.
Cadre du travail
Le travail s'effectuera au sein de l'équipe Parole de l'Inria - Loria à Nancy
(http: / / p a role.loria.fr). L'étudiant utilisera le logiciel ANTS de reconnaissance
automatique de la parole développé dans l'équipe.
Profil souhaité
Les candidat s devront maîtriser le français et l'anglais et savoir programme r en C ou en
Java dans un environnement Unix. Des connaissances en modélisation stochas tique ou en
traitement automatique de la parole seront un plus.
Contacts : illina@loria.fr , fohr@loria.fr ou mella@loria.fr
[1] S. Galliano, E. Geoffrois, D.Mostefa , K. Choukri, JF. Bonastre and G. Gravier, The ESTER Phase II Evaluation
Campaign for Rich Transcription of French broadcas t news, EUROSPEECH 2005,
[2] I. Irina, D. Fohr, O. Mella and C.Cerisara, The Automatic News Transcription System: ANTS some realtime
experiment s, ISCPL2004
[3] D. Fohr, O. Mella, I. Irina and C. Cerisara, Experiment s on the accuracy of phone models and liaison
proces sing in a French broadcas t news transcription systems, ISCPL2004
[4] J.- L Gauvain, G. Adda, L. Lamel, L. F. Lefevre and H. Schwenk, Transcription de la parole conversationnelle
Revue TAL vol 45 n° 3
[5] M. Garnier - Rizet, G. Adda, F. Cailliau, J.- L. Gauvain, S. Guillemin- Lanne, L. Lamel, S. Vanni, C. Waaste -
Richard CallSurf: Automatic transcription, indexing and structuration of call center conversational speech for
knowledge extraction and query by content. LREC 2008
[6] J.Ogata, M.Goto, The use of acous tically detected filled and silent pauses in spontaneous speech
recognition ICASSP 2009
[7] F. Stouten, J. Duchateau, J.- P. Martens and P. Wambacq, Coping with disfluencies in spontaneous speech
recognition: Acoustic detection and linguistic context manipulation, Speech Communication vol 48, 2006
| |||||||||
6-3 | (2010-07-14 ) Ph D position at Loria Nancy (in french)
Sujet de these Motivations Dans le cadre d'une collaboration avec une entreprise qui commercialise des morceaux de documentai res vidéo (rushes), nous nous intéres sons à la reconnaissance automatique des dialogues de ces rushes afin de pouvoir les indexer. L'équipe parole a développé un système de transcription automa tique de bulletins d'information : ANTS [2,3]. Si les performances des systèmes de transcription automatique actuels sont satisfaisantes dans le cas de la parole lue ou « préparée » (bulletins d'informations, discours), elles se dégradent fortement dans le cas de la parole spontanée [1,4,5]. Le travail s'effectuera au sein de l'équipe Parole de l'Inria - Loria à Nancy (http: / / parole.loria.fr). L'étudiant utilisera le logiciel ANTS de reconnaissance automatique de la parole développé dans l'équipe. Profil souhaité Les candidat s devront maîtriser le français et l'anglais et savoir programme r en C ou en Java dans un environnement Unix. Des connaissances en modélisation stochas tique ou en traitement automatique de la parole seront un plus. Contacts [1] S. Galliano, E. Geoffrois, D.Mostefa , K. Choukri, JF. Bonastre and G. Gravier, The ESTER Phase II Evaluation Campaign for Rich Transcription of French broadcas t news, EUROSPEECH 2005, [2] I. Irina, D. Fohr, O. Mella and C.Cerisara, The Automatic News Transcription System: ANTS some realtime experiment s, ISCPL2004 [3] D. Fohr, O. Mella, I. Irina and C. Cerisara, Experiment s on the accuracy of phone models and liaison proces sing in a French broadcas t news transcription systems, ISCPL2004 [4] J.- L Gauvain, G. Adda, L. Lamel, L. F. Lefevre and H. Schwenk, Transcription de la parole conversationnelle Revue TAL vol 45 n° 3 [5] M. Garnier - Rizet, G. Adda, F. Cailliau, J.- L. Gauvain, S. Guillemin- Lanne, L. Lamel, S. Vanni, C. Waaste - Richard CallSurf: Automatic transcription, indexing and structuration of call center conversational speech for knowledge extraction and query by content. LREC 2008 [6] J.Ogata, M.Goto, The use of acous tically detected filled and silent pauses in spontaneous speech recognition ICASSP 2009 [7] F. Stouten, J. Duchateau, J.- P. Martens and P. Wambacq, Coping with disfluencies in spontaneous speech recognition: Acoustic detection and linguistic context manipulation, Speech Communication vol 48, 2006
| |||||||||
6-4 | (2010-07-20) Ph D at IDIAP Martigny SwitzerlandPhD POSITION in PERSON SEGMENTATION AND CLUSTERING IN AUDIO-VIDEO STREAMS, 36 MONTHS STARTING IN OCTOBER 2010, in IDIAP (MARTIGNY, SUISSE) AND LIUM (LE MANS, FRANCE), NET SALARY: 1700€ + INDEMNITY ------------------------------------------------------------------------------------ Research areas: Audio/video segmentation and clustering, speaker recognition, face recognition, pattern recognition, machine learning, audio and image processing. --- Description: The objective of the thesis is to investigate novel algorithms for the automatic segmentation and clustering of people in audio-visual documents. More precisely, the goal is to detect the people who appear in the documents, when they appear or/and when they speak, with whom they speak, and who they are. The work will rely on and improve previous knowledge of the LIUM and IDIAP in speaker diarization, names recognition from automatic speech transcripts, person detection, tracking and recognition, and will be expanded to address the audio-visual identity association and the recognition of the roles of people in the Tv shows. The work will be evaluated in the framework of the REPERE evaluation campaign, which is a challenge for audio and video person detection and recognition in TV broadcasts (journal debates, sitcoms) and will focus on segmentation and clustering targeting well-known people (anchors, journalists, known or introduced persons). --- Supervision and organization: The proposed position is funded by the ANR in the SODA project. It is a joint PhD position within both IDIAP and LIUM, under academic co-supervision by Profs. Paul Deléglise (LIUM), Jean-Marc Odobez (IDIAP) and Sylvain Meignier (LIUM). He will work closely with a post-doctoral fellow working for the same project. The candidate will be registered as a student at the University of Le Mans. He will share this time between Le Mans and Martigny depending on the need. The position will start in October 2010 and the net salary will be between €1700 a month. 18 months of indemnity (€500 per month) will be provided to support the extra cost of working at two different sites, as well as the higher cost of life in Martigny. --- Requirement: Applicants should hold a strong university degree entitling them to start a doctorate (Master’s degree or equivalent) in a relevant discipline (Computer Science, Human Language Technology, Machine Learning, etc). Applicants for this full-time 3 year PhD position should be fluent in English or in French. Competence in French is optional, though applicants will be encouraged to acquire this skill during training. Very strong software skills are required, especially in Java, C, C++, Unix/Linux, and at least one scripting language such as Perl or Python. --- Contact: Please send a curriculum vitae to Jean-Marc Odobez odobez@idiap.ch AND sylvain.meignier@lium.univ-lemans.fr
| |||||||||
6-5 | (2010-07-28) Ph D position in model based speech synthesis Post Doctoral Speech Synthesis Research Associate Position
The Communication Analysis and Design Laboratory at Northeastern University is pleased to announce the availability of a postdoctoral research associate position, funded by the National Science Foundation Division of Computer and Information Systems. This project aims to build a personalized speech synthesizer for individuals with severe speech impairments by mining their residual source characteristics and morphing these vocal qualities with filter properties of a healthy talker. An initial prototype has been designed and implemented in MATLAB. Further work is required to refine the voice morphing and speech synthesis algorithms, to develop a front-end user interface and to assess system usability. The successful candidate will work on an interdisciplinary team toward the project goals.
Required Skills: PhD in computer science or electrical engineering or related field Strong knowledge in machine learning and digital signal processing Extensive experience with MATLAB and C/C++ programming Experience with building graphical user interfaces Knowledge of, and experience with, concatenative and/or model-based speech synthesis This position is available immediately. Funding is available for up to two years on this project. Additional funding may be available for work on related projects. Interested candidates should email and/or send the following to Rupal Patel, Director, Communication Analysis and Design Laboratory, 360 Huntington Avenue, Boston, MA, 02115; r.patel@neu.edu; 617-373-5842: A cover letter stating your research interests and career goals, CV, two letters of recommendation, official transcripts of all postsecondary education.
| |||||||||
6-6 | (2010-08) Speech Synthesis Post Doctoral Research Associate Position Speech Synthesis Post Doctoral Research Associate Position
The Communication Analysis and Design Laboratory at Northeastern University is pleased to announce the availability of a postdoctoral research associate position, funded by the National Science Foundation Division of Computer and Information Systems. This project aims to build a personalized speech synthesizer for individuals with severe speech impairments by mining their residual source characteristics and morphing these vocal qualities with filter properties of a healthy talker. An initial prototype has been designed and implemented in MATLAB. Further work is required to refine the voice morphing and speech synthesis algorithms, to develop a front-end user interface and to assess system usability. The successful candidate will work on an interdisciplinary team toward the project goals.
Required Skills: PhD in computer science or electrical engineering or related field Strong knowledge in machine learning and digital signal processing Extensive experience with MATLAB and C/C++ programming Experience with building graphical user interfaces Knowledge of, and experience with, concatenative and/or model-based speech synthesis This position is available immediately. Funding is available for up to two years on this project. Additional funding may be available for work on related projects. Interested candidates should email and/or send the following to Rupal Patel, Director, Communication Analysis and Design Laboratory, 360 Huntington Avenue, Boston, MA, 02115; r.patel@neu.edu; 617-373-5842: A cover letter stating your research interests and career goals, CV, two letters of recommendation, official transcripts of all postsecondary education.
| |||||||||
6-7 | (2010-09-08) European project in Basque country
| |||||||||
6-8 | (2010-09-12) Ph D positions at KTH PhD Student Positions:
| |||||||||
6-9 | (2010-09-27) Two positions at ELDA Two positions are currently available at ELDA (reminder).
| |||||||||
6-10 | (2010-10-01) Ingenieur/Doctorat en Reconnaissance automatique de la parole des personnes âgées Reconnaissance automatique de la parole des personnes âgées pour les,services d’assistance aux personnes à domicile
| |||||||||
6-11 | (2010-10-) Research internship on automatic speech at HLT research unit at Fondazione Bruno Kessler (Trento, Italy ) The Human Language Technology research unit at Fondazione Bruno
| |||||||||
6-12 | (2011-07-11) JHU Summer Workshops CALL FOR TEAM RESEARCH PROPOSALSJHU Summer Workshops CALL FOR TEAM RESEARCH PROPOSALS (revised) Deadline: Tuesday, November 9, 2010. http://www.clsp.jhu.edu/workshops/ws11/CFP The Center for Language and Speech Processing at Johns Hopkins University invites one-page research proposals for a Summer Workshop on Language Engineering, to be held in Baltimore, MD, USA, July 11 to August 19, 2011. An interactive peer-review meeting will refine and select proposals to be funded for a six-week residential team exploration. Proposals should aim to advance the state of the art in any of the various fields of Human Language Technology (HLT). This year, proposals in related areas of Machine Intelligence that share techniques with HLT, such as Computer Vision (CV), are also strongly solicited. Proposals are welcome on any topic of interest to HLT, CV and technically related areas. For example, proposals may address novel topics or long-standing problems in one of the following areas. * SPEECH TECHNOLOGY: Proposals are welcomed that address any aspect of information extraction from speech signal (message, speaker identity, language,...). Of particular interest are proposals for techniques whose performance would be minimally degraded by input signal variations, or which require minimal amounts of training data. * NATURAL LANGUAGE PROCESSING: Proposals for knowledge discovery from text are encouraged, as are proposals in traditional fields such as parsing, machine translation, information extraction, sentiment analysis, summarization, and question answering. Proposals may aim to improve the accuracy or enrich the output of such systems, or extend their reach by improving their speed, scalability, and coverage of languages and genres. * VISUAL SCENE INTERPRETATION: New strategies are needed to parse visual scenes or generic (novel) objects, analyzing an image as a set of spatially related components. Such strategies may integrate global top-down knowledge of scene structure (e.g., generative models) with the kind of rich bottom-up, learned image features that have recently become popular for object detection. They will support both learning and efficient search for the best analysis. * UNSUPERVISED AND SEMI-SUPERVISED LEARNING: Novel techniques that do not require extensive quantities of human annotated data to address any of the challenges above could potentially make large strides in machine performance as well as lead to greater robustness to changes in input conditions. Semi-supervised and unsupervised learning techniques with applications to HLT and CV are therefore of considerable interest. Research topics selected for investigation by teams in past workshops may serve as good examples for your proposal (http://www.clsp.jhu.edu/workshops). An independent panel of experts will screen all received proposals for suitability. Results of this screening will be communicated no later than November 12, 2010. Authors passing this initial screening will be invited to Baltimore to present their ideas to a peer-review panel on December 3-5, 2010. It is expected that the proposals will be revised at this meeting to address any outstanding concerns or new ideas. Two or three research topics and the teams to tackle them will be selected for the 2011 workshop. We attempt to bring the best researchers to the workshop to collaboratively pursue the selected topics for six weeks. Authors of successful proposals typically become the team leaders. Each topic brings together a diverse team of researchers and students. The senior participants come from academia, industry and government. Graduate student participants familiar with the field are selected in accordance with their demonstrated performance. Undergraduate participants, selected through a national search, are rising seniors: new to the field and showing outstanding academic promise. If you are interested in participating in the 2011 Summer Workshop we ask that you submit a one-page research proposal for consideration, detailing the problem to be addressed. If your proposal passes the initial screening, we will invite you to join us for the December 3-5 meeting in Baltimore (as our guest) for further discussions aimed at consensus. If a topic in your area of interest is chosen as one of the two or three to be pursued next summer, we expect you to be available for participation in the six-week workshop. We are not asking for an ironclad commitment at this juncture, just a good faith understanding that if a project in your area of interest is chosen, you will actively pursue it. We in turn will make a good faith effort to accommodate any personal/logistical needs to make your six-week participation possible. Proposals should be submitted via e-mail to clsp@jhu.edu by 4PM EST on Tue, November 9, 2010.
| |||||||||
6-13 | (2010-11-11) Joint PhD Studentship in Speech-based Security over the Telephone (Avignon-Swansea) Fully-funded Joint PhD Studentship in Speech-based Security over the Telephone This Studentship is for a Joint Phd with the Speech & Image Research Group at Swansea University and Laboratoire Informatique d'Avignon at Université d’Avignon et des Pays de Vaucluse. This studentship is fully funded and worth up to £15,000 p.a. plus fees. Closing Date: Students are advised to apply as soon as possible. Who is eligible:
Further information: This studentship is directly linked to a new EU Eureka/FP7 programme the overall aim of which is to enhance security of transactions over the telephone using speech-based signal processing and new biometric approaches. Over the last 20 years both LIA and Swansea University have been at the forefront of international research in the area of speaker recognition How to Apply: Send your CV to: Dr John Mason j.s.d.mason@swansea.ac.uk or Dr. Jean-Francois BONASTRE jean-francois.bonastre@univ-avignon.fr
| |||||||||
6-14 | (2010-11-28) Post Doc at KUL Leuven Belgium Ph D Position or Post-Doc Position in Speech Recognition at K.U.Leuven
| |||||||||
6-15 | (2010-12-01) Opened positions/internships at Microsoft: French Linguists (M/F)
Opened positions/internships at Microsoft: French Linguists (M/F) MLDC – Microsoft Language Development Center, a branch of the Microsoft Product Group that develops Speech Recognition and Synthesis Technologies, situated in Porto Salvo, Portugal ( Be native or near native Have a university degree in Linguistics (with good computational skills), Computational Linguistics (Master’s or PhD) or related area Have an advanced level of English (oral and written) Have some experience in working with Speech Technology/Natural Language Processing/Linguistics, either in academia or in industry Have some computational ability – being able to run tools, being comfortable to work with Microsoft Office tools and having some programming fundamentals, though no programming is required Have team work skills Willing to work in Porto Salvo (near Lisbon) for the duration of the contract Willing to work in a multicultural and multinational team across the globe To apply, please submit your resume and a brief statement describing your experience and abilities to We will only consider electronic submissions. Deadline for submissions
| |||||||||
6-16 | (2020-12-06) These CIFRE a a EDF France (in french) Financement : thèse CIFRE en traitement automatique de la langue parlée Sujet : L'extraction d'information à partir de données acoustiques de parole lance aujourd'hui un défi important aux chercheurs du traitement automatique de la parole: aller au-delà de la transformation du signal sonore en une simple suite de mots (transcription automatique) en y intégrant des informations sémantiques. Ces informations sémantiques sont notamment utilisées pour indexer les données acoustiques en fonction des thèmes abordés, des sujets traités, voire même des opinions exprimées sur ces sujets. De nombreuses recherches portées par la communauté TAL (traitement automatique des langues) abordent également la problématique de l'extraction d'information à partir de données textuelles de plus en plus variées et de sources hétérogènes (blogs, forum de discussions, etc.). En particulier, l'utilisation de techniques issues de l'indexation de documents textuels, appliquées à la transcription automatique de parole constitue un thème de recherche en plein essor qui soulève de nombreux problèmes scientifiques. Ces problèmes sont liés au traitement de la variabilité dans la parole, aux erreurs (ou au bruit) de la transcription automatique ainsi qu'aux modes d'expressions spontanés différents de ceux de l'expression écrite. Le sujet de thèse porte sur l'extraction d'information dans des données conversationnelles EDF à partir des transcriptions automatiques. Ce travail est à la fois concerné par l'extraction d'information à partir du texte et par l'analyse et la gestion des aspects liés à l'expression orale via le traitement des erreurs de transcription automatique. Les données conversationnelles EDF (centres d'appel, enquêtes de satisfaction) représentent un corpus riche fournissant un cadre de travail particulièrement motivant en termes de recherche fondamentale et d'enjeux applicatifs. La palette thématique et expressive est à la fois large et circonscrite par le cadre applicatif. Les sources de variabilités (variantes de prononciation, phénomènes dits « disfluents » et structures dites «agrammaticales», accents (étrangers, régionaux etc., bruits environnants etc.) sont particulièrement présentes dans ces données. Le travail envisagé portera sur: 1. la modélisation sémantique de l'information à extraire. (i) Un premier volet portera sur l'indexation thématique en fonction des données exploitées et des besoins opérationnels (ex: repérer les conversations où les clients abordent un thème donné). (ii) Un deuxième volet de recherche visera à expliciter les raisons de satisfaction ou d'insatisfaction. Le travail effectué sera examiné dans le cadre des recherches sur la fouille d'opinions (opinion mining). 2. l'adaptation aux transcriptions. Il s'agira d'adapter les techniques d'extraction d'information aux spécificités de l'oral: disfluences, faible cohérence syntaxique, marqueurs pragmatiques (ex: marqueurs de discours, indices dialogiques). Des traitements spécifiques seront à envisager afin de minimiser l'impact du bruit de transcription automatique (erreurs de transcription). Une étape préalable consistera à faire une analyse systématique des erreurs de transcription automatique en fonction de l'information recherchée et du cadre dialogique (profils conseiller/client). Profil recherché : Bac+5 dans le domaine du TALN et/ou du traitement de la parole. Français et Anglais courant. Contexte : Cette thèse s'effectuera entre le LIMSI, groupe TLP (Traitement du Langage Parlé) et le département ICAME (Innovation Commerciale pour l'Analyse des Marchés et de leur Environnement) d'EDF R&D Candidature : Envoyer un CV, une lettre de motivation, et les coordonnées d'au moins un référent universitaire à l'ensemble des adresses suivantes : chloe.clavel@edf.fr , anne.peradotto@edf.fr madda@limsi.fr, ioana@limsi.fr, sophie.rosset@limsi.fr
|