ISCA - International Speech
Communication Association

ISCApad Archive » 2013 » ISCApad #178 » Jobs

ISCApad #178

Wednesday, April 10, 2013 by Chris Wellekens

6 Jobs

6-1

(2012-10-05) ASSOCIATE RESEARCH SCIENTIST POSITION at ETS Princeton, NJ, USA

ASSOCIATE RESEARCH SCIENTIST POSITION

Speech

Educational Testing Service

Headquartered in Princeton, NJ, ETS is the world’s premier educational measurement institution and a leader in

educational research. As a nonprofit corporation and an innovator in developing tests for clients in education,

government, and business we are dedicated to advancing educational excellence for the communities we serve.

ETSs Research & Development division has an opening for a researcher in the NLP & Speech Group. The Group

currently consists of about 15 Ph.D. level research scientists in areas related to NLP and speech. Its main focus is on

foundational research as well as on development of new capabilities to automatically score written and spoken test

responses in a wide range of ETS test programs including TOEFL(R)iBT and GRE(R).

PRIMARY RESPONSIBILITIES



Provide scientific and technical skills in conceptualizing, designing, obtaining support for, conducting,

managing, and disseminating results of research projects in the field of speech technology, or portions of largescale

research studies or programs in the same field.



Develop and/or modify speech theories to conceptualize and implement new capabilities in automated scoring

and speech-based analysis and evaluation systems which are used to improve assessments, learning tools and test

development practices.



Apply scientific, technical and software engineering skills in designing and conducting research studies and

capability development in support of educational products and services.



Develop and oversee the conduct of selected portions of research proposals and project budgets.



Design and conduct complex scientific studies functioning as an expert in major facets of the projects.



Assist in the conduct of research projects by accomplishing directed tasks according to schedule and within

budget.



Participate in dissemination activities through the publications of research papers, progress and technical reports,

the presentation of seminars or other appropriate communication vehicles.



Develop professional relationships as a representative, consultant or advisor to external advisory and policy

boards and councils, research organizations, educational institutions and educators.

REQUIREMENTS

A Ph.D. in Language Technologies, Natural Language Processing, Computer Science or Electrical Engineering, with

strong emphasis on speech technology and preferably some education in linguistics is required.

Evidence of substantive research experience and/or experience in developing and deploying speech capabilities is

required. Demonstrable contributions to new and/or modified theories of speech processing and their implementation in

automated systems. Demonstrable expertise in the application of speech recognition systems and fluency in at least one

major programming language (e.g., Java, Perl, C/C++, Python).

HOW TO APPLY

Please apply online at

www.ets.org/careers – position #124337. ETS offers competitive salaries, outstanding benefits, a

stimulating work environment, and attractive growth potential. ETS is an Equal Opportunity, Affirmative Action Employer.

Back

Top

6-2

(2012-10-05) Researcher in Speech Technology at Vicomtech-IK4, San Sebastian, Spain

Researcher in Speech Technology
Vicomtech-IK4, an international applied research centre in Visual Interaction and Communication Technologies located in San Sebastian (Spain) is looking for a Researcher in Speech Technology. We are looking for someone who combines experience in speech processing and software engineering, with research interests in multilingual Speech Recognition, Speech Synthesis, Voice Transformation and Conversion, and also motivated by the transfer of this knowledge into real world applications by building advanced research prototypes developed to solve real needs. The selected candidate will have an important role in the Human Speech and Language Technologies Department, including advanced research, project management responsibilities and technical leadership in high profile projects.
Requirements: - Masters/Ph.D. degree (or equivalent) in Speech Technology or related field - Experience in Speech Recognition, Speech Synthesis, Voice Transformation or Voice Conversion is desired. - Good written and spoken Spanish and English. Any other languages will be valued.
If you are: - An accomplished researcher with management abilities and interests. - A team player with an ambitious and creative personality.
We offer: - A multi-cultural research environment. - A multidisciplinary research team. - A group active in the international Human Speech and Language Technologies research field. - The opportunity to develop management as well as research skills.
To apply, please submit your CV and a cover letter describing your experience and interest in the position to:
Dr.-Ing. Jorge Posada (Associate Director) E-mail: jposada@vicomtech.org Telephone: +34 943 30 92 30 http://www.vicomtech.org
Deadline for submission: Open until filled

Back

Top

6-3

(2012-10-10) Dolby Research Beijing looking for world-class talent!

Dolby Research Beijing looking for world-class talent!

Be part of the exciting future of entertainment and add your talents to those of an amazing team. For more than 40 years, Dolby has led the way in developing innovative entertainment products and technologies used by consumers and professionals worldwide. Innovations from Dolby can be heard in consumer audio and video products, entertainment software, and professional sound applications, including music recording, broadcasting, and sound for motion pictures.

Our company philosophy encourages creativity, collaboration and a strong focus on creation, development and delivery of innovative technology solutions that enhance the entertainment experience. Our team-oriented research environment offers the opportunity for market-savvy innovators to apply their theoretical knowledge, awareness of technology trends and alertness to emerging market opportunities to help create technology solutions that are broadly applied in the marketplace through Dolby’s global market reach. We offer great benefits, including an assortment of life insurance and health coverage options, and the opportunity for innovators to make a difference and to experience the satisfaction of seeing technology solutions to which they contributed, in the marketplace!

Senior Research Engineer

(Audio/speech algorithm architecture and design)

Dolby Sound Technology Research, Beijing

Position Summary

This position is in the Research Organization of Dolby Laboratories (www.dolby.com) and is located inBeijing, China. The senior staff research engineer position focuses on the creation of audio signal processing technologies including the whole range of research from the underlying theoretical concepts to the development of prototypes that provide a proof of concept. As a part of an international team, the senior staff research engineer will work on ideas exploring new horizons in the audio processing, analysis, replay and organization. The researcher is responsible for performing fundamental new research, transfer technology to product groups, and draft patent applications. The position includes project leaderships for projects being part of Dolby’s global technology initiatives. This requires efficient interactions which different functional divisions within the company. The position also requires the mentoring for more junior staff.

Dolby Laboratories is looking for a self-motivated, highly talented individual interested in applying his or her skills in technologies involving a fundamental understanding of the way that sound sources (audio and voice) are captured, manipulated, coded, delivered, enhanced and played back using digital signal processing techniques. Applications include pre-processing, coding and post-processing solutions in market areas such as consumer electronics, mobile, broadcast, PC and digital cinema applications and other technologies crucial to Dolby Laboratories’ success. The position involves working in cooperation with other technology developers/researchers within Dolby’s global research network, and the opportunity to propose new ideas for further investigation.

Education, Skills, Abilities, and Experience Required

M.S.E.E. (required) plus 3 years of applicable, hands-on commercial experience (strongly preferred), or Ph.D. in Electrical Engineering (desired) plus 3 years of closely relevant academic post-doc Research and Development experience

Demonstrated ability to create fundamentally new, novel (patentable) signal processing technologies and to envision applications those technologies in the form of innovative product solutions

Strong innovator

Project leadership skills

Mentoring skills

Experienced in global project and collaboration work

Proficient in advanced theory and application of audio signal processing techniques

Highly skilled in C/C++ language and Matlab programming

Team-oriented work ethic and interest to work in cross-continental teams

Strong personal interest sound technologies and in learning, researching, and creating relevant new technologies with high commercial impact

Independent, self-motivated worker requiring minimal supervision

Fluent in Chinese and English. Excellent communication skills

Good understanding of general acoustics

Strongly Desired

Experience working in a software development team, including software version control tools

Real-time windows programming

Real-time audio processing

Willing to do occasional international travel

Personal interest in audio in entertainment applications

Please send your English and Chinese resume to cb@dolby.com

Back

Top

6-4

(2012-10-11) Faculty Position at the Center for Spoken Language Understanding, Portland, Oregon

    Job title: Assistant, Associate, or Full Professor
    Institution: Oregon Health & Science University, Portland, Oregon
    Department: Center for Spoken Language Understanding

The Institute on Development & Disability/Center for Spoken Language Understanding invites applications at all ranks for a faculty position in Natural Language Processing, to include technologies for analysis of speech, language, or both. Special interest in applications to behavioral manifestations of neurological disorders is essential.

The primary interest is to extend our existing program in developing behavioral technologies that allow early detection and remediation of a wide range of neurological disorders, in including Autism and Parkinson’s.

The Institute on Development & Disability/ Center for Spoken Language Understanding is at the forefront of this new, exciting area of research. The faculty member will be expected to teach courses supporting the research program and appropriate background areas such Machine Learning and Computational Linguistics. We seek a researcher with a well-developed program in Natural Language Processing, to collaborate with the CSLU team and with clinicians throughout OHSU. The appointee will be expected to maintain an independent, extramurally funded research program.

Requirements:

Ph.D.
Experience with Computational Linguistics, Machine Learning and Natural Language Processing.

Please contact: Jan van Santen, vansantj@ohsu.edu

Back

Top

6-5

(2012-10-11) Research Programmer at the College of Pharmacy at the University of Minnesota

Brief Description: The College of Pharmacy at the University of Minnesota is seeking a talented, pro-active and innovative individual for a Research Programmer position to work on several projects in Center for Clinical and Cognitive Neuropharmacology (C3N). C3N is engaged in conducting interdisciplinary research focused on cognitive effects of medications and neurodegenerative disorders such as Alzheimer's disease. Computerized assessment is used to measure these cognitive effects. The successful candidate for this position will be responsible for a variety of computer-related tasks including creating and maintaining innovative computer-based neuropsychological testing applications that involve spontaneous speech and language collection and analysis. The successful candidate will also be responsible for creating and maintaining databases used to store and organize experimental samples and web-enabled interfaces to the databases and data analysis tools. The successful candidate will also be expected to work with graduate and undergraduate students on specific programming and research projects to meet the needs of the Center.

Full Description is available here on the official University of Minnesota job posting site:

https://employment.umn.edu/applicants/jsp/shared/position/JobDetails_css.jsp?postingId=576008

Back

Top

6-6

(2012-10-12) Postdoctoral research position: Automatic identifcation of South African languages

Postdoctoral research position: Automatic identifcation of South African languages U.Stellenbosch South Africa

A postdoc position focussing on automatic language identification for the eleven official languages of South Africa is available in the Digital Signal Processing Group of the Department of Electrical and Electronic Engineering at the University of Stellenbosch. Specific project objectives include the developemt of a research system, the production of associated publishable outputs, and the development of a web-based demonstrator. The position is part of a bilateral project grant between the Netherlands and South Africa.

Applicants should hold a PhD in the field of Electronic/Electrical Engineering, Information Engineering, or Computer Science, or other relevant disciplines. Suitable candidates must have strong computer programming, analytical and mathematical skills, and be familiar with a Linux computing environment. Candidates must also be self-motivated and able to work independently. Finally, candidates must have excellent English writing skills and have an explicit interest in scientific research and publication.

The position will be available for one year, with a possible extension to a second year, depending on progress and available funds. The proposed starting date is not later than 15 January 2013.

Applications should include a covering letter, curriculum vitae, list of publications, research projects, conference participation and details of three contactable referees and should be sent as soon as possible to: Prof Thomas Niesler, Department of Electrical and Electronic Engineering, University of Stellenbosch, Private Bag X1, Matieland 7602. Applications can also be sent by email to: trn@sun.ac.za. The successful applicant will be subject to University policies and procedures.

Interested applicants are welcome to contact me at the above e-mail address for further information regarding the project.

Back

Top

6-7

(2012-10-18) Dolby’s technology group in Beijing looking for software engineers!

Dolby’s technology group in Beijing looking for software engineers!

Job Title: Embedded SW-Engineer (Audio)

Summary Description

This position is in the Engineering organization of Dolby Laboratories and located in Beijing, China. The main focus of this position is to implement Dolby’s audio technologies, including creating the reference code, porting to the embedded platforms such as ARM cores or TI DSPs. The position requires a deep knowledge in signal processing algorithms, fixed-pointed algorithms and optimization technical including the use of assembly language, as well as an excellent understanding of DSP architectures.

We are looking for a highly motivated individual for whom working with different tool chains under various operating systems in hardware close environments is fun and not a challenge.

The candidate will be part of the engineering team in Beijing and work closely together with other Dolby engineering entities in the US, Germany and Australia. We expect the candidate to build-up expert knowledge on highly efficient Dolby audio engines. Working in an international environment requires excellent verbal and written English communication skills.

Essential Job Functions:

Implement Dolby’s audio signal processing algorithms for both floating-point and fixed-point platforms.
Port and optimize audio signal processing algorithms to embedded fixed-point devices in a timely manner based on assigned portions of projects and existing architectures.
Write code, following best practices in embedded SW-engineering, leading to well documented, reliable and easy to maintain SW-components.
Validate and maintain correct behavior of SW-components via automated unit tests.
Serve as a team member with responsibility for maintaining embedded sub-components.
Work together with the development teams in the US, Germany and Australia to improve Dolby’s products.
Be a local resource on embedded Dolby audio technologies by combining an understanding of algorithmic behavior with a good knowledge of processor architectures.
Seek to increase knowledge by attending internal and external trainings and conferences.
Contribute ideas for new technologies, tools, or methodologies.
Share relevant information within the project team.
Provide technical assistance to non-engineering teams such as Research.
Promote a positive work environment.
Practice sensitivity in working with others.
Accept input from other team members.

Teamwork & Communications

Education, Skills, Abilities, and Experience:

M.S. in Electrical Engineering, Computer Science or comparable field is required.
Professional experience in porting and optimization of signal processing algorithms to embedded platforms is a strong plus.
C/C++ programming skills under Windows and Linux environments is required.
Good understanding of development/debugging on embedded simulators/hardware devices is required.
Excellent English spoken and written communication skills are required.
Ability to meet timelines is required.
Good understanding of at least one assembly language is a strong plus.
Knowledge of scripting languages such as Perl or Python is a plus.
Familiar with embedded real time operating systems is a plus.

Please send your CV to cv.engineerin.beijing@dolby.com

Back

Top

6-8

(2012-10-19) Post-doc at LABEX EPL Paris 3

Dans le cadre du LABEX EFL 'Empirical foundations of Linguistics' (http://www.labex-efl.org/), un projet d’une durée de 10 ans initié en 2011, nous proposons un projet post-doctoral de 6 mois à temps complet dans l’opération de recherche « Assessing phonetic and phonological complexity in motoric speech  disorders ».

Le candidat participera à une recherche sur la complexité phonétique et phonologique dans le contexte des troubles moteurs de la parole.

Il travaillera sous la responsabilité de Cécile Fougeron et en collaboration avec le Dc Lise Crevier-Buchman au Laboratoire de Phonétique et Phonologie, à Paris .

Le candidat sera titulaire d’un doctorat en phonétique ou en Speech and Hearing Sciences, avec une expérience en phonétique clinique. Une formation en phonétique acoustique et dans l’expérimentation avec des patients est requise. Une expérience en investigation physiologique avec l’ultrason sera un plus.

Les dossiers de candidatures contiendront les documents suivants et seront envoyés à martine.adda-decker_at_univ-paris3.fr avant le 21/11/2012 :

une lettre de motivation
un curriculum vitae à jour avec une liste des publications
l’adresse du site Internet où l’on peut accéder aux publications
les noms et adresses électroniques de deux personnes pouvant fournir des références

Pour plus d'informations, voir http://www.labex-efl.org/?q=fr/node/136

Contact: Cécile Fougeron (LPP-P3)

Adresse du responsable: cecile.fougeron@univ-paris3.fr

Université: Université Paris 3

Niveau: chercheur postdoctorant

Durée: 06 mois

Salaire: 12 000€ net / 6 mois

Spécialités: phonétique clinique, acoustique, échographie linguale, dysarthrie

Date limite de candidature: 21 novembre 2012

Adresse pour la candidature: martine.adda-decker_at_univ-paris3.fr

Référence de candidature: EFL-PPC5

Back

Top

6-9

(2012-10-20) Large-Scale Audio Indexing Researchers/Engineers: 2 W/M positions at IRCAM-Paris

Job Openings: Large-Scale Audio Indexing Researchers/Engineers: 2 W/M positions at IRCAM

Starting : January, 2013

Duration : 18 months

Position description A

The hired Researcher will be in charge of the research and the development of scalable technologies for supervised learning (i.e. scaling GMM, PCA or SVM algorithms) to be applicable to millions of annotated data.

He/she will then be in charge of the application of the developed technologies for the training of large-scale music genre and music mood models and their application to large-scale music catalogues.

Required profile for A:

High skill in audio indexing and data mining
Previous experience into scalable machine-learning models
High-skill in Matlab programming, skills in C/C++ programming
Skill in audio signal processing (spectral analysis, audio-feature extraction, parameter estimation)
Good knowledge of Linux, Windows, MacOS environments
High productivity, methodical works, excellent programming style.

Position description B

The hired Engineer/Researcher will be in charge of the development of the framework for scalable storage, management and access of distributed data (audio and meta-data). He/she will be also in charge of the development of scalable search algorithms.

Required profile for B:

High skill in database management systems
High skill in indexing technologies (hash-table, m-trees, …)
Good knowledge of Linux, Windows, MacOS environments
High productivity, methodical works, excellent programming style.

The hired Engineers/Researchers A and B will also collaborate with the development team and participate in the project activities (evaluation of technologies, meetings, specifications, reports).

Introduction to IRCAM

IRCAM is a leading non-profit organization associated to Centre Pompidou, dedicated to music production, R&D and education in sound and music technologies. It hosts composers, researchers and students from many countries cooperating in contemporary music production, scientific and applied research. The main topics addressed in its R&D department include acoustics, audio signal processing, computer music, interaction technologies, musicology. Ircam is located in the centre of Paris near the Centre Pompidou at 1, Place Igor Stravinsky 75004 Paris.

Salary

According to background and experience

Applications

Please send an application letter together with your resume and any suitable information addressing the above issues preferably by email to: peeters_a_t_ircam dot fr with cc to vinet_a_t_ircam dot fr, roebel_at_ircam_dot_fr

Back

Top

6-10

(2012-11-03) Research Scientist at Yandex Zurich

Title: Research Scientist

Opening in our Zurich (Switzerland) office. (Working language: English)

Yandex (company.yandex.com) is the leading search engine in Russia with an extensive set of added value services.

We are seeking an experienced and motivated person to join our team developing basic technology and applications.
The ideal candidate will be able to take responsibility for developing and implementing advanced modules resulting in improved speech recognition performance.

Responsibilities:
-Ability to independently develop new and improved algorithms for speech recognition
-Analyze speech recognition performance and implement solutions to provide optimum accuracy
-Use, improve and create research tools to create, update and optimize speech recognition systems for multiple domains
-Work with the team to design future products

Required:
-Higher degree in speech science, machine learning, or related field
-Experience developing ASR applications - training, tuning, and optimization
-Software development experience
-Programming experience in C/C++
-Excellent communication skills in English
-Willing to relocate to Switzerland

Desirable:
-Experience in advanced aspects of speech recognition (e.g. noise robust ASR, adaptation, discriminative training, decoding methods, etc.)
-Knowledge of scripting languages (especially python)
-Experience in commercial projects in the area of speech recognition and other speech technologies.
-Experience with applications for GPU is a plus
-Background in natural language processing, machine learning and/or computational linguistics is a plus

Interested candidates, please send application to:
barbara@yandex-team.ru

Back

Top

6-11

(2012-11-08) Postdoc position at IBM Language and Knowledge Center, Trento , Italy

The newly established IBM Language and Knowledge Center, Trento , Italy has a postdoc position in the following area:

-Natural Language Dialog

The postdoc scholar will be part of a Êresearch project aiming at designing machines that interact with humans and support them in complex and large scale knowledge and decision making tasks. The team includes researchers Êfrom IBM and the TrentoRise Human Language Technology Center founded by the University of Trento and FBK. Candidates with strong research background in at least one of the following:

- Conversational Dialogue Systems - Statistical models of Dialogue - Natural Language Understanding - Machine Learning - Question Answering Systems

are invited to apply.

The official postdoc position application site:

http://www.trentorise.eu/call-for-participation/bando-di-selezione-l-call-po sitions

If you would like to enquiry about the position send an email along with the CV addressed to

Prof. Giuseppe Riccardi sisl-jobs@disi.unitn.it Subject: Postdoc Position at IBM, Trento

Deadline: February 5, 2013

Back

Top

6-12

(2012-10-12) PhD position on The influence of robots on the development of language, New Zealand

PhD Position: The influence of robots on the development of language

Job Posting

Project description

The ‘Wordovators’ project is a three-year project funded by the John Templeton Foundation. The project will conduct large-scale experiments in the form of computerized word games. These games will be designed to probe the factors underpinning word creation and creativity, and how these develop through the life-span. One strand of the project will probe particular issues surrounding interactions between people and humanoid Robots. How are new words created and adopted in contexts involving such interactions? This PhD position is for a highly motivated student to join the project team, and conduct work that explores the ways that robots might shape human languages. These studies will analyze the factors and processes that might contribute to the influence of robots on the vocabularies of English and of artificial languages in imaginary worlds.

This project is a collaboration between University of Canterbury, New Zealand and Northwestern University, USA. The PhD candidate will enroll for a PhD degree in the HIT Lab NZ at University of Canterbury, and will be primarily supervised by Dr Christoph Bartneck. (the HIT Lab NZ). Other associated faculty are Professor Jen Hay (NZILBB), Janet Pierrehumbert (Northwestern University / Adjunct Professor NZILBB), and Professor Stephanie Stokes (NZILBB). The PhD student will be encouraged to regularly visit Northwestern University.

Your skills

You should have an interest in human language and have a strong background in robotics or computer science.

The HIT Lab NZ

The Human Interface Technology Laboratory New Zealand (HIT Lab NZ) is world leading research institutions developing and commercializing technology that improves human computer interaction. The HIT Lab NZ has over 50 staff and students and has extensive experience in Human Computer Interaction and Science & Technology Studies. The HIT Lab NZ is located at the University of Canterbury in Christchurch, New Zealand. The University of Canterbury has the top Engineering School in New Zealand, including a highly ranked Department of Computer Science. For more information about the HIT Lab NZ see http://www.hitlabnz.org/.

NZILBB

The HIT Lab NZ at the University of Canterbury is affiliated with the New Zealand Institute of Language, Brain and Behaviour (NZILBB). NZILBB is a multi-disciplinary centre dedicated to the study of human language. The researchers come from a wide range of disciplines, forging connections across linguistics, speech production and perception, language acquisition, language disorders, social cognition, memory, brain imaging, cognitive science, bilingual education, and interface technologies. More information is available at: http://www.nzilbb.canterbury.ac.nz/.

Christchurch

Christchurch is the second largest city in New Zealand and offers an exciting and easy lifestyle for students. It is the most affordable major city to live in. It is easy to get around whether you are biking, walking, driving or using the excellent public transport system. Christchurch also offers outstanding opportunities for outdoor activities, and is close to both surf beaches and ski-fields.

Appointment and Scholarship Support

The PhD scholarship is full time for a duration of three years with an annual scholarship of $25,000 NZD. The scholarship will also cover the tuition fees.

The research in this project must be concluded with writing a PhD thesis within the Human Interface Technology PhD program of the HIT Lab NZ. For more information about the PhD program in Human Interface Technology, please see http://www.hitlabnz.org/index.php/education/phd-program.

Further Information and Application

Further information can be obtained by contacting Christoph Bartneck (christoph.bartneck@canterbury.ac.nz). Information about the HIT Lab NZ is available at: http://www.hitlabnz.org. Please upload your application as one PDF file at http://www.hitlabnz.org/index.php/jobs/job/37/ Your application must include a letter explaining your specific interest in the project, an extensive curriculum vitae, your academic records, and a list of two references. Applications will be accepted until November 15th, 2012 or until position is filled.

International applicants will be required to arrange for their NZ student visa after an offer of a place. Please check http://www.immigration.govt.nz for information about what type of visa might be most suitable and the process of acquiring it. The university has various types of accommodation available on campus. Please check http://www.canterbury.ac.nz/accom/ for information about the options and prices. International students should also consult the International Student website at http://www.canterbury.ac.nz/international/ to learn about the cost of living, fees, and insurances.

Back

Top

6-13

(2012-11-15) Faculty positions at CSLP at the Johns Hopkins University in Baltimore, USA

The Center for Language and Speech Processing at the Johns Hopkins University in Baltimore, USA,
http://www.clsp.jhu.edu
seeks applicants for a tenure-track or tenured faculty member in speech and language processing. Rank will be dependent on the experience and accomplishments of the candidate.

Applicants must have a Ph.D. in a relevant discipline and will be expected to establish a strong, independent, multidisciplinary, internationally recognized research program. Commitment to quality teaching at the undergraduate and graduate levels is required. We are committed to building a diverse educational environment; women and minorities are especially encouraged to apply.

Prospective candidates should email rscully@jhu.edu for further information.

>>>>>>>>>>>>>>

Hynek Hermansky
Julian S. Smith Professor in Electrical Engineering
Director
Center for Language and Speech Processing
The Johns Hopkins University
3400 N. Charles Street, Hackerman Hall
Baltimore, Maryland 21218
410-516-6766
hynek@jhu.edu

Admin. support:
Ruth Scally
410-516-4237
rscally1@jhu.edu

Back

Top

6-14

(2012-11-26) INRIA-Internship for Master2 Students

INRIA-Internship for Master2 Students

Title:

Speech analysis for Parkinson's disease detection

Description:

Parkinson's disease (PD) is one of the most common neurodegenerative disorders and its clinical

diagnosis, particularly early one, is still a difficult task. Recent research has shown that the speech

signal may be useful for discriminating people with PD from healthy ones, based on clinical evidence

which suggests that the former typically exhibit some form of vocal disorder. In fact, vocal disorder

may be amongst the earliest PD symptoms, detectable up to five years prior to clinical diagnosis. The

range of symptoms present in speech includes reduced loudness, increased vocal tremor, and

breathiness (noise). Vocal impairment relevant to PD is described as dysphonia (inability to produce

normal vocal sounds) and dysarthria (difficulty in pronouncing words). The use of sustained vowels,

where the speaker is requested to sustain phonation for as long as possible, attempting to maintain

steady frequency and amplitude at a comfortable level, is commonplace in clinical practice. Research

has shown that the sustained vowel “aaaaa” is sufficient for many voice assessment applications,

including PD status prediction.

The first goal of this internship is to implement/improve some state-of-the-art algorithms for dysphonia

measures and use them within an appropriate classifier (like SVM) to discriminate between disordered

and healthy voices. These measures are based on linear and nonlinear speech analysis and are well

documented in [1]. The experiments will be carried on on the well established Kay Elemetrics

Disordered Voice Database (

http://www.kayelemetrics.com/).

The second goal is to try to develop new dysphonia measures based on novel nonlinear speech

analysis algorithms recently developed in the GeoStat team [2]. These algorithms have indeed shown

significant improvements w.r.t. state-of-the-art techniques in many applications including speech

segmentation, glottal inverse filtering and sparse modeling.

The work of this internship will be conducted in collaboration with Dr. Max Little (MediaLab of MIT

and Imperial College of London) and should lead to a PhD fellowship proposition.

References:

[1] A. Tsanas,

M.A. Little, P.E. McSharry, J. Spielman, L.O. Ramig. Novel speech signal processing

algorithms for high-accuracy classification of Parkinson’s disease. IEEE Transactions on Biomedical

Engineering,

59(5):1264-1271. 2012

[2] PhD thesis of Vahid Khanagha. INRIA Bordeaux-Sud Ouest. January 2013.

Prerequisites:

Good level in mathematics and signal/speech processing is necessary, as well as Matlab

and C/C++ programing. Knowledge in machine learning would be an advantage.

Supervisor:

Khalid Daoudi (khalid.daoudi@inria.fr), GeoStat team (http://geostat.bordeaux.inria.fr).

Location:

INRIA- Bordeaux Sud Ouest (http://www.inria.fr/bordeaux). Bordeaux, France.

Starting date:

Fev/Mars 2012.

Duration

: 6 months

Salary:

1200 euros / month

Back

Top

6-15

(2012-12-07) Doctoral and Post-doctoral Positions in Signal Processing for Hearing Instruments, Bochum, Germany

Doctoral and Post-doctoral Positions in Signal Processing for Hearing Instruments

Position Description

The ITN ICanHear, starting on 1 January 2013, will provide cutting-edge research projects for 12 doctoral and 5 post-doctoral research fellows in digital signal processing for hearing instruments. ICanHear aims to develop models based on emerging knowledge about higher-level processing within the auditory pathway and exploit that knowledge to develop creative solutions that will improve the performance of hearing instruments.

Attractive grants and a wide variety of international training activities, including collaborations with ICanHear Associated Partners in the U.K., Switzerland, Belgium, U.S.A., and Canada, will be made available to successful candidates, who will stay in the network for a period of 12 to 36 months.

Research and training positions will be available in the following ICanHear labs:

Institute of Communication Acoustics, Ruhr-Universität Bochum (DE)
Experimental Oto-rhino-laryngology (ExpORL), Katholieke Universiteit Leuven (BE)
Institute of Sound and Vibration Research, University of Southampton (UK)
Hearing Systems Group, Denmark Technical University, Lyngby (DK)
Laboratory for Experimental Audiology, University Hospital Zurich (CH)
Signal Processing Group, Siemens Hearing Instruments (DE)
Cochlear Research and Development, Cochlear Ltd., Mechelen (BE)

Requirements for Candidates and Application procedure:

Early-stage (doctoral) Research Fellows have less than four years experience in research after obtaining a Masters degree in engineering, computer science, or similar.

Experienced (post-doctoral) Researcher Fellows are already in possession of a doctoral degree or have at least 4 years but less than 5 years of research experience in engineering and/or hearing research.

In order to ensure transnational mobility candidates may have resided no more than 12 months (during the last 3 years) in the country of the host institution they wish to apply to. For all positions excellent English language skills are required.

To apply please send in the following documents via e-mail to the ICanHear coordination office (icanhear@rub.de): CV, certified copies of all relevant diplomas and transcripts, two letters of recommendation, proof of proficiency in English, letter of motivation (research interest, reasons for applying to programme and host). For further information on research projects available, application details and eligibility please visit the ICanHear web-site (http://www.icanhear-itn.eu) or contact the project coordinator Rainer Martin (rainer.martin@rub.de).

Back

Top

6-16

(2012-12-15) Technicien en instrumentation scientifique,expérimentation et mesure Aix-en-Provence France

CAMPAGNE NOEMI HIVER 2012-2013 PROFIL DE POSTE Description de l'Unité Code unité : UMR 7309 Nom de l’unité : Laboratoire Parole et Langage Directeur : Noël NGUYEN Ville : Aix-en-Provence Délégation régionale : DR12 Institut : INSHS Description du poste NUMERO NOEMI : T54030 CORPS : Technicien BAP : C Emploi-type : C4B21-Technicien en instrumentation scientifique, expérimentation et mesure Fonction Technicien de Plateforme Technicien de Plateforme Mission Au sein du Laboratoire Parole et Langage (LPL), affecté au Centre d’Expérimentation sur la Parole (CEP), l’agent sera chargé du soutien aux expériences en collaboration avec le coordinateur de la plateforme. Activités L’activité principale consiste à apporter un soutien quotidien au fonctionnement de la plateforme, il peut s’agir notamment : - D’assurer le prêt et le suivi du matériel utilisé sur la plateforme ou à l’extérieur, - D’effectuer le montage, l’assemblage de sous-ensembles (notamment audio et vidéo) pour la réalisation d’expériences, - D’assister les expérimentateurs lors de la passation d’expériences en appliquant un protocole défini, - D’effectuer des modifications ou adaptations de dispositifs expérimentaux, - D’assurer la maintenance et les interventions de premier niveau, la détection et le diagnostique de pannes,. - De réaliser des enregistrements (installation et enregistrement proprement-dit) audio et vidéo, - D’assurer la gestion des consommables nécessaires au déroulement des expériences, - D’utiliser des applications logicielles de contrôle d’instruments. Compétences Le (ou)la candidate devra faire preuve d’une grande motivation pour ce poste de soutien indispensable au fonctionnement du centre d’expérimentation sur la parole. Une formation de base en électronique et/ou en mesures physiques est souhaitée, pour mener à bien la réalisation éventuelle de modules élémentaires de synchronisation entre instruments, ou encore pour synchroniser les systèmes d’enregistrement audio et vidéo, pour effectuer le montage et l’assemblage de sous-ensembles pour la réalisation de dispositifs expérimentaux. Le ou la candidate devra apprécier le travail en équipe puisqu’il ou elle travaillera en lien étroit avec le coordinateur de la plateforme. La personne doit être capable d’apprendre de nouvelles techniques, et avoir goût pour lele sens du contact humain car elle sera en contact avec un grand nombre d’utilisateurs. Elle devra montrer une grande rigueur dans le respect des procédures mises en place. L’adhésion aux règles d’hygiène et sécurité en place ainsi est indispensable. Contexte Le Laboratoire Parole et Langage est une unité de recherche du CNRS et d’Aix Marseille Université. Il accueille des phonéticiens, linguistes, des informaticiens, des psychologues, des neuroscientifiques, des physiciens et des médecins. Les activités du LPL portent sur l’étude des mécanismes de production et de perception du langage et de la parole. Le LPL se distingue par ses méthodes de recherche reposant à la fois sur l’expérimentation, l’investigation instrumentale et la formalisation. Approche originale dans ce champ scientifique, qui émarge à la fois aux domaines des sciences humaines, des sciences du vivant et des sciences pour l’ingénieur. Cette particularité explique, au-delà d’une forte activité de recherche fondamentale, l’importance des applications développées à partir des travaux menés dans les domaines du traitement de l’écrit, de l’intelligibilité du message parlé, de la conversion texte-parole de qualité, ou encore de l’évaluation et de la rééducation des troubles de la voix ou du langage. Ces caractéristiques font du Laboratoire Parole et Langage une unité de recherche adaptée aux défis scientifiques des sciences du langage, tout en étant impliquée dans leurs enjeux technologiques. Le LPL regroupe actuellement plus de 80 personnes statutaires (chercheurs, enseignants-chercheurs, ingénieurs, techniciens, administratifs), auxquelles s’ajoutent 40 doctorants dont 20 boursiers. Il est le laboratoire français le plus important dans ce domaine scientifique et l’un des premiers en Europe. Le LPL dispose désormais d’une plateforme technique regroupant un ensemble d’instruments pour l’investigation de la production et la perception de la parole : électro-encéphalographie, tracking oculaire, articulographie, électro- palatographie, évaluation articulatoire, etc. Cette ressource unique en Europe est mutualisée au sein du Centre d’Expérimentation sur la Parole (http://www.lpl.univ-aix.fr/~cep/), plateforme technique à laquelle le poste sera affecté le poste.

Back

Top

6-17

(2012-12-16) Master project IRISA Rennes France

Computer Science Internship

CORDIAL group

Title :

Voice Conversion from non-parallel corpora

Description :

The main goal of a voice conversion system (VCS) is to transform the speech

signal uttered by speaker (the source speaker) so that it sounds like it was uttered by an other

person (the target speaker). The applications of such techniques are limitless. For example, a

VCS can be combined to a Text-To-Speech system in order to produce multiple high quality

synthetic voices. In the entertainment domain, a VCS can be used to dub an actor with its own

voice.

State of the art VCS use Gaussian Mixture Models (GMM) to capture the transformation

from the acoustic space of the source to the acoustic space of the target. Most of the models are

source-target joint models that are trained on paired source-target observations. Those paired

observations are often gathered from parallel corpora, that is speech signals resulting from the

two speakers uttering the same set of sentences. Parallel corpora are hard to come with. Moreo-

ver, they do not guaranty that the pairing of vectors is accurate. Indeed, the pairing process is

unsupervised and uses a Dynamic Time Warping under the strong (and unrealistic) hypothesis

that the two speakers truly uttered the same sentence, with the same speaking style. This asser-

tion is often wrong and results in non-discriminant models that tends to over-smooth speaker's

distinctive characteristics.

The goal of this Master subject is to suppress the use of parallel corpora in the process of

training joint GMM for voice conversion. We suggest to pair speech segments on high level speech

descriptors as those used in Unit Selection Text-To-Speech. Those descriptors not only contain

the segmental information (acoustic class for example) but also supra-segmental informations

such as phoneme context, speed, prosody, power, ... In a rst step, both source and target

corpora are segmented and tagged with descriptors. In a second step, each class from one corpus

is paired with the equivalent class from the other corpus. Finally, a classical DTW algorithm

can be applied on each paired class. The expected result is to derive transform models that both

could take into account speaker variability and be more robust to pairing errors.

Keywords :

Voice Conversion, Gaussian Mixture Models

Contacts :

Vincent Barreaud (vincent.barreaud@irisa.fr)

Bibliographie :

[1] H. Benisty and D. Malah. Voice conversion using gmm with enhanced global variance. In

Conference of the International Speech Communication Association (Interspeech)

, pages 669{

672, 2011.

[2] L. Mesbahi, V. Barreaud, and O. Boeard. Non-parallel hierarchical training for voice conver-

sion. In

Proceedings of the 16th European Signal Processing Conference, Lausanne, Switzerland, 2008.

[3] Y. Stylianou, O. Cappe, and E. Moulines. Continuous probabilistic transform for voice conver-

sion.

IEEE Transactions on Speech and Audio Processing, 6(2) :131-142, 1998.

Back

Top

6-18

(2012-12-16) Master project 2 IRISA Rennes France

Computer Science Internship

CORDIAL group

Title :

Unit-selection speech synthesis guided by a stochastic model of spectral and prosodic

parameters.

A Text-To-Speech system (TTS) produces a speech signal corresponding to the vocalization

of a given text. Such a system is composed of a linguistic processing stage followed by an acoustic

one which complies as much as possible with the linguistic directives. Concerning the second step,

the most used approaches are

{ the corpus based synthesis approach which lies on the selection and concatenation of unit

sequences extracted from a large continuous speech corpus. It has been popular for 20

years, yielding an unmatched sound quality but still bearing some artefacts due to spectral

discontinuities.

{ the statistical approach. The new generation of TTS systems has emerged in the last years,

reintroducing the rule based systems. The rules are no longer deterministic like in the

rst systems in the 1950's, but they are replaced by stochastic models. HTS, an HMMbased

speech synthesis system, is currently the most used statistical system. The HTS type

systems yield a good acoustic continuum but with a sound quality strongly depending on

the underlying acoustic model.

Recently, some hybrid synthesis systems have been proposed, combining the statistical approach

with the method of unit selection. It consists in using the acoustic descriptions and the

melodic contours generated by a statistical system in order to drive the cost function during the

natural speech unit selection phase, or also, substituting the poor quality natural speech units

by units derived from a statistical system.

The framework of this subject is the corpus based TTS. Considering the combinatorial problem

due to the search of an optimal unit sequence with a blind sequencing, the work consists

in determining heuristics to reduce the search space and satisfy a real time objective. These

assumptions, based on spectral and prosodic type parameters generated by HTS, will permit to

implement pre-selection lters or to propose new cost functions within the corpus based system

developped by the Cordial group. The production of the hybrid system will be evaluated and

compared via listening tests with standard systems like HTS and a corpus based system.

Keywords :

TTS, Corpus based speech synthesis, Statistical Learning, Experiments.

Contacts :

Olivier Boe
ard, Nelly Barbot, Damien Lolive (prenom.nom@irisa.fr)

Bibliography :

[1] A. W. Black and K. A. Lenzo,

Optimal data selection for unit selection synthesis, 4th ISCA

Tutorial and Research Workshop on Speech Synthesis, 2001.

[2] H. Kawai, T. Toda, J. Ni, M. Tsuzaki and K. Tokuda,

Ximera : a new tts from atr based on

corpus-based technologies

. ISCA Tutorial and Research Workshop on Speech Synthesis, 2004.

[3] S. Rouibia and O. Rosec,

Unit selection for speech synthesis based on a new acoustic target

cost

, Interspeech, 2005.

[4] H. Zen, K. Tokuda and A. W. Black,

Statistical parametric speech synthesis. Speech Communication,

v.51, n.11, pages 1039{1064, 2009.

[5] H. Silen, E. Helander, J. Nurminen, K. Koppinen and M. Gabbouj,

Using Robust Viterbi

Algorithm and HMM-Modeling in Unit Selection TTS to Replace Units of Poor Quality

Interspeech 2010.

Back

Top

6-19

(2012-12-16) Master project 3 IRISA Rennes France

Computer Science Internship

CORDIAL group

Title: Grapheme-to-phoneme conversion adaptation using conditional random elds

Description:

Grapheme-to-phoneme conversion consists in generating possible pronuncia-

tions for an isolated word or for a sequence of words. More formally, this conversion is a translit-

eration of a sequence of graphemes, i.e., letters, into a sequence of phonemes, symbolic units to

represent elementary sounds of a language. Grapheme-to-phoneme converters are used in speech

processing

either to help automatic speech recognition systems to decode words from a speech signal

or as a mean to explain speech synthesizers how a written input should be acoustically

produced.

A problem with such tools is that they are trained on large and varied amounts of aligned

sequences of graphemes and phonemes, leading to generic manners of pronouncing words in a

given language. As a consequence, they are not adequate as soon as one wants to recognize

or synthesize specic voices, for instance, accentuated speech, stressed speech, dictating voices

versus chatting voices,

etc. [1].

While multiple methods have been proposed for grapheme-to-phoneme conversion [2, 3], the

primary goal of this internship is to propose a method to adapt grapheme-to-phoneme models

which can easily be adapted under conditions specied by the user. More precisely, the use of

conditional random elds (CRF) will be studied to model the generic French pronunciation and

variants of it [4]. CRFs are state-of-the-art statistical tools widely used for labelling problems

in natural language processing [5]. A further important goal is to be able to automatically

characterize pronunciation distinctive features of a given specic voice as compared to a generic

voice. This means highlighting and generalizing di
erences that can be observed between two

sequences of phonemes derived from a same sequence of graphemes.

Results of this internship would be integrated into the speech synthesis platform of the team

in order to easily and automatically simulate and imitate specic voices.

Technical skills:

C/C++ and a scripting language (e.g., Perl or Python)

Keywords:

Natural language processing, speech processing, machine learning, statistical learn-

ing

Contact:

Gwenole Lecorve (gwenole.lecorve@irisa.fr)

References:

[1] B. Hutchinson and J. Droppo. Learning non-parametric models of pronunciation. In

Pro-

ceedings of ICASSP

, 2011.

[2] M. Bisani and H. Ney. Joint-sequence models for grapheme-to-phoneme conversion. In

Speech

Communication

, 2008.

[3] S. Hahn, P. Lehnen, and Ney H. Powerful extensions to crfs for grapheme to phoneme

conversion. In

Proceedings of ICASSP, 2011.

[4] Irina Illina, Dominique Fohr, and Denis Jouvet. Multiple pronunciation generation using

grapheme-to-phoneme conversion based on conditional random elds. In

Proceedings of

SPECOM

, 2011.

[5] John D. La
erty, Andrew McCallum, and Fernando C. N. Pereira. Conditional random

elds: probabilistic models for segmenting and labeling sequence data. In

Proceedings of

ICML

, 2001.

Back

Top

6-20

(2013-01-14) Ph.D. Researcher in Speech Synthesis, Trinity College, Dublin, Ireland

Post Specification

Post Title:

Ph.D. Researcher in Speech Synthesis

Post Status:

3 years

Department/Faculty:

Centre for Language and Communication Studies (CLCS)

Location:

Phonetics and Speech Laboratory

Salary:

€16,000 per annum (plus fees paid)

Closing Date:

31st January 2013

Post Summary

A Ph.D. Researcher is required to work in the area of speech synthesis at the Phonetics and

Speech Laboratory, School of Linguistic, Speech and Communication Sciences. The position

will involve carrying out research on the topic of Hidden Markov Model (HMM)-based speech

synthesis. Specifically, we are looking for a researcher to work on developing a source-filter

based acoustic modelling for HMM-based speech synthesis which is closely related to the

human speech production process and which can facilitate modification of voice source and

vocal tract filter components at synthesis time.

Background to the Post

Much of the research carried out to date in the Phonetics and Speech Laboratory has been

concerned with the role of the voice source in speech. This research involves the development of accurate voice source processing both as a window on human speech production and for exploitation in voice-sensitive technology, particularly synthesis. The laboratory team is interdisciplinary and includes engineers, linguists, phoneticians and technologists.

This post will the main be funded by the on-going Abair project which has developed the first

speech synthesisers for Irish (www..abair.ie), and the researcher will exploit the current Abair

synthesis platform. In this project the aim is to deliver multi-dialect synthesis with multiple

personages and voices that can be made appropriate to different contexts of use. The post will also be linked to the FastNet project which aims at voice-sensitive speech technologies.

A specific goal of our laboratory team is to leverage our expertise on the voice by improving the naturalness of parametric speech synthesis, as well as making more flexible synthesis platforms which can allow modifications of voice characteristics (e.g., for creating different personalities/characters, different forms of expression etc).

Standard duties of the Post

Initially the researcher will be required to attend some lectures as part of the Masters

programme on Speech and Language Processing. This and a supervised reading

programme will provide a background in the area of voice production, analysis and

synthesis.

* In the very early stages the researcher will be required to develop synthetic voices, using

the Irish corpora, with the standard HMM-based synthesis platform (i.e. HTS). Note that

to work with the Irish corpora does not require a background in the Irish language, as

there will be collaboration with experts in this field.

* The researcher will be required to familiarise themselves with existing speech synthesis

platforms which provide explicit modelling of the voice source (e.g., Cabral et al. 2011,

Raitio et al. 2011, Anumanchipalli et al. 2010).

* The researcher will then need to first implement similar versions of these systems and

then work towards developing novel vocoding methods which would allow full parametric

flexibility of both voice source and vocal tract filter components at synthesis time.

Person Specification

Qualifications

* Bachelors degree in Electrical Engineering, Computer Science with specialisation in

speech signal processing, or related areas.

* Knowledge & Experience (Essential & Desirable)

* Strong digital signal processing skills (Essential)

*Good knowledge of HTS including previous experience developing synthetic voices

(Essential)

* Knowledge of speech production and perception (Desirable)

* Experience in speech recognition (Desirable)

Skills & Competencies

* Good knowledge of written and spoken English.

Benefits

* Opportunity to work with a world-class inter-disciplinary speech research group.

To apply, please email a brief cover letter and CV, including the names and addresses of two

academic referees, to: kanejo@tcd.ie and to cegobl@tcd.ie

Back

Top

6-21

(2013-01-12) Stage de Master à IRIT Toulouse F

Nous sommes à la recherche d'une personne pour un stage de Master 2 recherche qui sera suivi d'un financement CIFRE de 3 ans avec AIRBUS. Merci de me contacter en m'envoyant un CV et une lettre de motivation le plus rapidement possible.

Contact
~~~~~~
Jérôme Farinas
Equipe SAMOVA
Institut de Recherche en Informatique de Toulouse
Tél : 05 61 55 74 34
Mèl : jfarinas@irit.fr

Mots clés
~~~~~~~
Parole spontanée, son, bruit, audio, reconnaissance, transcription, apprentissage.

Contexte de l’étude
~~~~~~~~~~~~~~~
Lors d’un vol, tous les paramètres sont enregistrés dans deux calculateurs distincts, le DFDR (Digital Flight Data Recorder) et le CVR (Cockpit Voice Recorder). Le DFDR enregistre les paramètres techniques du vol. Le CVR enregistre toutes les conversations entre l’équipage, le personnel de cabine, les centres au sol (navigation aérienne, compagnie…). Sont également enregistrés toutes les alarmes qui peuvent survenir à bord ainsi que tous les bruits qui peuvent être entendus dans le poste de pilotage.

Intérêt de l’étude
~~~~~~~~~~~~~
Au sein du département avionique, le contenu du CVR, suite à un vol d’essais et plus particulièrement ceux liés à une certification, est analysé afin de corréler tous les évènements tracés par les pilotes et ingénieurs navigants avec le contenu de l’enregistrement. Cette analyse a aussi pour but d’identifier et de caractériser tous les évènements sonores imprévus. L’analyse et la transcription sont faites par une simple écoute.
Afin d’améliorer la fiabilité, la pertinence, l’exhaustivité et la répétitivité des analyses effectuées, l’intérêt de l’étude est de proposer des algorithmes capables d’extraire du bruit ambiant du poste de pilotage, de la voix, des sons synthétiques et des bruits caractéristiques afin de les transcrire automatiquement.

Principaux objectifs de l’étude
~~~~~~~~~~~~~~~~~~~~~~~~
Ce stage adresse à la fois des problématiques liées à la reconnaissance de sons prédéfinis (existence d’une référence audio), la détection de bruits spécifiques et la transcription de la parole spontanée.
Ce sujet de stage est lié à une thèse qui sera financée avec un financement CIFRE avec AIRBUS. La thèse est divisée en 3 parties correspondant à l’étude de chacune des trois catégories de sons explorées ; les approches seront de fait différentes :
1. Pour la reconnaissance des sons (alarmes, code morse…), il est possible de définir des prototypes ou références. C’est pourquoi l’étude s’orientera vers l’emploi d’une part d’algorithmes de détection de fréquences caractéristiques en prenant en compte les contraintes d’un environnement hétérogène (bruit, recouvrement…) et d’autre part de méthodes de reconnaissance des formes classiques appliqués à l’audio tels que ceux employés en reconnaissance de motifs sonores [13].
2. La détection des bruits représentatifs (régime moteur, train d’atterrissage…) ou inattendus (usure anormale, parasite…), passera par une analyse des signatures
acoustiques caractéristiques pour en déduire une modélisation. Cette détection nécessitera une phase d’apprentissage.
Pour ces deux premières parties, le titulaire s’appuiera sur les résultats d’un stage de fin de cycle d’ingénieur pour lequel un prototype a déjà été développé. Il s’agira
d’enrichir la démarche scientifique, puis de compléter et de confirmer les choix techniques qui ont été proposés.
3. La reconnaissance de la parole qui est de loin la plus importante, s’orientera d’abord vers une étude de faisabilité. Les deux principales difficultés sont liées à la production même de la parole qui est quasi « spontanée » et à l’environnement dans lequel la prise de son est faite.
Dans un premier temps, le titulaire devra effectuer une analyse très précise de l’environnement, bien plus contraignant que les environnements « conventionnels »
dans lesquels sont développés la majorité des systèmes de reconnaissance de la parole (parole téléphonique, journaux d'information en anglais [1], français [2],
sessions du parlement européen [3]). Il existe des études sur l'influence de la dégradation due aux environnements bruités : dans le secteur de la construction [4],
dans le secteur aéronautique [5]. Les travaux dans ce dernier secteur ont principalement pour objectif depuis les années 1980 de réaliser une simple
commande vocale dans les cockpits. L'analyse de la parole à partir d'enregistrements CVR n’a jamais fait l'objet d'étude, seules des recherches sur les sons d'alerte ont été menées [6,7].
Dans un second temps et une fois l’analyse effectuée, le titulaire s’attachera à définir les spécifications du système de reconnaissance automatique, tout en respectant les contraintes suivantes :
- Contraintes liées à un environnement fortement bruité et altéré par des événements sonores liés à l’exploitation de l’avion (recouvrement).
- Contraintes liées à une population multiculturelle : langue (capacité à changer de langue pour un même locuteur), accent, vocabulaire.
- Contraintes liées aux conditions dans lesquelles le locuteur opère : augmentation du débit, stress, fatigue.
Les principales pistes de recherche envisagées se déclinent selon deux axes :
- La compensation au niveau de la paramétrisation : une première analyse des différents bruits des enregistrements CVR permettra de cibler les traitements pour lutter contre le bruit existant (soustraction cepstrale, normalisation de la variance, filtrage ARMA, filtres RASTA...). Un deuxième axe consiste à s’appuyer sur des résultats récents en compensation de bruit dans le domaine cepstral, domaine de paramétrisation le plus performant en reconnaissance de parole. Il s’agit de décomposer l’espace de représentation en une composante utile pour la reconnaissance et une composante dite de nuisance qui rend compte de la variabilité de la session d’enregistrement [14]. Dans le cas présent et le cadre particulier des cockpits, cette variabilité représenterait l’environnement bruité, mais elle pourrait aussi rendre compte des conditions de stress du locuteur.
- L’adaptation au niveau des modèles acoustiques et linguistiques : les modélisations de la parole devront être adaptées aux différentes formes de parole présentes dans les enregistrements. Il s’agira de regarder plus précisément les vocabulaires employés pour les enrichir éventuellement, et tenir compte de la spontanéité au travers des modèles de langage (le traitement de la spontanéité reste à ce jour un défi très important à relever). Les techniques d'adaptation des lois des modèles acoustiques par Maximum Likelihood Linear Regression (MLLR) [8] et Maximum A Posteriori (MAP) [9] et les techniques d’adaptation au niveau de la modélisation elle-même (analyse factorielle appliquée aux modèles de Markov cachés [10,11]) seront les points de départs. Les adaptations des lexiques et des modèles de langage devront être étendues à ce type de dialogues [12].
Durant le stage, les parties 1 et 2 devront donner lieu à un prototype utilisable. La partie 3 pourra être concrétisé par la mise en place d’un système de reconnaissance de la parole de base.

Back

Top

6-22

(2013-01-20) Ph.D. Researcher in Speech Synthesis, Trinity College, Dublin, Ireland

Ph.D. Researcher in Speech Synthesis, Trinity College, Dublin, Ireland

Post Specification

Post Title:

Ph.D. Researcher in Speech Synthesis

Post Status:

3 years

Department/Faculty:

Centre for Language and Communication Studies (CLCS)

Location:

Phonetics and Speech Laboratory

Salary:

€16,000 per annum (plus fees paid)

Closing Date:

31st January 2013

Post Summary

A Ph.D. Researcher is required to work in the area of speech synthesis at the Phonetics and

Speech Laboratory, School of Linguistic, Speech and Communication Sciences. The position

will involve carrying out research on the topic of Hidden Markov Model (HMM)-based speech

synthesis. Specifically, we are looking for a researcher to work on developing a source-filter

based acoustic modelling for HMM-based speech synthesis which is closely related to the

human speech production process and which can facilitate modification of voice source and

vocal tract filter components at synthesis time.

Background to the Post

Much of the research carried out to date in the Phonetics and Speech Laboratory has been

concerned with the role of the voice source in speech. This research involves the development

of accurate voice source processing both as a window on human speech production and for

exploitation in voice-sensitive technology, particularly synthesis. The laboratory team is interdisciplinary

and includes engineers, linguists, phoneticians and technologists.

This post will the main be funded by the on-going Abair project which has developed the first

speech synthesisers for Irish (

www.abair.ie), and the researcher will exploit the current Abair

synthesis platform. In this project the aim is to deliver multi-dialect synthesis with multiple

personages and voices that can be made appropriate to different contexts of use. The post will

also be linked to the FastNet project which aims at voice-sensitive speech technologies. A

specific goal of our laboratory team is to leverage our expertise on the voice by improving the

naturalness of parametric speech synthesis, as well as making more flexible synthesis platforms

which can allow modifications of voice characteristics (e.g., for creating different

personalities/characters, different forms of expression etc).

Standard duties of the Post

Initially the researcher will be required to attend some lectures as part of the Masters

programme on Speech and Language Processing. This and a supervised reading

programme will provide a background in the area of voice production, analysis and

synthesis.

Back

Top

6-23

(2013-01-23) Tenure-track and Research Assistant Professor positions at the Toyota Technological Institute, Chicago

Tenure-track and Research Assistant Professor positions
Toyota Technological Institute at Chicago (http://www.ttic.edu) is a philanthropically endowed academic computer science institute, dedicated to basic research and graduate education in computer science, located on the University of Chicago campus.
TTIC opened for operation in 2003. It currently has 9 tenure-track/tenured faculty, 10 research faculty, and a number of adjoint/visiting faculty, and is growing. Regular faculty have a teaching load of one course per year and research faculty have no teaching responsibilities. Research faculty positions are endowed positions (not based on grant funding) and are for a term of 3 years.
Applications are welcome in all areas of computer science, including speech and language processing, for both tenure-track and research faculty positions.
Applications can be submitted online at http://www.ttic.edu/faculty-hiring.php . Additional questions can be directed to Karen Livescu at klivescu@ttic.edu

Back

Top

6-24

(2013-01-25) Postdoc at Orange Labs Lannion

Densité de présence des personnes dans les émissions télévisuelles

Le sujet porte sur la détection automatique de la présence des personnes dans les émissions télévisuelles, et introduit une notion qualitative et quantitative de cette présence, résumée sous la dénomination « densité de présence ».

En effet, les personnes présentes dans une émission télévisuelle n’occupent pas toutes une place équivalente, durant l’émission.

Tout d’abord, il convient de différencier la notion de présence vs la citation : par exemple, on peut distinguer plusieurs niveaux de présence :

-présence physique (ou en duplex) de la personne dans l’émission : la personne parle dans l’émission (interview etc)

-présence par extrait : l’émission montre des extraits de documents audiovisuels ou la personne parle.

-citation visuelle: la personne ne parle pas , mais est montrée (images de reportages, extrait)

-citation : on parle de cette personne, qui n’est pas présente dans l’émission

Ensuite, il convient de différencier « l’intensité » de cette présence, selon le rôle qu’occupe cette personne dans l’émission : sujet principal, témoin. Cette notion d’intensité est orthogonale au type de présence : une personne peut être présente par citation uniquement, et sujet central. C’est la combinaison des niveaux de présence et de leur intensité (rôle) qui définit ce que nous proposons de nommer « densité » de présence.

Enfin, ces notions ne sont pas nécessairement constantes tout le long de l’émission, et il convient de déterminer automatiquement les segments durant lesquelles la densité de présence de la personne est constante. En pratique, cela permet par exemple d’extraire d’une émission uniquement le segment où telle personne est le sujet principal.

Le travail de ce post-doc consistera dans un premier temps à affiner ces notions de présence/intensité, pour formaliser le problème de classification/segmentation automatique associé. Il s’agira ensuite d’annoter les corpus d’émissions télévisuelles disponibles en fonction des classes de présence, puis de concevoir, développer, et tester les algorithmes permettant de répondre à ce problème.

Le postdoc est d’une durée de 12 mois, non-renouvelable, à Orange Labs Lannion, rémunéré 34k€ brut annuel (soit environ 2150€ net/mois).

Il doit être le premier contrat de travail après la soutenance de la thèse.

Pour tout renseignement : delphine.charlet@orange.com

Back

Top

6-25

(2013-01-30) 15 RESEARCH POSITIONS (PhD, Post-Doc and Research Programmer), Dublin, Ireland

15 RESEARCH POSITIONS (PhD, Post-Doc and Research Programmer) IN
MACHINE TRANSLATION, PARSING, INFORMATION RETRIEVAL, INFORMATION
EXTRACTION AND TEXT ANALYTICS AT CNGL, DCU. CLOSING DATE 15th FEBRUARY
2013
-------------

At the Centre for Next Generation Localisation (CNGL, www.cngl.ie) at
Dublin City University (DCU), Ireland, we are recruiting 11 PhDs, 3
Post-Doctoral Researchers and 1 Research Programmer (1 position).

-------------

CNGL is a €50M+ Academia-Industry partnership, funded jointly by
Science Foundation Ireland (SFI) and our industry partners, and is
entering its second cycle of funding. CNGL is looking to fill multiple
posts associated with its second phase which will focus on expansion
of our work into the challenging areas of social text sources and
multimedia content.

CNGL is an active collaboration between researchers at Dublin City
University (DCU), Trinity College Dublin (TCD), University College
Dublin (UCD), University of Limerick (UL), as well as 10 industrial
partners, including SMEs, Microsoft, Symantec, Intel, DNP, and
Welocalize.

CNGL comprises over 100 researchers across the various institutions
developing novel technologies addressing key challenges in the global
digital content and services supply chain. CNGL is involved in a large
number of European FP7 projects, as well as commercial projects in the
areas of language technologies, information retrieval and digital
content management. CNGL provides a world class collaborative research
infrastructure, including excellent computing facilities, and
administrative, management and fully integrated and dedicated on-site
commercialisation support.

The successful candidates will become part of the research team based
at DCU, joining two leading academic MT/NLP/IR and Translation
research groups (www.nclt.dcu.ie/, cttsdcu.wordpress.com/). The team’s
location at DCU, minutes from Dublin city centre, offers a highly
conducive environment for research, collaboration and innovation with
a wealth of amenities on campus.

DCU is ranked in the TOP 50 of young universities worldwide (under 50
years old) (QS Ranking) and in the TOP 100 under the Times Higher
Education (under 50 years) ranking scheme.

The research is supervised by Dr. Jennifer Foster, Dr. Sharon O'Brien,
Dr. Gareth Jones, Prof. Qun Liu and Prof. Josef van Genabith.

-----------------------------------
--* CNGL PHD STUDENSHIPS AT DCU *--
-----------------------------------

Parsing, Analytics and Information Extraction:

[PhD_CC_1] Tuning Text Analytics to User-Generated Content: Parse
quality estimation and targeted self-training.
[PhD_CC_2] Extracting Events and Opinions from User-Generated Content:
Deep parsing-based methods.

Information Retrieval:

[PhD_SD_1] Self-Managing Information Retrieval Technologies: Query,
search technique and parameter selection in information retrieval
applications
[PhD_SD_2] Indexing and Search for Multimodal (Spoken/Visual) Content:
Locating relevant content in multimodal sources

[PhD_SDCC_1] Application of Text Analytics in Information Retrieval:
Enhancing information retrieval using features from text analysis
[PhD_SDDI_1] Investigating Human-Computer Interaction Issues for
Search and Discovery with Multimodal (spoken/Visual) Content

Machine Translation:

[PhD_TL_1] Syntax- and Semantics-Enhanced Machine Learning Based MT
[PhD_TL_2] Domain Adaptation Based on Multi-Dimensional Quality
Estimation, Similarity Metrics, Clustering and Search
[PhD_TL_3] Human interaction with MT output: Usability, Acceptability,
Post-editing Research

[PhD_TLDI_1] MT and Multimodal Interaction
[PhD_TLSD_1] MT for Multimodal Cross Language Information Retrieval

Ideal candidates for the PhD studentships (except PhD_TL_3 - see
below) should have:

- excellent computing and mathematical skills
- strong machine learning and statistical skills
- strong interest in basic research, applied research and showcasing
research in demonstrator systems
- willingness to work as a team, but also be able to work on their initiative
- strong ability in independent and creative thinking
- strong problem-solving skills
- excellent communication abilities (including writing and
presentation skills)
- proficiency in English
- a background in NLP, Computational Linguistics, Information
Retrieval, Information Extraction or Machine Translation as
appropriate for the relevant position is an advantage

Ideal candidates for the PhD studentship PhD_TL_3 (human interaction
with MT output: Usability, Acceptability, Post-editing Research)
should have:

- excellent skills in empirical research in translation, technical
communication and/or HCI
- strong interest in basic research, applied research and showcasing
research in demonstrator systems
- willingness to work as a team, but also be able to work on own initiative
- strong ability in independent and creative thinking
- strong problem-solving skills
- excellent communication abilities (including writing and
presentation skills)
- high proficiency in English
- a background in Translation, NLP, Computational Linguistics, or
Machine Translation is an advantage

PhD positions are fully funded for 3 years. Stipend: Fees + €16,000
p.a. living expenses (tax free)

-------------------------------
--* POST-DOCTORAL POSITIONS *--
-------------------------------

Parsing, Analytics and Information Extraction:

[PD_CC_1] Extracting Events and Opinions from User-Generated Content:
Parsing-based deep methods (up to 2 year contract)
[PD_CC_2] Extracting Events and Opinions from UGC: Shallow methods,
including unsupervised methods (up to 2.5 year contract)

Machine Translation:

[PD_TL_1] User/Human Centric MT (up to 2.5 year contract)

Ideal candidates for the post-doctoral positions should have:

- a strong international publication record
- a background in NLP, Computational Linguistics, Information
Retrieval, Information Extraction or Machine Translation as
appropriate for the position
- excellent computing and mathematical skills
- strong machine learning and statistical skills
- strong interest in basic research, applied research and showcasing
research in demonstrator systems
- willingness to work as a team, but also be able to work on their
own initiative
- strong ability for independent and creative thinking
- strong problem-solving skills
- excellent communication abilities (including writing and
presentation skills)
- proficiency in English
- ability to identify and develop future research and funding initiatives
- willingness to supervise and assist undergraduate and postgraduate students
- ability to lead small teams of researchers, in co-operation with
the principal investigator

Indicative starting salary (subject to experience and qualifications):
€37,750 - €42,394 (taxable)

-------------------------------------
--* RESEARCH PROGRAMMER POSITIONS *--
-------------------------------------
[RProg_CC_1] Research Programmer (up to 2.5 year contract)

Parsing, Analytics and Information Extraction:

Candidates for the researcher programmer position will have:
- strong computer engineering and design skills
- excellent knowledge of one or more of the following: Java, C++, PHP, Python
- comfortable working in both UNIX and Windows environments
- excellent algorithmic and analytical skills
- candidates will hold an MSc/PhD in Computer Science/Software Engineering
- excellent communication abilities
- willingness to work as part of a team, but also be able to work on
your own initiative
- experience in Natural Language Processing, Artificial
Intelligence, Information Retrieval, Localisation etc. would be highly
advantageous
- experience with research-based software development and/or
cloud-based and grid computing technologies is also highly preferred

Indicative Salary (subject to experience and qualifications):
€37,750 - €42,394 (taxable)

-------------

**CLOSING DATE FOR APPLICATIONS ALL POSITIONS: 15th FEBRUARY 2013**
For more details please see: http://www.cngl.ie/vancancies.html

Application forms are available from:
http://www.dcu.ie/vacancies/APPLICATION_FORM_8pg.doc. Please also send
a CV with two contact points for references. When completing your
application, please indicate which positions you are applying for (in
order of preference) e.g. [PHD_TL_1], [PHD_TL_2]. Completed
application forms should be sent to Dr. Declan Groves
.

*For informal enquiries please contact the relevant PI below*:

- Dr. Jennifer Foster    [PhD_CC_1]
- Prof. Josef van Genabith
[PhD_CC_2],[PD_CC_1],[PD_CC_2],[RProg_CC_1]
- Dr. Gareth Jones     [PhD_SD_1],
[PhD_SD_2], [PhD_SDCC_1], [PhD_SDDI_1]
- Prof. Qun Liu     [PhD_TL_1], [PhD_TL_2],
[PhD_TLDI_1], [PhD_TLSD_1], [PD_TL_1]
- Dr. Sharon O'Brien    [PhD_TL_3]

Back

Top

6-26

(2013-02-01) Ph.D. Research Assistant or Post-Doctoral Researcher, Cooperative State University in Karlsruhe, Germany

This is a pre-announcement for a position in the Computer Science Department at the Cooperative State University in Karlsruhe, Germany, for a

Ph.D. Research Assistant or Post-Doctoral Researcher

in the field of Automatic Language Processing for Education

to be filled immediately with a salary according to TV-L E13 at 50% for 18 Months. The opening is in Karlsruhe, Germany, as part of a joint research project between Karlsruhe Institute of Technology (KIT), the Cooperative State University (DHBW) and the University of Education (PH) sponsored by DFG involving speech technology for educational system. (Working language: English or German)

Project cooperation partners are:

Cooperative State University (Duale Hochschule, Karlsruhe)

University of Education, Karlsruhe (Pädagogische Hochschule, Karlsruhe)

Karlsruhe Institute of Technology (KIT)

Description

Starting as soon as possible, we are seeking an experienced and motivated person to join our team of researchers from the above mentioned institutes. The ideal candidate will have knowledge in computational linguistics and algorithm design. Responsibilities include the use and improvement of research tools to update and optimize algorithms applied to diagnostics in children’s (German) writing using speech recognition and speech synthesis tools. For further details of this work, please refer to publications at SLaTE 2011, Interspeech 2011, and WOCCI 2012 by authors Berkling, Stüker, and Fay. Joint and collaborative research between the partners will be very close, offering exposure to each research lab.

Candidates:

Doctoral Research Candidates may apply and are welcome for joint research with their host institution.
Experienced (post-doctoral) Research Candidates are already in possession of a doctoral degree or have at least 3 years but less than 5 years of research experience in engineering and/or hearing research.

Requirements

Higher degree in speech science, linguistics, machine learning, or related field
Experience developing ASR applications - training, tuning, and optimization
Software development experience (for example: Perl, TCL, Ruby, Java, C)
Excellent communication skills in English
Willingness and ability to spend 18 months in Germany, working in a team with project partners
Knowledge of German linguistics, phonics, graphemes, morphology or willingness to learn
Strong interest in computational linguistics, morphology, phonics for German

Desirable:

Interest in Education and language learning
Interest in Human Computer Interaction and game mechanics
Ability to create Graphic interfaces in multi-player applications

Application Procedure: Non-EU candidates need to check their ability to reside in Germany. Interested candidates, please send application (CV, certified copies of all relevant diplomas and transcripts, two letters of recommendation , proof of proficiency in English, letter of motivation (research interest, reason for applying to position) to: Berkling@dhbw-karlsruhe.de

The Cooperative State University is pursuing a gender equality policy. Women are therefore particularly encouraged to apply. If equally qualified, handicapped applicants will be preferred.

Back

Top

6-27

(2013-02-20) Research Associate in Robust Speech Recognition at the University of Shefield UK

-- Research Associate in Robust Speech Recognition
===============================================

The University of Sheffield, Department of Computer Science invite applications for a position as Research Associate to work on a project to research and develop robust technology for recognition of speech in meetings. The associated project, DocuMeet, is funded by the European Union and involves collaboration with partners from academia and industry across Europe. The Speech and Hearing Research Group is responsible for speech technology in the project, but also contributes to some natural language understanding tasks.

Speech transcription of meeting data is a well-established task, with international competitions (held by U.S. NIST), and supported by several large-scale research projects. While significant progress has been made, the performance of recognition, detection and analysis systems is still very far from usable in many realistic, natural scenarios. There are many significant challenges seeking a solution: the acoustic complexity of meetings goes well beyond standard settings.: Noise and reverberation are standard; speech signals show significant amounts of overlap between speakers; varying degrees of emotion are present; and speakers are moving. All of these pose significant challenges to speech research and practical applications.

In the DocuMeet project we specifically work on speech recognition robustness to noise and reverberation. We aim to work on new algorithms that allow to factor environment and context in novel ways (e.g. eigen-environments). The recordings from multiple microphones can be used to remove unwanted acoustics, while knowledge about a specific environment type should be used to adjust acoustic models of the recognition systems. Further we will investigate how such algorithms can be integrated with personalisation (acoustic/language) and how metadata can be used to inform such processes. Extensive experimentation of existing and new corpora will be required to demonstrate the effectiveness of the new techniques.

Applicants are required to have a track record of work on speech technologies including speech recognition, and to have had exposure to modern machine learning techniques. Ideally, such a track record is demonstrated by publications in international journals and conferences. The successful candidate will be required to hold a PhD in the field; work on the project will require publication of results, travelling to conferences and extensive visits to itslanguage offices. At this point the project duration is for one year, but extensions are likely.
The project will be embedded in the Speech and Hearing (SpandH) research group at (http://www.dcs.shef.ac.uk/spandh) in the Department of Computer Science, and in particular the subgroup on machine intelligence for natural interfaces (MINI). SpandH is amongst the largest speech research groups in the UK, with extensive infrastructure and a vibrant research environment. The group is well known internationally for its research, which reaches across traditional divides to encompass and link computational hearing, speech perception and speech technology. The MINI subgroup is led by Prof. Hain, currently has 13 members, and is amongst other things well known for speech recognition and classification. It has had systems with best performance in international competitions that are available to the public at www.webasr.org. The subgroup is currently involved in many projects, including an EPSRC programme grant (with Univ. of Cambridge, Univ. of Edinburgh), research organisations (e.g. Idiap, NICT), and Industry (e.g. Cisco, Google). It has its own extensive computing infrastructure, access to large quantities of data, as well as dedicated recording facilities.

The Department of Computer Science, which is a member of the Faculty of Engineering, was established in 1982 and has since attained an international reputation for its research and teaching. Currently there are over 100 members of staff in Computer Science, including 35 Academics. The Department has an international reputation for the quality of its research, and was awarded grade 5 in the 2001 research assessment exercise, and in the 2008 exercise, 65% of our research was rated world leading or internationally excellent in terms of its originality, significance and rigor.

If you would like to know more about this position, please contact Prof. Thomas Hain - t.hain@dcs.shef.ac.uk.

In order to apply, the best option is to visit jobs.ac.uk and then press the 'Apply' button on the page:

http://www.jobs.ac.uk/job/AFU678/research-associate/

The University of Shefield JOB ID is UOS005891.

Back

Top

6-28

(2013-02-10) Developer for Large-Scale Audio Indexing Technologies at IRCAM, Paris

Position: Developer for Large-Scale Audio Indexing Technologies: 1 W/M position at IRCAM

Starting: March 2013

Duration: 18 months

Position description

He/she will be in charge of the development of a framework for scalable storage, management and access of distributed data (audio and meta-data). He/she will be also in charge of the development of scalable search algorithms.

Required profile:

· High skill in database management systems

· High skill in scalable indexing technologies (hash-table, m-trees …)

· High skill in C++ development (including template-based meta-programming)

· Good knowledge of Linux, Mac OSX and Windows development environment (gcc, Intel and MSVC,  svn)

· High productivity, methodical work, excellent programming style.

The developer will collaborate with the project team and participate in the project activities (evaluation of technologies, meetings, specifications, reports).

Introduction to IRCAM

IRCAM is a leading non-profit organization associated to Centre Pompidou, dedicated to music production, R&D and education in sound and music technologies. It hosts composers, researchers and students from many countries cooperating in contemporary music production, scientific and applied research. The main topics addressed in its R&D department include acoustics, audio signal processing, computer music, interaction technologies, and musicology. Ircam is located in the center of Paris near the Centre Pompidou, at 1, Place Igor Stravinsky 75004 Paris.

Salary

According to background and experience

Applications

Please send an application letter together with your resume and any suitable information addressing the above issues preferably by email to: peeters_at_ircam_dot_fr with cc to vinet_a_t_ircam_dot_fr, roebel_at_ircam_dot_fr.

Back

Top

6-29

(2013-02-14) INRIA PhD fellowship, Bordeaux, F

Proposal for an INRIA PhD fellowship (Cordi-S)

Title of the proposal: Nonlinear speech analysis for differential diagnosis between Parkinson's disease and Multiple-System Atrophy

Project Team INRIA: GeoStat (http://geostat.bordeaux.inria.fr/)

Author of the proposal research subject: Khalid Daoudi (khalid.daoudi@inria.fr)

Keywords: speech processing, nonlinear speech analysis, machine learning, voice pathology,

dysphonia, dysarthria, Multiple-System Atrophy, Parkinson's disease.

Scientific context:

Parkinson's disease (PD) is the most common neurodegenerative disorder after Alzheimer's disease.

Prevalence is 1.5% of the population over age 65 and affects about 143,000 French. Given the aging of the population, the prevalence is likely to increase over the next decade.

Multiple-System Atrophy (MSA) is a rare and sporadic neurodegenerative adult disorder, of

progressive evolution and of unknown etiology. The MSA has a prevalence of 2 to 5/100 000 and has no effective treatment. It usually starts in the 6th decade and there is a slight male predominance. It takes 3 years on average from the first signs of the disease for a patient to require a walking aid, 4-6 years to be in a wheelchair and about 8 years to be bedridden.

The PD and MSA require different treatment and support. However, the differential diagnosis between PD and MSA is a very difficult task because, at the early stage of the diseases, patients look alike as long as signs, such as dysautonomia, are not more clearly installed for MSA patients. There is currently no valid clinical nor biological marker for clear distinction between the two diseases at an early stage.

Goal:

Voice and speech disorders in Parkinson's disease is a clinical marker that coincides with a motor

disability and the onset of cognitive impairment. Terminology commonly used to describe these

disorders is dysarthria [1].

Like PD patients, depending on areas of the brain that are damaged, people with AMS may also have speech disorders: difficulties of articulation, staccato rhythm, squeaky or muted voice. Dysarthria in AMS is more severe and early in the sense that it requires more early rehabilitation compared to PD.

Since dysarthria is an early symptom of both diseases, the purpose of this thesis is to use dysarthria,

through digital processing of voice recordings of patients as a mean for objective discrimination

between PD and MSA. The ultimate goal is to develop a numerical dysarthria measure, based on the

analysis of the speech signal of the patients, which allows objective discrimination between PD and

MSA and would thus complement the tools currently available to neurologists in the differential

diagnosis of the two diseases.

Project:

Pathological voices, such as in PD and MSA, generally present high non-linearity and turbulence.

Nonlinear/turbulent phenomena are not naturally suited to linear signal processing. The latter is

however ruling over current speech technology. Thus, from the methodological point of view, the goal of this thesis is to investigate the framework of nonlinear and turbulent systems, which is better suited to analyzing the range of nonlinear and turbulent phenomena observed in pathological voices in general [2], and in PD and MSA voices in particular. We will adopt an approach based on novel nonlinear speech analysis algorithms recently developed in the GeoStat team [3]. The goal being to extract relevant speech features to design new dysarthria measures that enable accurate discrimination between

PD and MSA voices. This will also require investigation of machine learning theory in order to develop robust classifiers (to discriminate between PD and MSA voices) and to make correspondence (regression) between speech measures and standard clinical rates.

The PhD candidate will actively participate, in coordination with neurologists from the Parkinson's

Center of Haut-Lévêque Hospital, to set up the experimental protocol and data collection. The latter

will consist in recording patient's voices using DIANA or EVA2 workstation (http://www.sqlab.fr/).

References:

[1] Auzou, P.; Rolland, V.; Pinto, S., Ozsancak C. (eds.). Les dysarthries. Editions Solal. 2007.

[2] Baghai-Ravary L. ; Beet S.W. Automatic Speech Signal Analysis for Clinical Diagnosis and Assessment of Speech Disorders. Springer 2013.

[3] PhD thesis of Vahid Khanagha. GeoStat team, INRIA Bordeaux-Sud Ouest. January 2013.

http://geostat.bordeaux.inria.fr/images/vahid%20khanagha%204737.pdf

Advisor: K. Daoudi

Duration: 3 years (starting fall 2013)

Prerequisites: Good level in signal/speech processing is necessary, as well as Matlab and C/C++

programing. Knowledge in machine learning would be a strong advantage.

Back

Top

6-30

(2013-02-19) Ph.D Student in Speech and Music Communication , KTH Stockholm Sweden

Ph.D Student in Speech and Music Communication

KTH School of Computer Science and Communication (CSC) announces a PhD position in Speech and Music Communication.

The Workplace

KTH in Stockholm is the largest and oldest technical university in Sweden. No less than one-third of Sweden’s technical research and engineering education capacity at university level is provided by KTH. Education and research spans from natural sciences to all branches of engineering and includes architecture, industrial management and urban planning. There are a total of just over 15,000 first and second level students and more than 1,600 doctoral students. KTH has almost 4,300 employees.

KTH Computer Science and Communication is one of the most outstanding research and teaching environments in Information Technology in Sweden with activities at KTH and partly at Stockholm University. We conduct education and research in theoretical computer science, from theory building and analysis of mathematical models to algorithm construction, implementation and simulation. The applied computer science research and education dealing with computer vision, robotics, machine learning, computational biology, neuroinformatics and neural networks, including high performance computing, visualization and speech and music communication. It also conducts applied research and training in media technology, human-computer interaction, interaction design and sustainable development.

For more information about CSC, go to www.kth.se/csc.

Assignment

KTH School of Computer Science and Communication (CSC) announces a PhD position in Speech and Music Communication at the Department of Speech, Music and Hearing. The thesis work will be directed towards basic research on simulating the human voice through advanced computer models. It comprises theoretical as well as experimental studies of speech production.

For more information about the research project: http://www.speech.kth.se/eunison

This is a four-year time-limited position that can be extended up to a year with the inclusion of a maximum of 20% departmental duties, usually teaching. Doctoral students must be registered at KTH. Expected starting date: 2013-09-02.

Employment

Form of employment: Time-limited Work time: Full time The salary follows the directions provided by KTH Start date: According to agreement, preferably 2013-09-02. Number of positions: 1

Qualifications

The applicant should, at the time of application or no later than the expected starting date, possess a Master of Science degree in computer science, electrical engineering or engineering physics, or the equivalent.

In addition, thorough knowledge in several of the following areas are required: programming, phonetics or speech technology, statistical methods, computer simulations and multi-physics. The applicant should demonstrate a high proficiency in both written and spoken English.

Applicants must be strongly motivated for doctoral studies, possess the ability to work independently and perform critical analysis as well as possess good levels of cooperative and communicative abilities.

Application

Application deadline: March 22, 2013 Employer's reference: D-2013-0107

Applications via email are to be sent to: Camilla Johansson, e-mail jobs@csc.kth.se.

Write reference number in the email subject. (CV, etc. should be sent as an attachment, as pdf-files.)

We also accept hard copy applications sent to:

KTH, CSC Att. Camilla Johansson, Lindstedtsvägen 3, 4th floor, SE-100 44 Stockholm, Sweden

Application

The application must be written in English and contain:

Cover letter: A summary of the application that describes the particular merits that make the applicant suitable for the open position. Maximum one (1) page.

 Curriculum Vitae

Official record of transcripts and copy of degree certificate. The documents should be in English or be accompanied by an authorized translation to English.

References: Please provide detailed contact information to at least two (2) references.

Statement of purpose: Please discuss your research interests and motivation for carrying out PhD studies at KTH, and how this can be demonstrated in your earlier experiences (studies, technical knowledge, other assignments etc.). Maximum two (2) pages.

Summary of relevant publications: If existing, provide a list of publications, sorted according to relevance for this position. The list should for each publication give a short summary and a web-link to the full text.

Applicants are kindly ask to also fill in the form at http://www.speech.kth.se/eunison/phd-applicants.html

We are currently gathering information to help improve our recruitment process. We would, therefore, be very grateful if you could include an answer to the following question within your application: Where did you initially come across this job advertisement?

Contact(s)

For enquiries about Ph.D studies and employment conditions please contact:

Eva-Lena Åkerman, HR Manager Phone: +46 8 790 91 06 Email: ela@csc.kth.se

For enquiries about the project please contact:

Olov Engwall, Professor in Speech Communication Telephone: +468-790 75 35 e-mail: engwall@kth.se

Union representative

Lars Abrahamsson, SACO Phone: +46 8 790 7058 Email: lars.abrahamsson@ee.kth.se

Back

Top

6-31

(2013-02-19) Poste de Maître de conférences en informatique est ouvert à l'Université Paris-Sud, à destination de l'IUT d'Orsay. France

Un poste de Maître de conférences en informatique est ouvert au concours à l'Université Paris-Sud, à destination de l'IUT d'Orsay.

Le traitement de la parole (de l'interaction vocale à l'indexation de documents multimédia) fait partie des thématiques de recherche prioritaires du poste ; ce sujet est développé au LIMSI dans le groupe TLP (voir aussi http://www.limsi.fr/tlp/postes13.html).

Pour consulter la fiche détaillée du poste :

http://www.u-psud.fr/_attachments/enseignants_chercheurs/Fiche%2520emploi%252027%2520MCF%25201905x.pdf?download=true

et les modalités pour le dossier de candidature (à déposer avant le 28 mars 2013) :

http://www.u-psud.fr/fr/recrutement/enseignants/enseignants_chercheurs.html

Back

Top

6-32

(2013-02-25) Post-doc de durée de 12 mois, INRIA-LORIA, Nancy, France

Dans le cadre du projet ANR ContNomina(2013-2016), nous proposons

un post-doc de durée de 12 mois, financé par ce projet :

Détection des noms propres dans des transcriptions automatiques de la parole en Français

Bien que la reconnaissance d’entités nommées en anglais offre des performances excellentes, ce n’est pas le cas pour les autres langues, dans les domaines d’application avec peu de données d’apprentissage, et sur des transcriptions automatiques. Le travail demandé est donc de proposer des solutions permettant de : (i) exploiter les mesures de confiance du système de reconnaissance et le contexte lexical afin de localiser les noms propres, qu’ils soient présents ou non dans le lexique de transcription ; et (ii) pallier l’absence ou à la rareté des données d’apprentissage, ce qui donnera au système développé un caractère incrémental et auto-adaptatif très important pour envisager une valorisation à long terme des résultats, au-delà de la durée du projet lui-même. Les modèles bayésiens génératifs forment un cadre théorique intéressant à explorer pour résoudre ce défi.

Personnes à contacter :

Irina Illina, Responsable du projet ANR ContNomina, INRIA-LORIA, Nancy, équipe Parole, tel 03 54 95 84 90, illina@loria.fr

Christophe Cerisara, INRIA-LORIA, Nancy, équipe Sinalp, tel 03 54 95 86 25, cerisara@loria.fr

Back

Top

6-33

(2013-02-21) 4 positions for doctoral students, University of Gothenburg, Sweden

My department is now announcing four (not much, but still) positions for doctoral students. The positions mean four years fully financed (full time salary on the order of € 2500 per month), office space, computer and other technical resources necessary to do the job. You will find more information here:

http://www.flov.gu.se/english/education/doctoral-studies-third-cycle/admission/?languageId=100001&contentId=-1&disableRedirect=true&returnUrl=http%3A%2F%2Fwww.flov.gu.se%2Futbildning%2Fforskarniva%2Fansokan%2F

I would greatly appreciate if you could make this opportunity known among your students.

Anders Eriksson, MSc, PhD.
Professor of Phonetics
Department of Philosophy, Linguistics and Theory of Science
University of Gothenburg
Box 200, SE-405 30 Gothenburg, SWEDEN
Web: http://www.ling.gu.se/~anders <http://www.ling.gu.se/%7Eanders>

Back

Top

6-34

(2013-02-22) Post-doc at François Rabelais University, Tours (France)

Job offer: Post-doc
    A post-doctoral position in linguistics is available at François     Rabelais University, Tours (France), to work with the linguists in     the Inserm Research Unit 930 Imagerie & cerveau on a research     project funded by the French National Research Agency, BiLaD     (Développement du langage bilingue : Enfants à développement typique     et enfants avec troubles du langage).
    Starting date: September 1st, 2013 for a duration of 12 months     (renewable for a total of 24 months).
    To apply, send a CV + names and contact information for two     recommenders to Laurie Tuller (tuller@univ-tours.fr).
    Responsibilities: Organization, processing, and analysis of     linguistic and psycholinguistic data from experimental and     standardized tasks (verbal and non-verbal).
    Required training and skills: PhD in linguistics or related field     (psychology, cognitive science) with specialty in language     acquisition. Solid experience in transcribing linguistic data, and     in use of data entry and processing software (Excel, SPSS, PRAAT).
    Other skills: Experience with children with SLI, familiarity with     the field of speech-language pathology/therapy and bilingualism.     Ability to work in French and in English required; mastery of     Turkish or Portuguese would be an asset.
    Deadline : May 15, 2013.

Back

Top

6-35

(2013-02-22) Researcher/PostDoc/Intern positions at Telfonica Research Barcelona, Spain

We are seeking candidates for Researcher/PostDoc/Intern positions to

strengthen and complement our efforts in the areas we currently work

on:

- Distributed systems and networking

- Human&computer interaction

- Mobile computing

- Multimedia analysis

- Speech processing

- Recommender systems

- User modeling and machine learning

- Security and privacy

The Telefonica Digital Research group was created in 2006 and follows

an open research model in collaboration with Universities and other

research institutions, promoting the dissemination of scientific

results both through publications in top-tier peer-reviewed

international journals and conferences and technology transfer. Our

multi-disciplinary and international team comprises more than 20 full

time researchers, holding PhD degrees in various disciplines of

computer science and electrical engineering.

The salaries we offer are competitive and will depend upon the

candidate's experience. We also offer great benefits and a stimulating

and friendly working atmosphere in one of the most vibrant cities in

the world, Barcelona (Spain).

You can find more information about the group here:

http://www.tid.es/en/Research/Pages/TIDResearchHome.aspx

To apply for a position at Telefonica Research Barcelona, please send

an e-mail with your cv and research statement to:

careers_research@tid.es

Applications submitted by March 10, 2013 will receive full

consideration, although we will continue to accept applications after

this date until all positions are filled up.

Back

Top

6-36

(2013-03-01) Maître de Conférences en Informatique: Traduction automatique statistique, Le Mans, France

Poste de Maître de Conférences en Informatique
Profil 'Traduction automatique statistique'
Université du Maine, Le Mans
Référence GALAXIE : 4055

Profil recherche :

Le candidat doit avoir des connaissances approfondies dans au moins un des domaines suivantes : la traduction automatique statistique, le traitement automatique de la langue, l'apprentissage automatique. Une expérience dans la construction de grands systèmes de traduction avec l'outil Moses est particulièrement bienvenue.

Les candidats sont invités à se manifester auprès de :
Holger Schwenk : holger.schwenk@lium.univ-lemans.fr

Lieu(x) d’exercice : LIUM, Université du Maine, site du Mans
Nom directeur labo : Yannick Estève
Tel directeur labo : 02 43 83 3874
Email directeur labo : yannick.esteve@lium.univ-lemans.fr
URL labo : http://www-lium.univ-lemans.fr

Descriptif du laboratoire :
Le Laboratoire d'Informatique de l'Université du Maine (LIUM - EA 4023), créé il y a environ vingt-cinq ans, regroupe la plupart des enseignants-chercheurs en informatique de l’Université du Maine. Il comprend actuellement environ 50 personnes : 20 enseignants-chercheurs, 8 post-doctorants, 13 doctorants et 4 BIATOSS. Le LIUM est composé de deux équipes : une équipe de neuf enseignants-chercheurs permanents spécialisés en Environnements informatiques pour l'apprentissage humain (Ingénierie des EIAH), dirigée par Christophe Choquet ; une équipe de onze enseignants-chercheurs spécialisés en reconnaissance de la parole et traduction automatique (Language and Speech Technology LST), dirigée par Paul Deléglise.

Profil enseignement :

Le candidat devra être capable d'enseigner en français les matières fondamentales de l'informatique (algorithmique, programmation orientée objet, réseaux, bases de données, génie logiciel, etc).

Nom du responsable du Département (enseignement) :
Christophe Després Christophe.Despres@univ-lemans.fr

Back

Top

6-37

(2013-03-02) Researcher/PostDoc/Intern positions at Telefonica, Barcelona, Spain

We are seeking candidates for Researcher/PostDoc/Intern positions to

strengthen and complement our efforts in the areas we currently work

on:

- Distributed systems and networking

- Human&computer interaction

- Mobile computing

- Multimedia analysis

- Speech processing

- Recommender systems

- User modeling and machine learning

- Security and privacy

The Telefonica Digital Research group was created in 2006 and follows

an open research model in collaboration with Universities and other

research institutions, promoting the dissemination of scientific

results both through publications in top-tier peer-reviewed

international journals and conferences and technology transfer. Our

multi-disciplinary and international team comprises more than 20 full

time researchers, holding PhD degrees in various disciplines of

computer science and electrical engineering.

The salaries we offer are competitive and will depend upon the

candidate's experience. We also offer great benefits and a stimulating

and friendly working atmosphere in one of the most vibrant cities in

the world, Barcelona (Spain).

You can find more information about the group here:

http://www.tid.es/en/Research/Pages/TIDResearchHome.aspx

To apply for a position at Telefonica Research Barcelona, please send

an e-mail with your cv and research statement to:

careers_research@tid.es

Applications submitted by March 10, 2013 will receive full

consideration, although we will continue to accept applications after

this date until all positions are filled up.

Back

Top

6-38

(2013-03-03) Six positions at Nuance, Vienna, Austria

Nuance Healthcare, a division of Nuance Communications, is the market leader in providing clinical understanding solutions that accurately capture and transform the patient story into meaningful, actionable information. Thousands of hospitals, providers and payers worldwide trust Nuance speech-enabled clinical documentation and analytics solutions to facilitate smarter, more efficient decisions across the healthcare enterprise. These solutions are proven to increase clinician satisfaction and HIT adoption, supporting organizations to achieve Meaningful Use of EHR systems and transform to the accountable care model. Nuance Healthcare has been recognized as “Best-in-KLAS” 2004-2012 for Speech Recognition.

Research Scientist (m/f) – Computational Linguist NLP

Preferred Vienna / Austria

As a Research Scientist - Computational Linguist - you will be part of the Healthcare Automatic Speech Recognition Research team. You will work on research and development of algorithms, resources and methods to support data collection and improve accuracy of Nuance healthcare products.

Your Task.

Development of methods, algorithms, resources and tools in the area of natural language processing (NLP) and computational linguistics
Experimental and theoretical analysis of computational linguistics and NLP related problems
Following academic development in speech recognition area, attending conferences and writing scientific papers (if applicable)

Your Profil.

PhD or Master degree in (computational) linguistics, engineering sciences, mathematics or physics
Knowledge in machine learning methods (statistical, rule-based)
Analytical and problem-solving skills
Computer programming (C++ and/or scripting languages)
Team player but also able to work on his own initiative
Willing to learn
English language

Preferred

Experience (industrial or academic) in NLP or computational linguistics
Knowledge in automatic speech recognition
Linguistic knowledge

Our offer.

We offer a competitive compensation package and a casual yet technically challenging work environment. Join our dynamic, entrepreneurial team and become part of our fast growing track of continuing success. Nuance is an Equal Opportunity Employer.

Innovative products
full time & permanent employment
International projects and international teams
Development within Nuance organization

Does Nuance speak to you?

Please apply via our Recruiting tool on our homepage https://jobs-nuance.icims.com/jobs/9447/job or via EMEAjobs@nuance.com reference number 9447-Research Scientist - Comp Linguist NLP. Please provide CV, supporting documents and letter of motivation including preferred country, start date and salary expectations.

For more information visit us www.nuance.com.

°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°

Research Scientist (m/f) – Language Modeling

Preferred Vienna / Austria

As a Research Scientist you will be part of the Healthcare Automatic Speech Recognition Research team. You will work on research and development of ASR algorithms, resources and methods to improve accuracy and performance of Nuance healthcare products.

Your Task.

Work in language modeling and grammar design area
Development of training and adaptation tools and algorithms
Performing speech recognition (computer) experiments and studies
Experimental and theoretical analysis of speech recognition problems
Following academic development in speech recognition area, attending conferences and writing scientific papers (if applicable)

Your Profil.

PhD or Master degree in engineering sciences, mathematics or physics
Knowledge in automatic speech recognition methods and algorithms
Analytical and problem-solving skills
Statistical data analysis
Scripting languages in Unix/Linux environment (e.g. Perl, Python, Bash)
Team player but also able to work on his own initiative
Willing to learn
English language

Preferred

Experience (industrial or academic) in language modeling area
Statistical pattern recognition
Linguistic knowledge

Our offer.

Innovative products
full time & permanent employment
International projects and international teams
Development within Nuance organization

Does Nuance speak to you?

Please apply via our Recruiting tool on our homepage https://jobs-nuance.icims.com/jobs/9446/job or via EMEAjobs@nuance.com reference number 9446-Research Scientist - LM. Please provide CV, supporting documents and letter of motivation including preferred country, start date and salary expectations.

For more information visit us www.nuance.com.

°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°

Speech Scientist (m/f) - Linguist

Preferred Vienna / Austria

As a Speech Scientist - (Computer) Linguist - you will be part of the Healthcare Automatic Speech Recognition Research team. You will work on research and development of resources, algorithms and methods to improve accuracy and performance of Nuance healthcare products.

Your Task.

Development of rules (grammars), lexicons and statistic models for speech recognition
Performing speech recognition (computer) experiments and studies
Following academic development in speech recognition area, attending conferences and writing scientific papers (if applicable)

Your Profil.

PhD or Master degree in linguistic, machine learning, computer linguistic or in a related field
(Computer) linguistic and/or machine learning knowledge
Scripting languages (e.g. Perl and/or Python)
Analytical and problem-solving skills
Team player but also able to work on his own initiative
Communication / coordination skills
Willing to learn
English language

Preferred

Experience in the area of automatic speech recognition
Knowledge of context independent grammars
Working in Unix/Linux and Windows environment

Our offer.

Innovative products
full time & permanent employment
International projects and international teams
Development within Nuance organization

Does Nuance speak to you?

Please apply via our Recruiting tool on our homepage https://jobs-nuance.icims.com/jobs/9445/job or via EMEAjobs@nuance.com reference number 9445-Speech Scientist Linguist. Please provide CV, supporting documents and letter of motivation including preferred country, start date and salary expectations.

For more information visit us www.nuance.com.

°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°

Research Scientist (m/f) – Acoustic Modeling

Preferred Vienna / Austria

Your Task.

Work in acoustic modeling area
Development of training and adaptation tools and algorithms
Performing speech recognition (computer) experiments and studies
Experimental and theoretical analysis of speech recognition problems
Following academic development in speech recognition area, attending conferences and writing scientific papers (if applicable)

Your Profil.

PhD or Master Degree in engineering sciences, mathematics or physics
Knowledge in automatic speech recognition methods and algorithms
Analytical and problem-solving skills
Statistical data analysis
Scripting languages in Unix/Linux environment (e.g. Perl, Python, Bash)
Team player but also able to work on his own initiative
Willing to learn
English language

Preferred

Experience (industrial or academic) in acoustic modeling area
Statistical pattern recognition
Signal processing

Our offer.

Innovative products
full time & permanent employment
International projects and international teams
Development within Nuance organization

Does Nuance speak to you?

Please apply via our Recruiting tool on our homepage https://jobs-nuance.icims.com/jobs/9444/job or via EMEAjobs@nuance.com reference number 9444-Research Scientist - AM. Please provide CV, supporting documents and letter of motivation including preferred country, start date and salary expectations.

For more information visit us www.nuance.com.

°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°°

Research Scientist (m/f) - Acoustic Modeling

Preferred Vienna / Austria

As a Research Scientist you will be part of the Healthcare Automatic Speech Recognition Research team. You will work on research and development in area of acoustic modeling to improve accuracy and performance of Nuance healthcare products.

Your Task.

Work in acoustic modeling area (e.g. creation of acoustic models for various languages)
Performing speech recognition (computer) experiments and studies
Experimental and theoretical analysis of speech recognition problems
Following academic development in speech recognition area, attending conferences and writing scientific papers (if applicable)

Your Profil.

PhD or Master degree in engineering sciences, mathematics or physics
Knowledge in automatic speech recognition methods and algorithms
Analytical and problem-solving skills
Statistical data analysis
Scripting languages in Unix/Linux environment (e.g. Perl, Python, Bash)
Team player but also able to work on his own initiative
Willing to learn
English language

Preferred

Experience (industrial or academic) in acoustic modeling area
Statistical pattern recognition
Signal processing

Our offer.

Innovative products
full time & permanent employment
International projects and international teams
Development within Nuance organization

Does Nuance speak to you?

Please apply via our Recruiting tool on our homepage https://jobs-nuance.icims.com/jobs/9443/job or via EMEAjobs@nuance.com reference number 9443-Research Scientist AM. Please provide CV, supporting documents and letter of motivation including preferred country, start date and salary expectations.

For more information visit us www.nuance.com.

Research Scientist (m/f) – Automatic Speech Recognition

Preferred Vienna / Austria

Your Task.

Application specific speech recognition research
Tuning and optimization of speech recognition algorithms
Software implementation
Analysis of application specific speech recognition issues
General expertize in speech recognition area

Your Profil.

PhD or Master degree in engineering sciences, mathematics or physics
Analytical and problem-solving skills
Statistical data analysis
Linux OS including bash
Team player but also able to work on his own initiative
Willing to learn
English language
Either

strong background in the area of automatic speech recognition
some experience in software development

strong background in software development
some experience in the area of automatic speech recognition

Preferred Skills:

Experience with an object-oriented programming language
Experience with scripting languages like Perl and Python
Statistical pattern recognition

Our offer.

Innovative products
full time & permanent employment
International projects and international teams
Development within Nuance organization

Does Nuance speak to you?

Please apply via our Recruiting tool on our homepage https://jobs-nuance.icims.com/jobs/8047/job or via EMEAjobs@nuance.com reference number 8047-Research Scientist - ASR. Please provide CV, supporting documents and letter of motivation including preferred country, start date and salary expectations.

For more information visit us www.nuance.com.

ILONA ALEXANDRA HOLTZ

Recruiter - Employment Specialist DACH

Human Resources

Nuance Communications Deutschland GmbH

Site Ulm

Soeflingerstr. 100

D-89077 Ulm, Germany

Fon +49 731 - 379 50 1166

Fax +49 731 - 379 50 1106 (Zentrale)

Mobil +49 170 56 15 235

WWW.NUANCE.COM The experience speaks for itself ™

Geschäftsführung/Director: Caroline Curtis, Todd Michael DuChene, Thomas L. Beaudoin

Sitz der Gesellschaft/Registered Office: Aachen

Registergericht/Court of Registration: Aachen

Reg. Nr.: HRB 16313

USt-ID/VAT: DE 264500438

This electronic transmission and any files transmitted with it are confidential. It is transmitted for the sole use of the person(s) to whom it is addressed. Any further distribution or copying is prohibited. If you receive this message in error, please inform the sender immediately, do not use it or disclose its contents and delete it from your system. Please note that Nuance cannot guarantee that the transmission will be secure or error-free.

Experience Nuance in the web:

DragonDrive!

http://www.youtube.com/watch?v=wYIwVP2JqL4

DragonNaturallySpeaking - Say It Your Way!

http://www.youtube.com/watch?v=RkiYr8aw5pE

or meet me on XING or LinkedIn

Back

Top

6-39

(2013-03-13) Speech Data Evaluator for French at Google Dublin Ireland

Speech Data Evaluator for French

Job title:

Speech Data Evaluator for French (multiple positions)

In Dublin.

Job description:

As a Speech Data Evaluator and a native speaker of French, you will be part of a team based in Dublin, processing large amounts of linguistic data and carrying out a number of tasks to improve the quality of Google’s speech synthesis and speech recognition in your own language.

This includes:

annotating and classifying linguistic data
labeling text for disambiguation, expansion, and text normalization
providing phonetic transcription of lexicon entries according to given standards and using in-house tools

Job requirements:

native speaker of French (with good command of the standard dialect) and fluent in English
computer-literate (should feel comfortable using in-house tools)
attention to detail
good knowledge of orthography and grammar in French
passion for language and a keen interest in technology
good organizational skills
a degree in a language-related field such as linguistics, language teaching, translation, editing, writing, proofreading, or similar

Project duration: 6-9 months (with potential for extension)

For immediate consideration, please email your CV and cover letter in English (PDF format preferred) with 'Speech Data Evaluator French' in the subject line.

Email Address for applications: DataOpsMan@gmail.com

Contact information: Linne Ha

Closing date: open until filled

Back

Top

6-40

(2013-03-15) Offre de post-doc en Traitement Automatique du Langage au LIUM Le Mans France

Offre de post-doc au LIUM
Traitement Automatique du Langage
–
Correction de sorties d’OCR, traduction automatique statistique, modélisation du langage.
Offre de postdoc au sein du laboratoire d’Informatique de l’Université du Maine (LIUM) dans le domaine de la correction orthographique par méthodes de traduction automatique statistique.
Résumé de l’offre – Thématiques : traitement du langage naturel, application à la correction d’OCR et traduction automatique statistique. – Lieu : LIUM (Le Mans), équipe LST (http ://www-lium.univ-lemans.fr/). – Période : disponible dès maintenant pour une durée d’un an renouvelable.
Contexte Ce postdoc s’inscrit dans le projet PACTE ('investissement d’avenir'), porté par l’entreprise Diadeis, et dont sont également partenaires l’équipe Alpage (INRIA et Paris 7), et les entreprises A2ia et Isako. PACTE a pour objectif l’amélioration de la qualité orthographique des textes issus de différentes méthodes de capture textuelle. L’accent est mis sur les sorties d’OCR (reconnaissance optique de caractères) sur des textes imprimés scannés, mais concerne également des données obtenues par reconnaissance d’écriture manuscrite, par saisie manuelle, et par rédaction directe. Les techniques qui seront utilisées sont à la fois statistiques et hybrides, faisant usage d’outils et de ressources de linguistique computationnelle.
Objectifs Vérification et correction des sorties d’OCR par des méthodes de modélisation statistique du langage. Les systèmes OCR utilisés n’exploitent pas ou peu de connaissance sur la langue. L’objectif est d’exploiter la modélisation de la langue afin combler ce manque.
Utilisation de la traduction automatique statistique pour la correction d’erreurs des sorties d’OCR. La correction des sorties d’OCR peut être vue comme une tâche de traduction d’un texte erroné vers un texte correct. Dans le cadre de l’OCR, le paradigme de traduction doit être adapté afin de prendre en compte les spécificités de la tâche.
Le cadre applicatif de ce travail est assez exceptionnel, avec l’exploitation d’une grande quantité de données issue notamment du Bureau Européen des Brevets (EPO - European Patent Office) et du Journal Officiel de l’Union Européenne.
Profil recherché – Compétences en informatique : environnement Linux, C++, scripting, etc. ; – Connaissances en apprentissage automatique, linguistique computationelle. – Une expérience en traduction automatique statistique est un plus. Le postdoc se déroulera au sein de l’équipe LST du LIUM. Le LIUM est connu au niveau international pour ses recherches dans le domaine de la traduction automatique statistique, et possède de nombreuses collaborations avec des universités et entreprises en Europe et aux États-Unis.
Contacts Envoyer une lettre de motivation et un CV montrant vos compétences pour ce poste aux adresses suivantes : Loïc Barrault : loic.barrault@lium.univ-lemans.fr Holger Schwenk : holger.schwenk@lium.univ-lemans.fr

Back

Top

6-41

(2013-03-20) Post-doc job proposal, LIMSI, France

Postdoctoral position at LIMSI, France
Expressive Vocal Signal Analysis and Modelling

Content
During the post-production of a movie, as well as for video games, digital double rendering techniques are used to modify the actor’s performance without the need of playing the scene again. This project is aim toward characterising and reproducing the actor’s vocal personality. By creating a digital double of an actor, it will be possible to create new auditory scenes, as well as dubbing a movie – preserving the actor’s voice, habits and vocal personality.
We thus aim at characterising the expressive space of a given speaker, mainly in the prosodic domain, but also for its vocal quality and articulation peculiarities.
This postdoctoral position thus consist in implementing a set of voice analysis methods in order to observe the variations of vocal source and articulation in a given set of expressive performances. Databases presenting variations of vocal effort and expressive speech are already available and labelled. They will be analysed in order to model the expressive changes in tem of variation of the vocal signal. These models will be confronted to high-level descriptions of the expressive content of the databases.

Required competences
This research requires sound knowledge of signal processing applied to speech analysis. Skills in phonetic and/or linguistic are valuable.
Teamwork skills, and deadline oriented environment is mandatory.
Candidate with signal processing, computational linguistic or acoustic phonetic background will be taken into consideration.

Research team
This position is part of the French FUI ADN T-R project. It takes place at LIMSI-CNRS (www.limsi.fr) in the Audio & Acoustic group, in collaboration with partners from the industry.
LIMSI-CNRS is a research lab located in the University Paris-Sud campus, at Orsay, south of Paris. LIMSI is internationally renown for its works on speech processing. The Audio & Acoustic group focuses more specifically on speech analysis and synthesis, real-time audio processing and expressive sound.
This project will start as soon as possible (from may 2013), and is funded by the FUI agency for a maximum of 16 months. Gross monthly salary is approximately 2500€, on the basis of CNRS post-doctoral contract.

Supervision & contact:
This postdoctoral position will be supervised by Christophe d’Alessandro and Albert Rilliard.
Applications should be send to:

Back

Top

Organisation	Events	Membership	Help
> Board	> Interspeech	> Join - renew	> Sitemap
> Legal documents	> Workshops	> Membership directory	> Contact
> Logos			> FAQ
			> Privacy policy