ISCA - International Speech
Communication Association

ISCApad Archive » 2020 » ISCApad #259 » Jobs

ISCApad #259

Friday, January 10, 2020 by Chris Wellekens

6 Jobs

6-1

(2019-07-11) 3year Early Stage Researcher PhD positions

Applications are invited for a three-year Early Stage Researcher PhD positions in the

speech technology for pathological speech.

Description

The thesis focuses on studying the link between the internal representations of Deep Neural Networks (DNNs) and

the subjective representation of speech intelligibility. We propose to explore the saliency detection capabilities of

DNNs when used in a regression task for predicting speech intelligibility scores as given by human experts. By

saliency, we mean to retrieve which frequency bands are important and used by a DNN to make its predictions.

The final expectation is to identify regions of interest in the speech signal, both in time and frequency, that

characterise the level of speech impairment.

The experiments will be processed on various samples of speech performed by 150 people (100 patients and 50

healthy controls). This database was recorded within the INCA C2SI project, and contains speech from patients

treated for cancer of the oral cavity or pharynx. It contains also various metadata such as the location of the tumor,

the impairment in terms of severity and intelligibility that were appreciated by human experts, self evaluation

questionnaires on the patient’s quality of life… Various tasks were recorded such as a sustained vowel, read

speech, nonsense words, prosodic exercises, picture description, etc. There will be also the possibility to extend

the work to another corpus which is composed of voice of patients suffering from Parkinson disease.

At first, the PhD will have to take benefit from the various analysis and descriptions that were done during the C2SI

project trying to correlate the impact of the tumor and the communication ability. Those results will help attesting

the human representation of the impact of the disease. Then, a DNN representation will be modeled to fit the data,

taking care of the data sparsity. The last part of the work will be to explore the intern representation of the DNN,

trying to explore what part of the signal help to make a decision on the impact of the disease and that will be the

final goal of the thesis, studying the automatic representation that lies in the model the student will propose.

This work is funded by the TAPAS project (https://www.tapas-etn-eu.org) which is a Horizon 2020 Marie

Skłodowska-Curie Actions Initial Training Network European Training Network (MSCA-ITN-ETN) project that aims

to transform the well being of people across Europe with debilitating speech pathologies (e.g., due to stroke,

Parkinson's, etc.). These groups face communication problems that can lead to social exclusion. They are now

being further marginalised by a new wave of speech technology that is increasingly woven into everyday life but

which is not robust to atypical speech.

The supervision of the PhD will take place at IRIT laboratory by the SAMoVA team in Toulouse. SAMoVA does

research in the domain of “analysis, modeling and structuring of audiovisual content”. The application areas are

diverse: speech processing, identification of languages, speaker verification and speech and music indexing. The

researchers expertise covers novel machine learning and audio processing technologies and is now focused on

deep learning methods, leading to several publications in international conferences.

Eligibility Criteria

Early Stage Researchers (ESRs) shall, at the time of recruitment by the host organization, be in the first four

years (full-time equivalent research experience) of their research careers.

- The ESR may be a national of a Member State, of an Associated Country or of any Third Country.

- The ESR must not have resided or carried out her/his main activity (work, studies, etc.) in the country of her/his

host organization for more than 12 months in the 3 years immediately prior to her/his recruitment.

- Holds a Master’s degree or equivalent, which formally entitles to embark on a Doctorate.

- Does not hold a PhD degree.

Duration of recruitement: 36 months

Contact: Julie Mauclair (mauclair@irit.fr)

Back

Top

6-2

(2019-07-17) Chief Technical Officer (CTO) at ELDA

Chief Technical Officer (CTO)

Under the supervision of the CEO, the responsibilities of the Chief Technical Officer (CTO) include planning and supervising technical development of tools, software components or applications for language resource production and management.
He/she will be in charge of managing the current language resources production workflows and co-ordinating ELDA?s participation in R&D projects while being also hands-on whenever required by the language resource production and management team. He/she will liaise with external partners at all phases of the projects (submission to calls for proposals, building and management of project teams) within the framework of international, publicly- or privately-funded projects.

This yields excellent opportunities for creative and motivated candidates wishing to participate actively to the Language Engineering field.

Profile:
?    PhD in Computer Science, Natural Language Processing, or equivalent
?    Experience in Natural Language Processing (speech processing, data mining, machine translation, etc.)
?    Familiarity with open source and free software
?    Knowledge of a statically typed functional programming language (OCaml preferred) is a plus
?    Good level in English, with strong writing and documentation skills in English
?    Dynamic and communicative, flexible to work on different tasks in parallel
?    Ability to work independently and as part of a multidisciplinary team
?    Citizenship (or residency papers) of a European Union country
?    Good level in Python, knowledge of Django would be a plus
?    Proficiency in classic shell scripting in a Linux environment (POSIX tools, Bash, awk)

Salary: Commensurate with qualifications and experience (between 45-55K?).
Other benefits: complementary health insurance and meal vouchers

Applicants should email a cover letter addressing the points listed above together with a curriculum vitae to: job@elda.org

ELDA is acting as the distribution agency of the European Language Resources Association (ELRA). ELRA was established in February 1995, with the support of the European Commission, to promote the development and exploitation of Language Resources (LRs). Language Resources include all data necessary for language engineering, such as monolingual and multilingual lexica, text corpora, speech databases and terminology. The role of this non-profit membership Association is to promote the production of LRs, to collect and to validate them and, foremost, make them available to users. The association also gathers information on market needs and trends.

For further information about ELDA/ELRA, visit: ww.elra.info

Back

Top

6-3

(2019-07-19) Two Post-doctoral positions at Le Mans University , France

2 Post-doctoral positions at Le Mans University on Deep learning approaches speech processing

*Place of work* Le Mans University, Le Mans ? France

*Starting date* From now to June 2020

*Salary* between 2 300 and 2 600 ? /month

*Duration* 12 months and 24 months (can be combined in a 36 months position)

****************************************
1st position
****************************************

* Context *
The LST team from LIUM (Le Mans University) is focusing on autonomous system?s behavior
for the task of speaker diarization and machine translation.
The ALLIES project (European Chist-ERA collaborative project) aims at developing
evaluation protocols, metrics and scenarios for lifelong learning autonomous systems.
The goal is to enable auto-adaptable systems that can also auto-evaluate in order to
sustain their performance across time. Autonomous systems can rely on human domain
experts via active and interactive learning processes to be define within the ALLIES project.

* Missions *
Develop an autonomous system for speaker diarization by integrating lifelong learning,
active and interactive learning components. The research work will be related to some of the following topics:
- unsupervised adaptation
- unsupervised evaluation
- active learning (based on the unsupervised evaluation process, the autonomous
   system is free to require additional knowledge from the human domain expert)
- Interactive learning (a human domain expert provides specific knowledge to
   the autonomous system. This information must be taken into account by the system)
Performance will be analyzed using protocols, metrics and scenarios developed for the ALLIES project.

Participation to the ALLIES benchmarking evaluation for speaker diarization.
During the ALLIES project, LIUM is organizing two international evaluation
campaigns (one for Speaker Diarization jointly organized with Albayzin and the
second one for Machine Translation jointly with WMT)
The benchmarking evaluation will serve to validate approaches developed during the post-doc

* Dissemination*
The research will be published in the major conferences and journals

* Duration * 12 months
* Salary * 2 365,14? (after taxes)

* Start * as soon as possible, latest January 2020

* Supervisers * Anthony Larcher (anthony.larcher@univ-lemans.fr) and Loïc Barrault (loic.barrault@univ-lemans.fr)

Expected competences:
    - Phd in Machine Learning and Deep Learning
    - Experience in speech processing is positive
    - Python fluent
    - familiar with a deep learning toolkit (Pytorch, TensorFlow)

ALLIES website: https://projets-lium.univ-lemans.fr/allies/

****************************************
2nd position
****************************************

* Context *
The LST team from LIUM (Le Mans University) is focusing on evolutive end-to-end
neural networks for speaker recognition. The Extensor project (French ANR funded)
aims at developing novel architectures for end-to-end speaker recognition as well as
explaining the behavior of those networks. The focus of Extensor is threefold:
get rid of the legacy of bayesian system?s architecture and explore wider opportunities offered in deep learning;
explore real end-to-end architectures exploiting the tax signal instead of classical features (such as MFCC of filterbanks);
Develop tools for explainability in speaker recognition.

* Missions *
Develop end-to-end speaker recognition system based on state-of-the-art approaches (x-vectors, sincnet?)
Develop evolutive architectures making use of existing genetic algorithms and study their behavior.
Participate to the three hackathons organized by the Extensor project in order to develop
tools for evolutive neural network architecture and explainability for speaker recognition.
Dissemination: the research will be published in the major conferences and journals

* Duration * 24 months
* Salary * 2 600? (after taxes)

* Start * as soon as possible, latest June 2020

* Location * LIUM, Le Mans University

* Superviser * Anthony Larcher (anthony.larcher@univ-lemans.fr)

Expected competences:
    - Phd in Machine Learning and Deep Learning
    - Experience in speech processing is positive
    - Python fluent
    - familiar with a deep learning toolkit (Pytorch, TensorFlow)

Anthony Larcher Maître de Conférences, HDR / Associate Professor
Directeur de l'Institut Informatique Claude Chappe
co-responsable de la Spécialité Informatique
Responsable de l'option Interface Personnes Systèmes Tél. +33 (0)2 43 83 38 30
Avenue Olivier Messiaen, 72085 - LE MANS Cedex 09 univ-lemans.fr

Back

Top

6-4

(2019-07-20) Three-year Early Stage Researcher PhD positions, IRIT, Toulouse, France