ISCA - International Speech
Communication Association

ISCApad Archive » 2019 » ISCApad #254 » Jobs

ISCApad #254

Saturday, August 10, 2019 by Chris Wellekens

6 Jobs

6-1

(2019-02-01) Lecturer at LIMSI, Orsay, Paris, France

Le département informatique de l?UFR Sciences de l?université
Paris-Sud recrute un.e Maître de Conférences pour renforcer son équipe
pédagogique et continuer de développer la recherche au sein du LIMSI
sur les les thèmes du traitement automatique des langues et de la
parole.

Les recherches de la personne recrutée porteront en priorité sur le
développement de nouvelles méthodes en traitement automatique de la
parole, avec par exemple les thématiques suivantes: la caractérisation
du locuteur dans un contexte multimédia ; l?étude des dimensions
affectives des interactions sociales ; l?étude des systèmes de
traduction automatique et l?apprentissage artificiel ; l?étude des
systèmes de reconnaissance vocale. Le laboratoire est également ouvert
à des candidatures qui mettraient en avant d?autres thématiques
relatives au traitement automatique de la parole, ou plus largement à
l?ensemble du champ du traitement automatique des langues.

La personne recrutée pourra enseigner dans toutes les filières
relevant du département informatique de l'UFR Sciences d?Orsay, au
niveau Licence et Master (classique et en apprentissage). Elle pourra
enseigner dans ses domaines d?intérêt et dans un ou desles domaines de
l'informatique qui auront besoin d'être renforcés. La personne pourra
également dispenser une partie de ses enseignements en anglais, en
particulier dans le cadre de Masters internationaux.

Les candidat.e.s pourront obtenir des informations plus précises à
partir de la page suivante:

https://www.limsi.fr/fr/limsi-emplois/offres-de-postes-chercheurs-et-enseignants-chercheurs

Top

6-2

(2019-02-03) Senior speech scientist (acoustic modeling) at ELSA Corp. Lisbon, Portugal

Senior speech scientist (acoustic modeling) at ELSA Corp. in Lisbon, Portugal or Remote

https://elsanow.io/careers#senior-speech-scientist

Top

6-3

(2019-02-03) PhD and Postdoc positions at University of Genova, Italy

Fully funded PhD and PostDoc positions are available at the Casa Paganini -
InfoMus Research Centre (www.casapaganini.org), DIBRIS-Dept. of Informatics,
Bioengineering, Robotics, and Systems Engineering, Polytechnic School,
University of Genoa, Italy. Each research position will have a specific
focus on the development of computational models, multimodal systems and
interfaces, research experiment and prototypes in one of the following
areas: (i) automated measurement, analysis, and prediction of full-body
non-verbal individual movement qualities and emotions; (ii) automated
measurement, analysis, and prediction of full-body non-verbal social signals
(synchronization, entrainment, leadership).
Accepted candidates will develop a research plan in the framework of the
4-year (2019-2022) Horizon 2020 European Project FET Proactive EnTimeMent
(https://entimement.dibris.unige.it/ ), and may be asked to participate in
joint activities with research partners in EnTimeMent, including possible
short residencies at EnTimeMent partners' sites.

Requirements
Candidates should ideally have the following profile:
+ Master's degree in Computer Science, Computer Engineering or related
disciplines;
+ Excellent technical and programming skills (Python, Java, C/C++);
+ Prior experience in at least one of the following fields: human computer
interaction, affective computing, motion capture and motion analysis,
multimodal interfaces, sound analysis and interactive sonification, computer
vision, machine learning;
+ Ability to work independently, self-motivation, and ability to actively
contribute as a member of a multidisciplinary research team including
experts in computer science and engineering, movement science, cognitive
neuroscience, cognitive and motoric rehabilitation, performing arts;
+ Strong commitment to advancing the state-of-the-art research and
publishing in top research venues;
+ Excellent communication skills in English.

Applying
To apply, please email your application to: antonio.camurri@unige.it and
gualtiero.volpe@unige.it
The application should consist of a single pdf file including:
+ cover letter expressing your interest in the position and your profile
relevance;
+ curriculum vitae showing academic records with tracks related to the
themes of the thesis;
+ list of publications (post-doc applications only);
+ transcript of marks according to M1-M2 profile or last 3 years of
engineering or related school (PhD applications only);
+ contact and recommendation letter of at least two university referents;

As a preliminary step, candidates will be invited for a Skype interview.
Candidates may also be invited to a fully funded short research internship
in our research team during summer 2019. To be finally enrolled, candidates
will need to pass a formal evaluation performed by a selection committee
University of Genova will appoint according to the Italian laws.
The envisioned starting date for the first selected PhD candidates is
November 2019. PostDoc starting date is negotiable.

Conditions of employment
Hired on a fixed-term contract at University of Genoa, working full-time at
the Casa Paganini-InfoMus Research Centre of DIBRIS, University of Genoa,
with possible short internships at a research centre of an EnTimeMent
project partner.
Duration: three years for PhD students; 2-year contract for post-docs
(possible extensions available).

Further Information
For any question or further details please send email to
antonio.camurri@unige.it and gualtiero.volpe@unige.it

The Casa Paganini-InfoMus Research Centre at DIBRIS, Polytechnic School,
University of Genoa, Italy
As art influences science and technology, science and technology can in turn
inspire art. Recognizing this mutually beneficial relationship, researchers
at the Casa Paganini-InfoMus Research Centre work to combine scientific
research in information and communications technology (ICT) with artistic
and humanistic research.
The mission of Casa Paganini - InfoMus consists of carrying out scientific
and technological research on human-centered computing where art and
humanistic culture are a fundamental source of inspiration. The research
team includes computer engineers and experts from the human sciences and the
arts.
Scientific and technological research includes: investigation and
development of computational models and of multimodal systems focusing on
non-verbal, full-body, expressive, emotional, and social behavior
(entrainment, leadership); sound and music computing; interactive
sonification; multimodal interactive systems and serious games for
rehabilitation, entertainment, sport, edutainment, museums and cultural
institutions; multimedia systems and services for the creative industry: ICT
for active music listening, interactive dance, theatre, cultural heritage,
user-centric media and mobile systems.
The Casa Paganini - InfoMus Research Centre coordinates and participates as
partner in many international projects on scientific and technological
research, education, and develops multimedia systems, platforms, and
applications for the creative industry and cultural institutions.
www.casapaganini.org youtube.com/InfoMusLab

The EnTimeMent EU Horizon 2020 FET PROACTIVE project
EnTimeMent aims at a radical change in scientific research and enabling
technologies for human movement qualitative analysis, entrainment and
prediction, based on a novel neuro-cognitive approach of the multiple,
mutually interactive time scales characterizing human behaviour. Our
approach will afford the development of computational models for the
automated detection, measurement, and prediction of movement qualities from
behavioural signals, based on multi-layer parallel processes at non-linearly
stratified temporal dimensions, and will radically transform technology for
human movement analysis. EnTimeMent new innovative scientifically-grounded
and time-adaptive technologies operate at multiple time scales in a
multi-layered approach: motion capture and movement analysis systems will be
endowed with a completely novel functionality, achieving a novel generation
of time-aware multisensory motion perception and prediction systems. The
proposed model and technologies will be iteratively tested and refined, by
designing and performing controlled and ecological experiments, ranging from
action prediction in a controlled laboratory setting, to prediction in
dyadic and small group interaction. EnTimeMent scenarios include health
(healing and support of everyday life of persons with chronic pain and
disability), performing arts (e.g. dance), sports, and entertainment group
activities, with and without living architectures. EnTimeMent will create
and support community-building and exploitation with concrete initiatives,
including a community of users and stakeholders, innovation hubs and SME
incubators, as premises for the consolidation beyond the end of the project
in a broader range of market areas.
http://entimement.dibris.unige.it

Top

6-4

(2019-02-08) Fully funded PhD position at Graz University of Technology, Austria

Graz University of Technolgy (TU Graz) is the organizer of the INTERSPEECH 2019 conference in September 2019 and offers a PhD position in its Signal Processing and Speech Communication Laboratory. Be part of this exciting opportunity and join our team!

The position is for up to four years and involves both research and teaching commitments. Teaching will be focussed on problem classes and lab courses for fundamental subjects such as signal processing. Research will address interdisciplinary topics at the interface between automatic speech recognition and speech science. You will work on top-level publications and your PhD thesis under the joint supervision of Prof. Gernot Kubin and Dr. Barbara Schuppler. Graz University of Technology offers systematic guidance to their doctoral students in specific doctoral schools with structured programs, international cooperation opportunities, and more. All doctoral programs and more than half of our Masters' programs are taught in English. The gross salary (before taxes) for this full-time position is according to scale B1 at Austrian Universities, approximately 40.000,- EUR per year. The expected starting date is September-October 2019.

Mandatory skills of the candidates are a relevant master's degree in electrical or information engineering, computer science, or speech science; excellent programming skills; English language competence (IELTS 7.0 or higher). Expertise in signal processing and machine learning as well as knowledge of the German language are considered additional assets.

Interested candidates should send immediately the following information in PDF format to Prof. Gernot Kubin (g.kubin@ieee.org) and to the dean’s office dekanat.etit@tugraz.at: curriculum vitae, transcript of records of both Bachelor's and Master's degree courses, master's thesis and all publications, proof of English language competence, and contact information for 2 referees. The official application deadline is July 26, 2019. Additional application documents may be required in due course. Female students are particularly encouraged to apply. For more information consult

https://www.spsc.tugraz.at for the Signal Processing and Speech Communication Laboratory,
https://www.interspeech2019.org for the INTERSPEECH 2019 conference, https://www.tugraz.at/tu-graz/arbeiten-an-der-tu-graz/offene-stellen/ausgeschriebene-wissenschaftliche-stellen-ausser-professuren/#c57834 for the official job announcement in German,
https://www.tugraz.at/en/go/welcome-center for information on student life at Graz University of Technology and in the city of Graz, Austria.

Top

6-5

(2019-02-14) PhD student, Radbout University, Nijmegen, The Netherlands

PhD student “Morphology in spoken word recognition models”

Location: Radboud University, Nijmegen

Supervision: Louis ten Bosch, Mirjam Ernestus, and Ingo Plag

Starting date: September / October 2019

Duration: 4 Years (3 years, with possibility of extension of 1 year)

Salary: Around 1600 euros a month

The project is part of the project DMC: Dutch morphologically complex words: The role of morphology in speech production and comprehension of the Spoken Morphology: Phonetics and phonology of complex words DFG research unit FOR 2373. It is funded by the Deutsche Forschungsgemeinschaft.

See http://www.spoken-morphology.hhu.de/en.html

Requirements:

We are looking for candidates with

Master degrees in Linguistics, Artificial Intelligence, or Social Sciences;
Clear interest in speech and language;
Expertise in quantitative research methods, preferably including large scale data analyses or computational modeling;

Effective verbal and written communicative skills in English.

Application:

Please send your motivation letter and extensive resume (including grades for all university courses) to m.ernestus@let.ru.nl.

The deadline is 15 March 2019 or until the position is filled.

Questions:

l.tenbosch@let.ru.nl or m.ernestus@let.ru.nl

Abstract:

The PhD student will study what properties a computational model of auditory word recognition needs to have in order to well simulate human listeners’ processing of morphologically complex words. The computational models that will be considered are DIANA (e.g., ten Bosch et al., 2013, 2014, 2015) and Naïve Discriminative Learning (NDL, e.g., Arnold et al. 2017), since these two models represent very different types of processing and both can receive the speech signal, with morpho-acoustic cues, as their inputs. The human data to be simulated will include the BALDEY database (Ernestus & Cutler, 2015). The PhD student will produce a dissertation consisting of several publishable articles, preceded by a General Introduction and followed by a general discussion.

Top

6-6

(2019-02-15) Faculty position (Associate professor) at Telecom ParisTech, Paris, France

Faculty position (Associate professor) at Telecom ParisTech in

Machine-Learning.

Important Dates

? May 3, 2019: closing date

? June 3, 2019: hearings of preselected candidates

Telecom ParisTech?s [1] machine learning, statistics and signal processing group (a.k.a S²A group) [2], within the laboratoire de traitement et communication de l?information (LTCI) [5], is inviting applications for a permanent (indefinite tenure) faculty position at the *Associate Professor* level (Maitre de Conferences) in *Machine learning*.

Main missions

The recruit will be expected to:

Research activities

? Develop groundbreaking research in the field of theoretical or applied machine learning, targeting applications that are well aligned with the topics of the S²A group [3] and the Images, Data & Signals department [4], which include (and is not restricted to) sequential/reinforcement learning, multitask learning, learning for structured data (e.g. time series analysis, audio signals), natural language processing, social signal processing, predictive maintenance, biomedical or physiological signal analysis, recommendation, finance, health, ?.

? Develop both academic and industrial collaborations on the same topic, including collaborative activities with other Telecom ParisTech research departments and teams, and research contracts with industrial players

? Set up research grants and take part in national and international collaborative research projects

Teaching activities

? Participate in teaching activities at Telecom ParisTech and its partner academic institutions (as part of joint Master programs), especially in machine learning and Data science, including life-long training programs (e.g. the local Data Scientist certificate)

Impact

? Publish high quality research work in leading journals and conferences

? Be an active member of the research community (serving in scientific committees and boards, organizing seminars, workshops, special sessions...)

Candidate profile

As a minimum requirement, the successful candidate will have:

? A PhD degree

? A track record of research and publication in one or more of the following areas: machine learning, applied mathematics, signal processing,

? Experience in teaching

? Good command of English

The ideal candidate will also (optionally) have:

? Experience in temporal data analysis problems (sequence prediction, multivariate time series, probabilistic graphical models, recurrent neural networks...)

NOTE:

The candidate does *not* need to speak French to apply, just to be willing to learn the language (teaching will be mostly given in English)

Other skills expected include:

? Capacity to work in a team and develop good relationships with colleagues and peers

? Good writing and pedagogical skills

More about the position

? Place of work: Paris until 2019, then Saclay (Paris outskirts)

? For more information about being an Associate Professor at Telecom ParisTech, check [6] (in French)

How to apply

Applications are to be sent by e-mail to: recrutement@telecom-paristech.fr

The application should include:

? A complete and detailed curriculum vitae

? A letter of motivation

? A document detailing past activities of the candidate in teaching and research: the two types of activities will be described with the same level of detail and rigor.

? The texts of the main publications

? The names and addresses of two referees

? A short teaching project and a research project (maximum 3 pages)

Contacts :

Stephan Clémençon (Head of the S²A group)

Gaël Richard (Head of the IDS department)

[1] http://www.tsi.telecom-paristech.fr

[2] http://www.tsi.telecom-paristech.fr/ssa/

[3] http://www.tsi.telecom-paristech.fr/aao/en/

[4] http://www.tsi.telecom-paristech.fr/en/
[5] https://www.ltci.telecom-paristech.fr/?lang=en
[6] http://www.telecom-paristech.fr/telecom-paristech/offres-emploi-stages-theses/recrute-enseignants-chercheurs.html

Top

6-7

52019-02-16) Research engineer or post-doc position in Natural Language Processing, LORIA, Nancy, France

Research engineer or post-doc position in Natural Language Processing:
Introduction of semantic information in a speech recognition system

Supervisors: Irina Illina, MdC, Dominique Fohr, CR CNRS

Team: Multispeech, LORIA-INRIA (https://team.inria.fr/multispeech/)

Contact: illina@loria.fr, dominique.fohr@loria.fr

Duration: 12-15 months

Deadline to apply : March 15th, 2019

Required skills: Strong background in mathematics, machine learning (DNN),
statistics, natural language processing and computer program skills (Perl,
Python).

Following profiles are welcome, either:

· Strong background in signal processing
or
· Strong experience with natural language processing

Excellent English writing and speaking skills are required in any case.

Candidates should email a detailed CV with diploma

LORIA is the French acronym for the ?Lorraine Research Laboratory in Computer
Science and its Applications? and is a research unit (UMR 7503), common to
CNRS, the University of Lorraine and INRIA. This unit was officially created in
1997. Loria?s missions mainly deal with fundamental and applied research in
computer sciences.

MULTISPEECH is a joint research team between the Université of Lorraine, Inria,
and CNRS. Its research focuses on speech processing, with particular emphasis
to multisource (source separation, robust speech recognition), multilingual
(computer assisted language learning), and multimodal aspects (audiovisual
synthesis).

Context and objectives

Under noisy conditions, audio acquisition is one of the toughest challenges to
have a successful automatic speech recognition (ASR). Much of the success
relies on the ability to attenuate ambient noise in the signal and to take it
into account in the acoustic model used by the ASR. Our DNN (Deep Neural
Network) denoising system and our approach to exploiting uncertainties have
shown their combined effectiveness against noisy speech.

The ASR stage will be supplemented by a semantic analysis. Predictive
representations using continuous vectors have been shown to capture the
semantic characteristics of words and their context, and to overcome
representations based on counting words. Semantic analysis will be performed by
combining predictive representations using continuous vectors and uncertainty
on denoising. This combination will be done by the rescoring component. All our
models will be based on the powerful technologies of DNN.

The performances of the various modules will be evaluated on artificially noisy
speech signals and on real noisy data. At the end, a demonstrator, integrating
all the modules, will be set up.

Main activities

? study and implementation of a noisy speech enhancement module and a
propagation of uncertainty module;
? design a semantic analysis module;
? design a module taking into account the semantic and uncertainty information.

References

[Nathwani et al., 2018] Nathwani, K., Vincent, E., and Illina, I. DNN
uncertainty propagation using GMM-derived uncertainty features for noise robust
ASR, IEEE Signal Processing Letters, 2018.

[Nathwani et al., 2017] Nathwani, K., Vincent, E., and Illina, I. Consistent DNN
uncertainty training and decoding for robust ASR, in Proc. IEEE Automatic
Speech Recognition and Understanding Workshop, 2017.

[Nugraha et al., 2016] Nugraha, A., Liutkus, A., Vincent E. Multichannel audio
source separation with deep neural networks. IEEE/ACM Transactions on Audio,
Speech, and Language Processing, 2016.

[Sheikh, 2016] Sheikh, I. Exploitation du contexte sémantique pour améliorer la
reconnaissance des noms propres dans les documents audio diachroniques?, These
de doctorat en Informatique, Université de Lorraine, 2016.

[Sheikh et al., 2016] Sheikh, I. Illina, I. Fohr, D. Linares, G. Learning word
importance with the neural bag-of-words model, in Proc. ACL Representation
Learning for NLP (Repl4NLP) Workshop, Aug 2016.

[Mikolov et al., 2013a] Mikolov, T. Chen, K., Corrado, G., and Dean, J.
Efficient estimation of word representations in vector space, CoRR, vol.
abs/1301.3781, 2013.

Top

6-8

(2019-02-16) PhD's grants at SHEFFIELD CENTRE FOR DOCTORAL TRAINING IN SPEECH AND LANGUAGE, UK

SHEFFIELD CENTRE FOR DOCTORAL TRAINING IN SPEECH AND LANGUAGE

The University of Sheffield has won an £8M grant to fund some 60 PhDs over a period of 8
years in Speech and Language Technology. This 'Centre for Doctoral Training' is one of 16
newly announced by UKRI (UK Research and Innovation) to strengthen British AI research.

The Sheffield CDT will cover a wide range of SLT subjects, researching and developing
robust methods for natural language and speech processing in challenging real world
scenarios. Its research will stimulate novel applications of SLTs in sectors such as
health, sport, manufacturing, aerospace, robotics, finance, political science and digital
humanities.

A major feature of the CDT is close collaboration with industry, involving multinationals
such as Google, Amazon, Voicebase, Nuance, NHS Digital, Solvay and TechNation, as well as
substantial UK SME support.

In addition to a Ph.D., students will complete a Postgraduate Diploma (PGDip) in SLT
Leadership. This bespoke programme will provide them with the necessary skills for
academic and industrial leadership in SLT, and will cover software engineering,
entrepreneurship, management, and societal responsibility.

The SLT is headed by Professor Thomas Hain and Professor Rob Gaizauskas.

For more details, visit

https://www.sheffield.ac.uk/dcs/latest-news/speech-and-language-centre-for-doctoral-training-1.831895

https://slt-cdt.group.shef.ac.uk/

Top

6-9

(2019-02-20) Post-doc position at GIPSA-Lab, Grenoble, France

The CRISSP team of GIPSA-lab (Grenoble-France) is opening a 24-month postdoctoral position on 'Text-to-Speech alignment for assessing reading fluency of young children' within the framework of the e-FRAN Fluence project, where we assess and train 700 pupils and 300 collegians.

The detailed description in attachement and here : http://www.gipsa-lab.grenoble-inp.fr/transfert/propositions/2_2019-02-25_PostDocELARGIRCNRSEnglish.pdf

Deadline for application is 31th March.

Top

6-10

(2019-02-20) CDD ingenieur d'etudes chargé d'affaires, Aix-Marseille, France

En collaboration avec les directions des trois plateformes technologiques1 CRVM, CEP et H2C2, et sous la responsabilité du chargé de mission Plateformes technologiques d?Aix-Marseille, le/la chargé.e d?affaires conçoit et met en oeuvre la stratégie de développement des

offres de services des plateformes technologiques afin de favoriser leur croissance.

Date limite de candidature 23 avril 2019

Top

6-11

(2019-02-27) Postdoctoral position at IRISA Rennes France

Postdoctoral researcher

Job description
CONTEXT IRISA (https://www.irisa.fr/) is the largest research laboratory dedicated to computer science in France, hosting more than 800 people and 40 research teams. Its activities spans all the fields of computer science. It is located in Rennes, Lannion, and Vannes. The Expression team (https://www-expression.irisa.fr/) focuses on natural language processing (NLP), be it through texts, speech or gestures. In particular, it has internationally recognized expertise in textto-speech (TTS). The opened position is part of a project aimed at the synthesis of the Breton language. The recruited person will collaborate with other researchers and engineers implied in NLP and TTS.
TASKS Development of NLP modules for Breton and integration in a TTS pipeline, i.e.:

1. Phonetization, grapheme-to-phoneme conversion.

2. Text normalization.

3. POS tagging and chunking ;

4. Adaptation of the team’s TTS engine. The position also includes data management and project monitoring tasks.
REQUIRED QUALIFICATION & SKILLS

• PhD in computer science

• Recent skills in natural language processing

• Recent skills in machine learning

• Top academic and publication records

• Good communication skills

• Team work experience

CONTRACT

• 18 month, full time.

• Campus of Lannion (22).

• Competitive salary, depending on the experience.

• Starting as soon as possible.

APPLICATION & INFORMATION

Please send a cover letter and resume to :

• Gwénolé Lecorvé, gwenole.lecorve@irisa.fr

• Damien Lolive, damien.lolive@irisa.fr

Application deadline: 31th March 2019.

Top

6-12

(2019-03-03) Postdoc at CRIM, Montreal, Canada

CRIM (COMPUTER RESEARCH INSTITUTE OF MONTREAL) IS HIRING!

POSTDOCTORAL RESEARCHER POSITION - SPEAKER RECOGNITION Speech and Text group

CRIM (http://www.crim.ca/en) is an applied research and expertise centre in information technology, dedicated to making organizations more effective and competitive through the development of innovative technology and the transfer of leading edge know-how, while contributing to scientific advancement.

JOB DESCRIPTION
CRIM is looking for a postdoctoral researcher with a background in speaker recognition, and, ideally, in other related fields such as speaker diarization, speech recognition and machine learning. The successful candidate will work on speaker recognition RD activities within the Speech and Text group.

The ideal candidate must be able to work on team research themes and supervise graduate students in an open environment where collaborations with experts in other fields at CRIM is valued.

The position is offered on a one-year basis with the possibility of renewal for up to 3 or 4 years depending on performance and funding.

Responsibilities:

● Perform high quality research on speaker recognition and anti-spoofing

● Assist in supervising graduate students

● Publish in referred journals and conferences

● Write/contribute to grant applications for new research projects

SKILLS AND EXPERIENCE
Required qualifications:

● Doctoral degree (Ph.D.) in a relevant field

● Exceptional academic record and a clear aptitude for research ● Experience in student supervision ● Good publication record

● Excellent verbal and written communication skills in English

Preferred qualifications:

● Familiarity with modern distributed programming environments and with languages such as C++, Python and Perl

● Programming experience with relevant tools such as Theano, TensorFlow, Torch or Kaldi

● Proficiency in written and spoken French

ABOUT THE ENVIRONMENT
Located in Montreal, an historical, vibrant and culturally diverse city with 6 universities, recognized for its safety and low cost of living. Already a favorite of high tech and creative industries, Montreal have recently received large public and private investments and been turned into a leading center in machine learning and artificial intelligence research.

GOOD REASONS FOR JOINING CRIM’S TEAM
● Benefit from various attractive employment conditions (Drug and health insurance plan, Pension plan, Competitive salary, French training programs)

● Reap the benefits of an outstanding work atmosphere, characterized by mutual support and good humour

● Work alongside passionate people in a collaborative setting

● Maintain work/family balance and quality of life

HOW TO APPLY
Apply directly to emploi@crim.ca, or use our online form.

CRIM is an equal opportunity employer and values diversity. We encourage the development of ideas as a team and cultivate an open work environment that respects differences. We encourage all candidates to apply for this position; however, only selected individuals will be contacted. Thank you for your interest in CRIM!

Join CRIM’s team and work with dynamic and passionate people!

Top

6-13

(2019-03-03) Professor (W2) Speech Technology and Hearing Devices at University of Oldenburg, Germany

Professor (W2) Speech Technology and Hearing Devices at University of
Oldenburg, Cluster of Excellence Hearing4all
Oldenburg, Germany (website: http://hearing4all.eu/EN/)

-> link: https://uol.de/stellen/?stelle=66194

Top

6-14

(2019-03-04) PhD positions at the University of Edinburgh, Scotland, UK

Title: PhD positions at the University of Edinburgh

Link: http://www.edinburghnlp.org/cdt/

Text:

FULLY FUNDED FOUR-YEAR PHD STUDENTSHIPS

UKRI CENTRE FOR DOCTORAL TRAINING IN NATURAL LANGUAGE PROCESSING

School of Informatics
School of Philosophy, Psychology and Language Sciences
University of Edinburgh

UK Research and Innovation has recently announced funding for a Centre
in Doctoral Training in Natural Language Processing (CDT in NLP) at
the University of Edinburgh. This CDT offers unique, tailored doctoral
training consisting of both taught courses and a doctoral
dissertation. Both components run concurrently over four years. Each
student will take a set of courses designed to complement their existing
expertise and give them an interdisciplinary perspective on NLP. They
will received full funding for four years, plus generous
funding for travel, equipment, and research costs.

The CDT brings together researchers in NLP, speech, linguistics, cognitive
science, and design informatics from across the University of Edinburgh.
Students will be supervised by a team of over 40 world-class faculty
and will benefit from cutting edge computing and experimental facilities,
including a large GPU cluster and eye-tracking, speech, virtual reality,
and visualization labs. The CDT involves over 20 industrial partners,
including Amazon, Facebook, Huawei, Microsoft, Mozilla, Reuters, Toshiba,
and the BBC. Close links also exist with the Alan Turing Institute and the
Bayes Centre.

The first cohort of CDT students will start in September 2019, and we
are now seeking applications. A wide range of research topics fall
within the remit of the CDT:

Natural language processing and computational linguistics
Speech technology
Dialogue, multimodal interaction, language and vision
Information retrieval and visualisation, computational social science
Computational models of human cognition and behaviour, including
language and speech processing
Human-Computer interaction, design informatics, assistive and
educational technology
Psycholinguistics, language acquisition, language evolution,
language variation and change
Linguistic foundations of language and speech processing
Approximately 8 studentships are available, covering both maintenance
at the research council rate of GBP 15,009 per year and tuition fees.
Studentships are available for UK, EU, and non-EU nationals.

Applicants should have an undergraduate or master?s degree in computer
science, linguistics, cognitive science, AI, or a related discipline. We
particularly encourage applications from women, minorities, and members
of other groups that are underrepresented in technology.

Further details including the application procedure can be found at:

http://www.edinburghnlp.org/cdt/

In order to ensure full consideration for funding, applications
(including all supporting documents) need to be received by 29 March
2019. Please direct inquiries to the PhD admissions team at
cdt-nlp-admissions@inf.ed.ac.uk.

The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336.

Top

6-15

(2019-03-07) Two PhD positions in the area of Information Extraction, Data Mining and Machine Learning at Saarland University

Two PhD positions in the area of Information Extraction, Data Mining and Machine Learning at Saarland University

We anticipate the availability of funds for two PhD positions in the area of information extraction, data mining and machine learning.

The project aims at designing a framework for extracting evidence and actionable intelligence from large amount of noisy multilingual multimodal data based on advanced speech and language technologies (SLTs), visual analysis (VA) and network analysis (NA). The overall project goal is to achieve a significant improvement in identification of events, entities and relations, and to design a new generation of probabilistic and neural networks based tools interfacing SLT, VA and NA technologies. The research will be carried out together with a European consortium of high-profile research institutes and companies.

The successful candidate should have a degree in computer science, computational linguistics, mathematics, physics or a discipline with a related background. Excellent programming skills in modern object-oriented languages are required, as well as strong analytical and problem-solving skills. Relevant expertise in the area of the project is desired. Very good oral and written communication skills in English are required.
This work will be conducted at the Spoken Language Systems group (http://www.lsv.uni-saarland.de/) at Saarland University.
Saarland University (http://www.uni-saarland.de/en/) is a European leader in Computer Science research and teaching, and is particularly well-known for its research in Computational Linguistics and Natural Language Processing. In addition, the Max Planck Institute for Computer Science, the Max Planck Institute for Software Systems and the German Research Center for Artificial Intelligence (DFKI) are located on campus. Students and researchers come from many countries and the research language is English.

The department of Language Science and Technology at Saarland University is one of the leading departments in the speech and language area in Europe. The flagship project at the moment is the CRC on Information Density and Linguistic Encoding. It also runs a significant number of European and nationally funded projects.
Both positions are 3-years positions with a salary German TV-L E13 scale (75%). The starting salary is about 34,000 euros per year and increases with experience.
Each application should include:

* Curriculum Vitae including a list of publications (if applicable)
* Transcript of records
* Short statement of interest (not more than half a page)
* Names of two references
* Any other supporting information or documents

Applications (documents in PDF format in a single file) should be sent to: Dietrich.Klakow@lsv.uni-saarland.de
Priority will be given to applications received by Monday April 15th 2019.
Further inquiries regarding the project should be directed to: Michael A. Hedderich or Olga Petukhova

Top

6-16

(2019-03-23) Chargé de recherches à l'IRCAM, Paris, France

L'équipe Analyse-Synthèse de l'IRCAM recherche un/une chargé(e) de recherche pour le
développement des technologies relatives à l?analyse automatique d?enregistrements
musicaux (progression d?accords, identification d?instruments, qualité audio,
auto-tagging).

EXPÉRIENCE ET COMPÉTENCES REQUISES :

- Titulaire d?un doctorat, le/la candidat/e a une très bonne connaissance du traitement
du signal (analyse spectrale, extraction de descripteurs audio, estimation de paramètres)
ainsi qu?en algorithmes d?apprentissage automatique (SVM, ConvNet) et en calcul distribué
;
- Il/Elle maîtrise la programmation Matlab, Python et C/C++ ;
- Il/Elle a une bonne connaissance de l'environnement UNIX (GNU-Linux ou MacOSX) ;
- Haute productivité, travail méthodique, excellent style de programmation, bonne
communication, rigueur.

Toutes autres information sont disponibles à cette adresse :
https://www.ircam.fr/job-offer/chargee-de-recherche-3/

Prière d'envoyer une lettre de motivation avec la référence 201903UMGRES et un CV
détaillant le niveau d'expérience/expertise dans les domaines mentionnés ci-dessus (ainsi
que toute autre information pertinente) à mignot at ircam dot fr avec copie à vinet at
ircam dot fr et roebel at ircam dot fr.

Top

6-17

(2019-03-30) Lectureship @ Sheffield, UK

Lectureship @ Sheffield

We are recruiting a Lecturer/Senior Lecturer in Speech and Hearing Technologies to join the Speech and Hearing Group at the University of Sheffield. The group currently has interests that span speech recognition, speech enhancement and source separation, diarisation, speaker and language identification, language learning, assistive robotics and media application of speech technology, spoken dialogue systems and downstream integration such as machine translation, summarisation, and speech analytics. SPandH is host to the Voicebase Centre for Speech and Language Technology and the UKRI Centre for Doctoral Training in Speech and Language Technology and Their Applications.

For further details visit

https://www.jobs.ac.uk/job/BRD606/lecturer-senior-lecturer-in-speech-and-hearing-technologies

-- ************************ Professor Phil Green SPandH Dept of Computer Science University of Sheffield ************************

Top

6-18

(2019-04-01) Project manager in NLP

Job Description

En tant que Project Manager, vous participerez au développement et à l’amélioration du traitement du langage naturel. Cela inclut :

Selon le type de projet, vous serez amené(e) à former, encadrer, et superviser une équipe afin de réaliser diverses tâches de traitement automatique des langues (TAL)
Le développement, la maintenance et l’amélioration de la qualité des systèmes de traitement du langage naturel
Entraîner et/ou évaluer des systèmes de dialogue ou de traitement du langage naturel, identifier les erreurs/régressions et proposer des solutions
La consultation linguistique sur le traitement automatique appliqué à une langue et/ou sur l’analyse/modélisation du dialogue
Garantir la qualité de l’output linguistique pour les utilisateurs dans les dialogues homme-machine

Compétences/Expériences :

Niveau natif en français canadien et maîtrise courante de l’anglais.
Diplôme de licence en linguistique, linguistique informatique et/ou dans des disciplines associées. (niveau bac + 4)
Capacité à comprendre rapidement des concepts techniques et à apprendre à manier les outils internes à une entreprise.
Réel intérêt pour les (nouvelles) technologies.
Réel intérêt pour le traitement automatique des langues, la linguistique théorique et descriptive ainsi que toutes les formes de ressources linguistiques.
Attention du détails, compétences d’organisation et de communication.
Expérience avec la gestion de projet.
Capacité à travailler efficacement de manière indépendante et avec flexibilité dans un environnement collaboratif évoluant constamment de façon rapide.

Intitulé du poste :

Project manager

Champs linguistiques :

Anglais, français, canadien (obligatoire)

Lieu :

Paris, France

Salaire : selon expérience

CV + lettre de motivation en Anglais : celine.couillaud@adeccooutsourcing.fr

Top

6-19

(2019-04-02) Post-doc at IRISA, Rennes, France

The Expression team of the IRISA lab (France) is opening a 18-month postdoctoral position on speech synthesis and ?natural language processing for the Breton language, starting as soon as possible.

Detailed missions:

Development of NLP and TTS modules for Breton
Phonetization, grapheme-to-phoneme conversion
Text normalization
POS tagging and chunking
Adaptation of the team?s TTS engine.

This includes the use of machine learning techniques.

Profile / skills :

PhD in natural language processing, speech processing or machine learning
Good academic and publication records
Good communication skills.

The detailed description in attachement and here : http://www.irisa.fr/fr/page/text-speech-and-natural-language-processing-breton-language

Please contact : gwenole.lecorve@irisa.fr and damien.lolive@irisa.fr

Top

6-20

(2019-04-11) Senior Speech Processing Engineer (M/F) , Parrot Faurecia Automotive Paris, France

Senior Speech Processing Engineer (M/F)

CONTEXT

Within the software department of Parrot Faurecia Automotive in Paris, you will be in charge of developing the speech recognition architecture on our new platform, managing the expertise topics related to Virtual Personal Assistants and managing specific developments for demonstrations and prototypes.

MISSIONS

Reporting to the manager of the company's transversal activities (voice recognition, navigation, translations, App Market), your missions will be as follows:

Define, specify and implement the speech recognition architecture taking into account the limits and interests of both modes (embedded/deported)
Prototype and model the different dialogue platforms
Write the technical specifications of these HMIs
Study, analyze and respond to calls for tenders related to the subject
Participate in customer and supplier workshops
Develop and implement the speech recognition and Natural Language Understanding (NLU) part for modeling and prototyping
Identify and document speech recognition technological innovations to extract added value for the company
Study, analyze and characterize speech and NLU recognition solutions from suppliers and competitors
Collaborate within a dynamic and motivated team
Ensure that speech recognition works properly on the new platform
Create dialogue platforms that combine flexibility of use, robustness and performance
Provide expertise and support on the theme 'Speech Processing': advice, methods, tools

PROFILE

You are a graduate of an engineering school or thesis, ideally in computer science with a specialization in speech processing.

You have at least 5 years of experience in the field of speech recognition.

A good command of C++ and Python languages and a solid knowledge of speech processing are essential.

Ideally, you are familiar with:

Understanding natural language
Signal processing
Nuance Technologies
Google Assistant
Alexa
Java development under Android

Occasional travel is expected.

Fluent French is required.

Your level of English allows you to exchange in writing and orally, you master the technical vocabulary in particular.

Skills in Chinese would be a plus.

To apply you can send your resume directly to this email : mathilde.tarente@faurecia.com or go to our website and apply by clicking on the link under the corresponding job offer : https://parrot-faurecia-automotive.talentview.io/jobs/7lz0hr?source=site_entreprise

We will get back to you shortly !!

Annonce pour le poste de Voice Assistant Integration Engineer (M/F) :

Would you like to be part of a French company known for its innovation and agility?

Come and join Parrot Faurecia Automotive, a leading automotive equipment manufacturer with a start-up mentality, working to develop the cockpit of the future!

Located in central Paris, our teams engineer innovative technologies by developing man-machine interfaces based on Android Auto to provide an intuitive connected experience to end users. Our embedded electronic solutions are equipped with powerful multi-core graphics processors to provide quick response times for multiple simultaneous instructions.

Join us in taking up one of the greatest challenges of the automotive industry!

We are looking for a Voice Assistant Integration Engineer (M/F)

CONTEXT

Within the software department of Parrot Faurecia Automotive in Paris, you will be in charge of managing the expertise subjects related to Virtual Personal Assistants and managing specific developments for demonstrations and prototypes.

MISSIONS

Reporting to the manager of the company's transversal activities (voice recognition, navigation, translations, App Market), your missions will be as follows:

Prototype and model the different dialogue platforms
Write the technical specifications of these HMIs
Study, analyze and respond to calls for tenders related to the subject
Participate in customer and supplier workshops
Develop and implement the speech recognition and Natural Language Understanding (NLU) part for modeling and prototyping
Identify and document speech recognition technological innovations to extract added value for the company
Study, analyze and characterize speech and NLU recognition solutions from suppliers and competitors
Collaborate within a dynamic and motivated team

PROFILE

You are a graduate of an engineering school or thesis, ideally in computer science with a specialization in speech processing.

You have at least 2 years of experience in the field of speech recognition.

You are proficient in C++ language, Java development under Android and you have a solid knowledge of Google and Alexa assistants.

Ideally, you are familiar with:

Understanding natural language
Speech processing
Signal processing
Nuance Technologies
The Python language

Occasional travel is expected.

Fluent French is required.

Your level of English allows you to exchange in writing and orally, you master the technical vocabulary in particular.

Skills in Chinese would be a plus.

Top

6-21

(2019-04-11) Project Manager Intellectual Property and Language Data , ELDA, Paris, France

The European Language resources Distribution Agency (ELDA), a company specialised in Human Language Technologies within an international context is currently seeking to fill an immediate vacancy for a permanent Project Manager Intellectual Property and Language Data position.

Under the CEO?s supervision, the Project Manager Intellectual Property and Language Data will handle legal issues related to compilation, use and distribution of language datasets on a European and international scale. This yields excellent opportunities for young, creative, and motivated candidates wishing to participate actively to the Language Engineering field.

His main tasks will consist of:

drafting and negotiating distribution contracts for language datasets to be added to an online catalogue;
analysing of the legal status of language datasets;
implementing GDPR requirements in the processing of language data;
supervising data collection, particularly in the context of public Open Data;
implementing evaluation procedures for IPR clearance of digital data.

A successful candidate:

holds a Master?s degree (or equivalent) in IT Law, with good understanding of intellectual property and data protection;
holds a Bachelor?s degree (or justifies equivalent experience) in Information Science, Knowledge Management or a similar domain;
speaks fluent English, with advanced writing and analytical skills;
is familiar with public licensing schemes (CC, GPL, etc.);
justifies experience in project management and/or participation in European or international projects;
is dynamic, communicative, flexible and willing to work on various tasks;
is capable of working independently as well as in a team;
is an EU citizen, or has a residence permit enabling him to work in France.

All applications will be carefully examined until the position is filled. The position is based in Paris.

Gross annual salary: 30.000-36.000 EUR depending on experience.

Applicants should email a cover letter addressing the points listed above together with a curriculum vitae to: job@elda.org.

ELDA is a human-sized company (15 people) acting as the distribution agency of the European Language Resources Association (ELRA). ELRA was established in February 1995, with the support of the European Commission, to promote the development and exploitation of Language Resources (LRs). Language Resources include all data necessary for language engineering, such as monolingual and multilingual lexica, text corpora, speech databases and terminology. The role of this non-profit membership Association is to promote the production of LRs, to collect and to validate them and, foremost, make them available to users. The association also gathers information on market needs and trends.

For further information about ELDA/ELRA, visit: http://www.elra.info

Top

6-22

(2019-04-13) Post-doc/PhD position Pattern mining for Neural Networks debugging: application to speech recognition, LORIA , Nancy, France

Post-doc/PhD position Pattern mining for Neural Networks debugging: application to speech recognition

Advisors: Elisa Fromont & Alexandre Termier, IRISA/INRIA RBA ? Lacodam team (Rennes)
Irina Illina & Emmanuel Vincent, LORIA/INRIA ? Multispeech team (Nancy)
firstname.lastname@inria.fr

Location: INRIA RBA, team Lacodam (Rennes)

Keywords: discriminative pattern mining, neural networks analysis, explainability of black
box models, speech recognition.

Context:

Understanding the inner working of deep neural networks (DNN) has attracted a lot of attention in the past years [1, 2] and most problems were detected and analyzed using visualization techniques [3, 4]. Those techniques help to understand what an individual neuron or a layer of neurons are computing. We would like to go beyond this by focusing on groups of neurons which are commonly highly activated when a network is making wrong predictions on a set of examples. In the same line as [1], where the authors theoretically link how a training example affects the predictions for a test example using the so called ?influence functions?, we would like to design a tool to ?debug? neural networks by identifying, using symbolic data mining methods, (connected) parts of the neural network architecture associated with erroneous or uncertain outputs.

In the context of speech recognition, this is especially important. A speech recognition system contains two main parts: an acoustic model and a language model. Nowadays models are trained with deep neural networks-based algorithms (DNN) and use very large learning corpora to train an important number of DNN hyperparameters. There are many works to automatically tune these hyperparameters. However, this induces a huge computational cost, and does not empower the human designers. It would be much more efficient to provide human designers with understandable clues about the reasons for the bad performance of the system, in order to benefit from their creativity to quickly reach more promising regions of the hyperparameter search space.

Description of the position:

This position is funded in the context of the HyAIAI ?Hybrid Approaches for Interpretable AI? INRIA project lab (https://www.inria.fr/en/research/researchteams/inria-project-labs). With this position, we would like to go beyond the current common visualization techniques that help to understand what an individual neuron or a layer of neurons is computing, by focusing on groups of neurons that are commonly highly activated when a network is making wrong predictions on a set of examples. Tools such as activation maximization [8] can be used to identify such neurons. We propose to use discriminative pattern mining, and, to begin with, the DiffNorm algorithm [6] in conjunction with the LCM one [7] to identify the discriminative activation patterns among the identified neurons.

The data will be provided by the MULTISPEECH team and will consist of two deep architectures as representatives of acoustic and language models [9, 10]. Furthermore, the training data will be provided, where the model parameters ultimately derive from. We will also extend our results by performing experiments with supervised and unsupervised learning to compare the features learned by these networks and to perform qualitative comparisons of the solutions learned by various deep architectures. Identifying ?faulty? groups of neurons could lead to the decomposition of the DL network into ?blocks? encompassing several layers. ?Faulty? blocks may be the first to be modified in the search for a better design.

The recruited person will benefit from the expertise of the LACODAM team in pattern mining and deep learning (https://team.inria.fr/lacodam/) and of the expertise of the MULTISPEECH team (https://team.inria.fr/multispeech/) in speech analysis, language processing and deep learning. We would ideally like to recruit a 1 year (with possibly one additional year) post-doc with the following preferred skills:
? Some knowledge (interest) about speech recognition
? Knowledgeable in pattern mining (discriminative pattern mining is a plus)
? Knowledgeable in machine learning in general and deep learning particular
? Good programming skills in Python (for Keras and/or Tensor Flow)
? Very good English (understanding and writing)

However, good PhD applications will also be considered and, in this case, the position will last 3 years. The position will be funded by INRIA (https://www.inria.fr/en/). See the INRIA web site for the post-doc and PhD wages.

The candidates should send a CV, 2 names of referees and a cover letter to the four researchers (firstname.lastname@inria.fr) mentioned above. Please indicate if you are applying for the post-doc or the PhD position. The selected candidates will be interviewed in June for an expected start in
September 2019.

Bibliography:

[1] Pang Wei Koh, Percy Liang: Understanding Black-box Predictions via Influence Functions. ICML 2017: pp 1885-1894 (best paper).

[2] Chiyuan Zhang, Samy Bengio, Moritz Hardt, Benjamin Recht, Oriol Vinyals: Understanding deep learning requires rethinking generalization. ICLR 2017.

[3] Anh Mai Nguyen, Jason Yosinski, Jeff Clune: Deep neural networks are easily fooled: High confidence predictions for unrecognizable images. CVPR 2015: pp 427-436.

[4] Christian Szegedy, Wojciech Zaremba, Ilya Sutskever, Joan Bruna, Dumitru Erhan, Ian Goodfellow, Rob Fergus: Intriguing properties of neural networks. ICLR 2014.

[5] Bin Liang, Hongcheng Li, Miaoqiang Su, Pan Bian, Xirong Li, Wenchang Shi: Deep Text Classification Can be Fooled. IJCAI 2018: pp 4208-4215.

[6] Kailash Budhathoki and Jilles Vreeken. The difference and the norm?characterising similarities and differences between databases. In Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pages 206?223. Springer, 2015.

[7] Takeaki Uno, Tatsuya Asai, Yuzo Uchida, and Hiroki Arimura. Lcm: An efficient algorithm for enumerating frequent closed item sets. In Fimi, volume 90. Citeseer, 2003.

[8] Dumitru Erhan, Yoshua Bengio, Aaron Courville, and Pascal Vincent. Visualizing higher-layer features of a deep network. University of Montreal, 1341(3):1, 2009.

[9] G. Saon, H.-K. J. Kuo, S. Rennie, M. Picheny: The IBM 2015 English conversational telephone speech recognition system?, Proc. Interspeech, pp. 3140-3144, 2015.

[10] W. Xiong, L. Wu, F. Alleva, J. Droppo, X. Huang, A. Stolcke : The Microsoft 2017 Conversational Speech Recognition System, IEEE ICASSP, 2018.

Top

6-23

(2019-04-13) PhD Thesis position or research engineer or post-doc position in Natural Language Processing: Introduction of semantic information in a speech recognition system, LORIA, Nancy, France

PhD Thesis position or research engineer or post-doc position in Natural Language Processing: Introduction of semantic information in a speech recognition system

Supervisors: Irina Illina, MdC, Dominique Fohr, CR CNRS

Team: Multispeech, LORIA-INRIA

Contact: illina@loria.fr, dominique.fohr@loria.fr

Duration of post-doc or research engineer: 12-18 months

Duration of PhD Thesis : 3 years

Deadline to apply : May 15th, 2019

Required skills: background in statistics, natural language processing and computer program skills (Perl, Python). Candidates should email a detailed CV with diploma

Under noisy conditions, audio acquisition is one of the toughest challenges to have a successful automatic speech recognition (ASR). Much of the success relies on the ability to attenuate ambient noise in the signal and to take it into account in the acoustic model used by the ASR. Our DNN (Deep Neural Network) denoising system and our approach to exploiting uncertainties have shown their combined effectiveness against noisy speech.

The ASR stage will be supplemented by a semantic analysis. Predictive representations using continuous vectors have been shown to capture the semantic characteristics of words and their context, and to overcome representations based on counting words. Semantic analysis will be performed by combining predictive representations using continuous vectors and uncertainty on denoising. This combination will be done by the rescoring component. All our models will be based on the powerful technologies of DNN.

Main activities

study and implementation of a noisy speech enhancement module and a propagation of uncertainty module;
design a semantic analysis module;
design a module taking into account the semantic and uncertainty information.

Skills

Strong background in mathematics, machine learning (DNN), statistics

Following profiles are welcome, either:

Strong background in signal processing

Strong experience with natural language processing

Excellent English writing and speaking skills are required in any case.

References

[Nathwani et al., 2018] Nathwani, K., Vincent, E., and Illina, I. DNN uncertainty propagation using GMM-derived uncertainty features for noise robust ASR, IEEE Signal Processing Letters, 2018.

[Nathwani et al., 2017] Nathwani, K., Vincent, E., and Illina, I. Consistent DNN uncertainty training and decoding for robust ASR, in Proc. IEEE Automatic Speech Recognition and Understanding Workshop, 2017.

[Nugraha et al., 2016] Nugraha, A., Liutkus, A., Vincent E. Multichannel audio source separation with deep neural networks. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2016.

[Sheikh, 2016] Sheikh, I. Exploitation du contexte sémantique pour améliorer la reconnaissance des noms propres dans les documents audio diachroniques?, These de doctorat en Informatique, Université de Lorraine, 2016.

[Peters et al., 2017] Matthew Peters, Waleed Ammar, Chandra Bhagavatula, and Russell Power. 2017. ?Semi-supervised sequence tagging with bidirectional language models.? In ACL.

[Peters et al., 2018] Matthew Peters, Mark Neumann, Mohit Iyyer, Matt Gardner, Christopher Clark, Kenton Lee, and Luke Zettlemoyer. 2018. ?Deep contextualized word representations?. In NAACL.

Top

6-24

(2019-04-15) PhD grant, Sigmedia, Trinity College, Dublin

PhD Title: Remote and Automatic Monitoring of Bird Populations

Studentship: Full Scholarship, including fees (EU/Non EU) plus annual stipend of €18,000.

Start Date: Sept 2019

PhD Supervisor: Dr. Naomi Harte, Sigmedia Group, Electronic & Electrical Engineering, Trinity College Dublin, Ireland

Background:

The analysis of birdsong has increased in the speech processing community in the past 5 years. Much of the reported research has concentrated on the identification of bird species from their songs or calls. Birdlife International has identified over 12,000 sites globally that are home to bird species of conservation concern and other forms of biodiversity. Out of these, 422 are in danger due to a number of threats including human encroachment and climate change. One of the main challenges in properly managing these sites is adequately monitoring them to determine their state, threats to the ecosystem and responses to these threats. Remote monitoring is the best potential option to achieve the level of coverage required.

The objective of this PhD project is to define the next-generation approaches to the use of remote monitoring for populations of birds of conservation concern. This PhD programme will develop acoustic techniques for the monitoring of bird species of conservation concern by leveraging recent developments in speech and language processing technologies. The PhD will develop appropriate approaches to acoustic data collection in the wild to ensure that acoustic surveys yield accurate bird population data and investigate audio signal analysis steps necessary to extract useful information from these long recordings. In particular the student will focus on signal enhancement to mitigate noise, and the idea of diarisation, i.e. the concept of 'who sang when'. This ambitious approach will take concepts from speaker diarisation in the speech processing domain and attempt to make sense of recordings overall. Birdsong presents significant challenges beyond speech, with more rapid pitch fluctuations coupled with noisier recordings in the wild. Thus the research is very far from a re-application of knowledge from one domain to another. Also, rather than trying to identify specific species in a recording from a closed set of possible birds, this approach will consider an unconstrained set to add to the technical challenges and make the results even more impactful. The desire is to exploit online archives of birdsong recordings from experts such as those available on xeno-canto.org and The Macaulay Library at Cornell. Based on the known geographical location of a recording, spontaneous models of bird vocalisations from populations in that area could be constructed using machine learning performed on available archived recordings. Techniques developed originally for speaker

identification will be further developed for this application. This work will also leverage deep learning to quickly build accurate models from these large datasets.

Envisaged Outputs of the Research:

• Signal processing algorithms to address noise issues specific to remote recordings in bird habitats.
• Exploitation of advanced machine learning approaches, including deep learning, to identify portions of recordings that contain bird activity.
• Disruptive approaches to automatic bird species identification to leverage opensource repositories to identify birds present in the recordings.

Requirements:

The ideal candidate for this position will:

• Have a primary degree (first class honours) in Electrical Engineering, Computer Engineering or a closely related discipline.
• Possess strong written and oral communication skills in English.
• Have a strong background and interest in digital signal processing (DSP)
• Have strong coding skills
• Be mathematically minded, and be curious about nature.

Interested candidates should send an email to Prof. Naomi Harte at nharte@tcd.ie. The email MUST include the following:

• Candidate CV (max 2 pages)
• A short statement of motivation (half page)
• Scanned academic transcripts
• Proof of English language competency (where applicable, see 1)
• Name and contact details for TWO academic referees

1 https://www.tcd.ie/study/international/how-to-apply/entry-requirements.php#language-requirements

Incomplete applications may not be considered.

Top

6-25

(2019-04-20) Two PhD Positions in Computational Linguistics or Phonetics or Speech Science, Saarland University, Germany

Two PhD Positions in Computational Linguistics or Phonetics or Speech Science

We are pleased to announce the availability of two PhD positions in the Language Science
and Technology department at Saarland University in a project funded by the German
Research Foundation (DFG). The three-year project is entitled 'Pause-internal phonetic
particles' and is directed by Jürgen Trouvain and Bernd Möbius. Its focus is on the
production and perception of vocalisations such as breath noises and tongue clicks
typically found in speech pauses. Research in the project will be based on corpus
analyses and production and perception experiments and develop pause models for speech
synthesis.

The successful candidates should have a Master's degree in Computational Linguistics or
Phonetics or Speech Science, or a related discipline. A good command of English is
mandatory. Working knowledge of German is desirable but not a prerequisite. Candidates
must have completed their Master studies by the time of the appointment. We are happy to
consider applicants who have not yet finished their MA/MSc by the time of application but
will have submitted their thesis by the starting date. Both contracts are funded for
three years at a 65% salary on the German TV-L 13 scale.

The doctoral researchers will join a vibrant community of speech and language researchers
at Saarland University whose expertise spans areas such as computational linguistics,
psycho-linguistics, language and speech technology, speech science, theoretical and
corpus linguistics, computer science, and psychology. Saarland University offers a lively
academic environment for phonetics research. The department of Language Science and
Technology is one of the leading departments in speech and language in Europe, with
approximately 50 postdoctoral researchers and PhD students. The flagship project at the
moment is the Collaborative Research Centre on Information Density and Linguistic
Encoding. It also runs a significant number of European and nationally funded projects.

Applicants with a degree in Phonetics, Computational Linguistics, Spoken Language
Processing, Speech Technology, or related fields, are encouraged to submit a full CV
(including transcripts and copies of certificates, as well as two references) and a
sample of written academic work, accompanied by a cover letter, to
trouvain[at]coli.uni-saarland.de and moebius[at]coli.uni-saarland.de. Review of
applications will begin on June 3, 2019, and will continue until the suitable candidates
are found.

Dr. Jürgen Trouvain and Prof. Bernd Möbius

--
Jürgen Trouvain
Saarland University
Language Science and Technology
Campus C7.2
D-66123 Saarbrücken
Tel.: +49 - (0)681 - 302 - 46 94

Delete \| Reply \| Reply to All \| Forward \| Redirect \| View Thread \| Blacklist \| Whitelist \| Message Source \| Save as \| Print
	Move \| Copy

Top

6-26

(2019-04-21) Technical Engineer/Scientist (Project Manager) position, ELDA, Paris

The European Language resources Distribution Agency (ELDA), a company specialised in Human Language Technologies within an international context is currently seeking to fill an immediate vacancy for a permanent Technical Engineer/Scientist (Project Manager) position, specialized in Speech and Multimodal technologies.

Technical Engineer / Scientist (Project Manager) in Speech and Multimodal Technologies

Under the supervision of the CEO, the responsibilities of the Technical Engineer/Scientist include designing/specifying language resources, setting up production frameworks and platforms, carrying out quality control and assessment. He/she will be in charge of renovating the current language resources production workflows. This yields excellent opportunities for young, creative, and motivated candidates wishing to participate actively to the Language Engineering field. He/she will be in charge of conducting the activities related to language resources and Speech and Multimodal technologies. The task will mostly consist in managing language resources production projects and co-ordinating ELDA?s participation in R&D projects while being also hands-on whenever required by the development team.

Profile :

PhD in computer science, speech, audiovisual/multimodal technologies
Experience and/or good knowledge in speech data collection, expertise in phonetics, transcription tools
Experience in speech recognition, synthesis, speaker ID and the well-used packages (e.g. KALDI) and the tools to produce, collect and assess quality of resources and datasets
Experience and/or good knowledge of the Language Technology area
Experience with technology transfer projects, industrial projects, collaborative projects within the European Commission or other international frameworks
Good knowledge of Linux and open source software
Proficiency in Python
Hands-on experience in Django is a plus
Ability to work independently and as part of a team, in particular the ability to supervise members of a multidisciplinary team
Dynamic and communicative, flexible to combine and work on different tasks
Proficiency in French and English
Citizenship of (or residency papers) a European Union country

All applications will be carefully examined until the position is filled. The position is based in Paris.

Salary: Commensurate with qualifications and experience.

Applicants should email a cover letter addressing the points listed above together with a curriculum vitae to: job@elda.org.

ELDA is a human-sized company (15 people) acting as the distribution agency of the European Language Resources Association (ELRA). ELRA was established in February 1995, with the support of the European Commission, to promote the development and exploitation of Language Resources (LRs). Language Resources include all data necessary for language engineering, such as monolingual and multilingual lexica, text corpora, speech databases and terminology. The role of this non-profit membership Association is to promote the production of LRs, to collect and to validate them and, foremost, make them available to users. The association also gathers information on market needs and trends.

For further information about ELDA/ELRA, visit: http://www.elra.info

Top

6-27

(2019-04-30) Ingénieur Développeur Domotique/Traitement multimédia temps-réel, LIG, Grenoble, France

Dans le cadre du projet national ANR VocADom, le Laboratoire d?Informatique de Grenoble
(LIG) recrute un ingénieur Développeur Domotique/Traitement multimédia temps-réel.
L?objectif général du projet de recherche industrielle VocADom vise à définir, en lien
avec les utilisateurs finaux, les fonctionnalités d?un système domotique à commande
vocale s?adaptant à l?utilisateur et utilisable au domicile dans des conditions réelles
(bruit, présence de plusieurs personnes). Plus détails peuvent être trouvé sur le site du
projet VocADom (https://vocadom.imag.fr).

*Mission :* La mission consiste à adapter le middleware OpenHAB gérant les capteurs et
les actionneurs de l?appartement intelligent du LIG et d?un mini-kit domotique portable
aux spécifications du projet et à réaliser l?interfaçage avec les outils de traitements
automatiques développés par les partenaires du projet. Par ailleurs, l?ingénieur sera
chargé de la mise en place technique des expérimentations et de leurs scénarios, incluant
la production et la synchronisation des données multimédia (vidéo, audio, traces
domotiques). Enfin, il sera soutien de l?intégration des algorithmes temps-réel des
équipes de recherche dans l?architecture temps-réel développée par l?entreprise THEORIS,
partenaire du projet. Le travail proposé comprendra les tâches suivantes :
* prise en main et adaptation de l?infrastructure domotique OpenHab de l?appartement
intelligent du LIG ;
* conception du mini-réseau domotique portable (sur la base d?une architecture existante)
;
* soutien aux expérimentations du projet ;
* soutien à l?intégration des algorithmes temps-réels des équipes de recherche ;
* documentation.

*Profil et compétences souhaitées :*

* niveau ingénieur ou M2 en informatique,
* compétences opérationnelles en génie logiciel (gestion de version, tests, qualité de
code),
* connaissance de Java, OSGi, C, C++, Linux,
* autonomie et force de proposition, capacité de gestion de projet,
* une première expérience d?utilisation des middlewares domotiques (OpenHAB, UPnP,
KNX...) serait un plus.

*Salaire :* 1650 à 1950e net/mois selon l?expérience

*Environnement de travail :* Le poste sera rattaché au Laboratoire d?Informatique de
Grenoble, UMR CNRS, au sein de l?équipe GETALP. L?équipe GETALP (http://getalp.imag.fr/)
regroupe plus de 40 chercheurs, ingénieurs et étudiants dans le domaine du traitement
automatisé des langues et de la parole multilingue. Le candidat sera également amené à
collaborer étroitement avec l?équipe IIHM du LIG et avec les collaborateurs de la Maison
de l?Innovation et de la Création (Maci).

*Candidature* Envoyer un CV, une lettre de motivation accompagnée éventuellement de 1 à 3
lettres de recommandation à Michel.Vacher@imag.fr et Francois.Portet@imag.fr. Les
candidatures seront examinées dès à présent au fil de l?eau jusqu?à la date de démarrage.
Merci de candidater dès que possible avant cette date.

Top

6-28

(2019-05-17) 2 PhDs in Trinity College Dublin, Ireland

2 PhDs in Trinity College Dublin, Ireland, to start in Sept 2019. Both come with a stipend of 18000Euros per year, along with full student fees for a 4 year period. Please contact me at nharte@tcd.ie if interested in either post.

Human Speech? How do I know it’s Real?

20 years ago, the major focus in developing speech synthesis systems was testing the intelligibility of the output speech. More recently, attention has switched focus to assessing not only intelligibility, but also naturalness, pleasantness, pauses, stress, intonation, emotion and listening effort. The intelligibility of systems is now so high, that synthetic voices are becoming more human-like. This is good news for generating realistic synthetic speech for applications such as voice reconstruction or gaming. In tandem, research in the area of speaker verification, or voice based biometrics, has started to pay closer attention to the issue of spoofing – where systems are attacked with reconstructed speech. Now, with improvements in speech synthesis, another realistic form of spoofing is the use synthetic speech generated by modelling the target user. So how can you tell when speech is real, or when it is fake? This is the focus of this PhD project and it goes to the very core of the nature of human speech.

Remote and Automatic Monitoring of Bird Populations

Associate Professor Naomi Harte

School of Engineering,

Trinity College Dublin

www.sigmedia.tv

Top

6-29

(2019-05-17) Post-doc/PhD position Pattern mining for Neural Networks debugging: application to speech recognition, INRIA,Rennes, France

Post-doc/PhD position Pattern mining for Neural Networks debugging: application to speech recognition

Advisors: Elisa Fromont & Alexandre Termier, IRISA/INRIA RBA ? Lacodam team (Rennes)

Irina Illina & Emmanuel Vincent, LORIA/INRIA ? Multispeech team (Nancy)

firstname.lastname@inria.fr

Location: INRIA RBA, team Lacodam (Rennes)

Keywords: discriminative pattern mining, neural networks analysis, explainability of blackbox models, speech recognition.

Context:

Description of the position:

? Some knowledge (interest) about speech recognition

? Knowledgeable in pattern mining (discriminative pattern mining is a plus)

? Knowledgeable in machine learning in general and deep learning particular

? Good programming skills in Python (for Keras and/or Tensor Flow)

? Very good English (understanding and writing)