ISCA - International Speech
Communication Association


ISCApad Archive  »  2017  »  ISCApad #231  »  Jobs

ISCApad #231

Sunday, September 10, 2017 by Chris Wellekens

6 Jobs
6-1(2017-04-10) 3 Funded PhD Research Studentships at CSTR, Edinburgh, Scotland, UK

Three Funded PhD Research Studentships at the Centre for Speech Technology Research,
University of Edinburgh.

Please see http://www.cstr.ed.ac.uk/opportunities for full details, eligibility
requirements, application procedure and deadlines.

1. Embedding enhancement information in the speech signal

Speech becomes harder to understand in the presence of noise and other distortions, such
as telephone channels. This is especially true for people with a hearing impairment. It
is difficult to enhance the intelligibility of a received speech+noise mixture, or of
distorted speech, even with the relatively sophisticated enhancement algorithms that
modern hearing aids are capable of running. A clever way around this problem might be for
the sender to add extra information to the original speech signal, before noise or
distortion is added. The receiver (e.g., a hearing aid) would use this to assist speech
enhancement.

Funding: Marie Sklodowska-Curie fellowship


2. Broadcast Quality End-to-end Speech Synthesis

Advances in neural networks made jointly in the fields of automatic speech recognition
and speech synthesis, amongst others, have led to a new understanding of their
capabilities as generative models. Neural networks can now directly generate synthetic
speech waveforms, without the limited quality of a vocoder. We have made separate
advances, using neural networks to discover representations of spoken and written
language that have applications in lightly-supervised text processing for almost any
language, and for adaptation of speaker identity and style. The project will combine
these techniques into a single end-to-end model for speech synthesis. This will require
new techniques to learn from both text and speech data, which may have other
applications, such as automatic speech recognition.

Funding: EPSRC Industrial CASE award (in collaboration with the BBC)


3. Automatic Extraction of Rich Metadata from Broadcast Speech (in collaboration with the
BBC)

The research studentship will be concerned with automatically learning to extract rich
metadata information from broadcast television recordings, using speech recognition and
natural language processing techniques.  We will build on recent advances in
convolutional and recurrent neural networks, using architectures which learn
representations jointly, considering both acoustic and textual data. The project will
build on our current work in the rich transcription of broadcast speech using neural
network based speech recognition systems, along with neural network approaches to machine
reading and summarisation.  In particular, we are interested in developing approaches to
transcribing broadcast speech in a way appropriate to the particular context.  This may
include compression or distillation of the content (perhaps to fit in with the
constraints of subtitling), transforming conversational speech into a form that is more
easy to read as text, or transcribing broadcast speech in a way appropriate for a
particular reading age.

Funding: EPSRC Industrial CASE award (in collaboration with the BBC)


--

Top

6-2(2017-04-20) Postdoc for project LaDyCa, Sorbonne, Paris

Applicants must have a PhD in linguistics as well as publications in their field of specialization. Independent research experience in one or several of the core areas of the LaDyCa project (i.e. language dynamics, linguistic typology, sociolinguistics, geolinguistics, dialectology & dialectometry) is expected. An experience in working with scholars of diverse backgrounds, e.g. linguists, sociologists, anthropologists, historians and, to some extent, mathematicians or statisticians would be greatly appreciated.The project will be funded by the IDEX (?Initiative d?Excellence?) consortium of Sorbonne Universités, France, in partnership with Ilia State University, Tbilisi, Georgia. Apart from an efficient and fluent command of English and/or French, for collegial relations. with an international team of scholars, applicants should have a good command of Georgian (written & oral skills); efficient reading skills in Russian would be an asset too. A good command of database software, and previous training or experience in computational linguistics would be also appreciated. A strong performing ability in entering data and in designing linguistic databases would be an asset.
Applications should include a statement of interest (letter of motivation), giving accurate details on the applicant?s skills corresponding to the aim of the LaDyCa project, and
how (s)he plans to process data with computing tools and gather information on the ecological, historical and social context of linguistic diversity in the Caucasus. (S)he will also
provide a CV including a list of publications, a copy of the PhD certificate, and the names and e-mail addresses of two referees. Applications should be sent as a single PDF file to the e-
mail addresses below, entitled ?Application_LaDyCa_PostDoc?:

Prof. Jean Léo Léonard < leonardjeanleo@gmail.com >
Prof. Claude Montacié < Claude.Montacie@paris-sorbonne.fr >
Deadline: applications must be submitted by 2 nd of May 2017.

The position is available from July 2017 to June 2018.The duration of employment is intended to last one year.
Net salary: around 2100 euros per month.

http://www.stih.paris-sorbonne.fr/?p=1203

Top

6-3(2017-04-22) Poste d'ATER à Paris Sorbonne, France

un poste d'ATER en Traitement automatique des langues et de la Parole est disponible à
l'Université Paris-Sorbonne. Le lien pour postuler est
http://concours.univ-paris4.fr/PostesAter?entiteBean=posteCandidatureCourant&modif=839.

Les conditions pour candidater sont disponibles sur www.paris-sorbonne.fr/ater

Top

6-4(2017-04-23) Associate research scientist-Speech at ETS, Princeton, New Jersey, USA

http://ets.pereless.com/careers/index.cfm?fuseaction=83080.viewjobdetail&CID=83080&JID=243925&type=&cfcend

 

Associate Research Scientist - Speech

 
Back to Listings Add Job to Basket
(0 Jobs)
Date Updated: April 25, 2017
Location: Princeton, NJ
Job Type: Full-Time/Regular
Travel: Not Specified
Position ID: 243925
Job Level: Entry Level(less than 2 years)
Years of Experience: Less Than 1 Year
Level of Education: Doctoral Degree
Starting Date : ASAP
 
 

Job Description

 

ETS is the world’s premier educational measurement institution and a leader in educational research. As an innovator in developing achievement and occupational tests for clients in business, education, and government, we are determined to advance educational excellence for the communities we serve.

 

ETS's Research & Development division has an opening for a research scientist in the NLP, Speech & DIAMONDS (Dialog, Multimodal, and Speech) research group. The projects in this research group focus on the application of NLP & Speech processing algorithms in automated scoring capabilities for assessment tasks involving constructed responses (such as essays and spoken responses) as well as on the application of spoken and multimodal dialog systems to assessment tasks. This is an excellent opportunity to be part of a world-renowned research and development team and have a significant impact on existing and next generation NLP & Speech systems and their application to assessment.

 

BASIC FUNCTIONS AND RESPONSIBILITIES

 

Take responsibility for conceptualizing, proposing, obtaining funding for, and directing small projects in the areas of speech processing, speech recognition, and automated speech scoring and/or assisting in moderate-to-major speech research projects. 

 

Projects may include (1) research projects, (2) development projects that use scientific principles to create (a) tools to improve the efficiency or quality of the practice of test development or statistical analysis: (b) innovative item types: or (c) the scoring of responses to open-ended items and (3) development projects that use scientific principles to create new products or product prototypes. Small research and development projects typically have minimal budgets, few or no staff other than the project director, a timeline of a year or less, and a single deliverable that is relatively narrow in scope. Major projects have substantial budgets, involve the coordination of many individuals internal and possibly external to ETS, may run across years, and may produce multiple deliverables.  Moderate projects fall in between these two types.

 

Assist in generating or contributing to new knowledge or capability in the field of speech processing, speech recognition, and spoken language technology, and in applying that new knowledge and capability to existing and/or new ETS products and services. New knowledge may take the form of new or modified educational or psychological theories: new research methodology: new development methodology: new statistical, analytic or interpretative procedures: new test designs and item types: new approaches to scoring examinee responses: and new approaches to reporting. New capabilities include developing software to instantiate new and existing knowledge. 

 

Document and disseminate the results of research and/or development projects through publication and presentation. Publication includes peer-review journals, peer-review conference proceedings, patents, books and book chapters, and other print media. Presentation may be at international, national, or regional conferences, client meetings, and ETS seminars.

 

  • Participate in setting substantive research and development goals and priorities for a group or initiative within a vice presidential area.

  • Actively seek input from peers on the quality of one’s work. Participate as a reviewer of others’ work.

  • Actively seek mentoring from more senior scientific and other R&D staff, developing a continuing mentoring relationship. 

  • Develop proposals and budgets for small projects and/or assist in development for moderate-to-major ones.

  • Assist more senior scientific staff in consulting on testing program, R&D management, or other ETS management concerns.

  • Manage small projects, and/or assist in the management of moderate-to-major ones, by accomplishing directed tasks according to schedule and within budget.

  • Develop external professional relationships and work to cultivate a scientist’s identity. 

  • Become a member and regular presenter at the annual meetings of one or more organizations substantively related to the work of ETS. 

 

Experience and Skills

EDUCATION

A Ph.D. in Computer Science, Electrical Engineering, Natural Language Processing, Computational Linguistics, or a similar area with major education in speech technology, and particularly in speech recognition is required.

EXPERIENCE

  • Evidence of independent substantive research experience in spoken language technology and/or development experience for deploying speech technology capabilities is required.

  • One year of independent substantive research experience in spoken language technology and/or development experience for deploying speech technology capabilities is required.

  • Experience may be gained through doctoral studies.

  • Practical expertise with automatic speech recognition systems, experience with machine learning toolkits (e.g., Weka, scikit-learn), and fluency in at least one major programming language (e.g. Java, Python) is required.

  • Practical experience with deep learning paradigms and/or deep-learning-based speech recognition systems (e.g., Kaldi) is highly desirable.

Our strength and success are directly linked to the quality and skills of our diverse associates.  A background and/or knowledge of accessibility and accommodations for individuals with disabilities, whether through your own experiences or those of someone close to you, is highly desirable.

ETS is an Equal Opportunity/Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability, protected veteran status, or other characteristic protected by law.

 
Top

6-5(2017-05-02) PhD at IRISA, Rennes, France

L'équipe Expression de l'IRISA ouvre un poste de doctorant en informatique sur le sujet 'caractérisation de registres de langue par extraction de motifs séquentiels' dans le cadre du projet ANR TREMoLo.

 

Domaines : traitement automatique des langues et fouille de données.

 

Détails de l'offre : http://www.irisa.fr/fr/offres-theses/caracterisation-registres-langue-extraction-motifs-sequentiels

 

Date limite de candidature : vendredi 2 juin.

 

Dossier de candidature (* : éléments obligatoires) :

- CV détaillé*

- lettre de motivation*

- relevés de notes (avec classement si possible)*

- contacts pour recommandation*

- rapport(s) de stage recherche (si applicable).

 

Envoyer à : del.battistelli@gmail.com, nicolas.bechet@irisa.fr, gwenole.lecorve@irisa.fr.

Top

6-6(2017-05-05) Post-doctoral positions in Multimodal Behavior Analysis: Speech, Vision and Healthcare, CMU, Pittsburgh, PA, USA

Post-doctoral positions in Multimodal Behavior Analysis: Speech, Vision and Healthcare

              Carnegie Mellon University, School of Computer Science

 

Multiple post-doctoral positions are available in the School of Computer Science at Carnegie Mellon University. We are seeking creative and energetic applicants for two-year postdoctoral positions. The positions include a competitive salary with full benefits and travel resources.

 

Candidates must have a strong research track record for one or more of the following topics: (1) speech and paralinguistic processing for affect, emotion and human behavior analysis, (2) automatic recognition of facial expressions, gestures and human visual activities, (3) multimodal machine learning algorithms for text, audio and video, (4) technologies to help clinicians with mental health diagnoses and treatments.

 

Required

  • PhD in computer science or related field (at the time of hire)
  • International applicants welcome! No US citizenship requirement.

 

Desired

  • Publications in top machine learning, speech processing and/or computer vision conferences and journals.
  • Research involving clinical patients with mental health disorders (e.g., depression, schizophrenia, suicidal ideation)
  • Experience mentoring graduate and undergraduate students

 

Job details

  • Preferred start date: September 1st, 2017 (negotiable)
  • Candidate will work under the supervision of Dr. Louis-Philippe Morency, CMU MultiComp Lab’s director
  • Competitive salary with full benefits and travel resources.

 

How to apply

  • Email applications should be sent to morency@cs.cmu.edu with the title “Postdoc application”, preferably before June 12th, 2017. The email should include:
    • a brief cover letter (with expected date of availability),
    • a CV including list of publications,
    • contact information of two references,
    • links to three representative publications
Top

6-7(2017-05-10) CDI Ingénieur docteur en informatique ou sciences du langage, LNE, Trappes, France

 

 

Ingénieur docteur en informatique ou sciences du langage

CDI – TRAPPES

 

 

 

Référence:AP/TAI/DE

 

L’entreprise: WWW.LNE.FR

 

Leader dans l’univers de la mesure et des références, jouissant d’une forte notoriété en France et à l’international, le LNE soutient l’innovation industrielle et se positionne comme un acteur important pour une économie plus compétitive et une société plus sûre. Au carrefour de la science et de l’industrie depuis sa création en 1901, le LNE offre son expertise à l’ensemble des acteurs économiques impliqués dans la qualité et la sécurité des produits.

Pilote de la métrologie française, notre recherche est au cœur de notre mission de service public et constitue un facteur fondamental au soutien de la compétitivité des entreprises.

Nous avons à cœur de répondre aux exigences des industriels et du monde académique, pour des mesures toujours plus justes, effectuées dans des conditions de plus en plus extrêmes ou sur des sujets innovants tels que les véhicules autonomes, les nanotechnologies ou la fabrication additive.

 

Le LNE en quelques chiffres: 700 collaborateurs.

5 métiers (la mesure, les essais, la certification, la formation et la R&D).

8 domaines d’intervention (Métrologie, Santé, Bâtiment, Environnement, Energie, Transports, Sécurité et Défense, Biens de consommation).

55 000 m2 de laboratoires (dont 10 000m2 à Paris et 45 000m2 à Trappes).

7 implantations (2 sites en Ile de France, 2 délégations régionales à Poitiers et Nîmes, 1 antenne  à St Etienne, 2 filiales à Washington, Hong Kong).

9000 clients. 

 

 

Missions :



Le docteur sera intégré à une équipe de 4 ingénieur-docteurs qui encadrent différents stagiaires et doctorants. Cette équipe est historiquement spécialiste de l’évaluation des systèmes de traitement de l’information multimédia (transcription de parole, reconnaissance du locuteur, dialogue, traduction…). Elle s’ouvre aujourd’hui à de nouveaux enjeux que sont l’évaluation des systèmes d’intelligence artificielle en général (robotique, smart-grid, domaine de la défense, véhicule autonome…).

 

Le docteur se verra attribuer les missions suivantes :

 

  • Le développement de la R&D en évaluation de systèmes de traitement de la parole et du langage

    • Définition de nouvelles métriques

    • Analyse de corpus

    • Publication de résultats scientifiques

    • Mise en place de protocoles perceptifs

    • Contribution au montage et déroulement de projets de recherche européens et nationaux.

 

  • Animation des campagnes d’évaluation

    • Aide aux équipes participantes pour l’utilisation des outils du LNE

    • Contrôle formel des données

    • Scoring des systèmes

    • Organisation de rencontres scientifiques et industrielles

    • Rédaction des rapports d’évaluation

 

  • Encadrement de stagiaires, post-docs

 

 

Profil :

 

Titulaire d’un doctorat en Informatique ou Sciences du langage, vous avez des compétences en traitement automatique de la langue ou en linguistique de corpus. Vous maitrisez également la programmation (R ou S, C++, PYTHON).

Vous êtes doté de bonnes qualités rédactionnelles et relationnelles. Vous avez une bonne communication orale et vous aimez travailler en collaboration avec votre équipe et les clients.

Vous avez un anglais vous permettant une communication professionnelle.

 

Déplacements en région parisienne, 1 jour par semaine et dans le monde 1 fois par an.

 

 

Pour déposer votre candidature :envoyer CV+LM à recrut@lne.fr – réf AP/TAI/DE

Top

6-8(2017-05-10) Open Rank Research Scientist, Spoken and Multimodal Dialog Systems, ETS, San Francisco, CA, USA

Open Rank Research Scientist, Spoken and Multimodal Dialog Systems

ETS (Educational Testing Service) is a global not for profit organization whose mission is to advance quality and equity in education. With more than 3,400 global employees, we develop, administer and score more than 50 million tests annually in more than 180 countries.

Our San Francisco Research and Development division is seeking a Research Scientist for our Dialog, Multimodal, and Speech (DIAMONDS) research center. The center’s main focus is on foundational research as well as on development of new capabilities to automatically score spoken, interactive, and multimodal test responses in conversational settings in a wide range of ETS test programs, promote learning and other educational areas. This is an excellent opportunity to be part of a world-renowned research and development team and have a significant impact on existing and next generation spoken and multimodal dialog systems and their application to assessment and other areas in education.

Primary responsibilities include:

  • Developing and collaborating on interdisciplinary projects that aim to transfer techniques to a new context or scientific field. Successful candidates are self-motivated and self-driven, and have a strong interest in emerging conversational technology that can contribute to education in assessment and instructional settings.

  • Providing scientific and technical skills to conceptualize, design, obtain support for, conduct, and manage new research projects, grants, or parts of existing projects.

  • Generating or contributing to new or modified methods that support research on and development of spoken and multimodal dialog systems and related technologies relevant in assessment and instructional settings.

  • Designing and conducting scientific studies and functioning as an expert in the major facets of the projects: responding as a subject matter expert in presenting the results of acquired knowledge and experience.

  • Developing or assisting in developing proposals for external and internal research grants and obtain financial support for new or continuing research activities. Prepare initial and final proposal and project budgets.

  • Participating in dissemination activities through the publications of research papers in peer-reviewed journals and in the ETS Research Report series, the issuing of progress and technical reports, the presentation of seminars at major conferences and at ETS, or the use of other appropriate communication vehicles, including patents, books and chapters, that impact practice in the field or at ETS.

Depending on experience this position is open to entry level candidates as well as mid-level and senior level professionals.

REQUIREMENTS FOR A JUNIOR LEVEL POSITION

  • A Doctorate in computer science, linguistics, cognitive psychology or a related field is required. One year of research experience is required, in education is desirable. Experience can be gained through doctoral studies. Candidates should be very skilled in programming and be able to work effectively as a research team member.

 

 

 

REQUIREMENTS FOR A MID-LEVEL POSITION

  • A Doctorate in computer science, linguistics, cognitive psychology, or a related field is required. Research experience in education is desirable. Candidates should be very skilled in programming and be able to work effectively as a research team member. Three years of progressively independent substantive research in the area of computer science, linguistics, cognitive psychology, or education are required.

REQUIREMENTS FOR A SENIOR-LEVEL POSITION

  • A Doctorate in computer science, linguistics, cognitive psychology, or a related field is required. Research experience in education is desirable. Candidates should be very skilled in programming and be able to work effectively as a research team member. Eight years of progressively independent substantive research in the area of computer science, linguistics, cognitive psychology, or education are required.

We offer a competitive salary, comprehensive benefits and excellent opportunities for professional and personal growth. For a full list of position responsibilities and to apply please visit the following link: http://ets.pereless.com/careers/index.cfm?fuseaction=83080.viewjobdetail&CID=83080&JID=235623&BUID=2538

ETS is an Equal Opportunity Employer

Top

6-9(2017-05-10) Research Scientist, Disney Research, Pittsburgh, PA, USA

Position: Research Scientist

Focus Area: Autonomous Agents for Multimodal Character Interaction

Disney Research


Disney Research Pittsburgh is seeking applicants for a Research
Scientist position, at either the junior or senior level, in
Autonomous Agents. The research emphasis is on architecture to support
the integration of natural language with character-based reasoning and
behavior.


As part of The Walt Disney Company, Disney Research builds upon a rich
legacy of innovation and technology leadership in the entertainment
industry that continues to this day. Disney Research was launched in
2008 offering the best attributes of academia and industry with the
goal of driving value across the company through technological
innovation. Our research covers a broad range of exciting and
challenging applications that are experienced daily by millions of
people around the world.


Our staff interacts directly with all core business areas of The Walt
Disney Company including Theme Parks and Imagineering, Consumer
Products, our Live Action and Animation Studios, and Media Networks.
We publish our research and are actively engaged with the global
research community. Our researchers collaborate closely with
co-located academic institutions.


We are seeking applicants in the following areas:

·         Agent architectures for language-based character interaction.

·         AI and machine learning methods for autonomous,
semantically-rich character behavior

Duties:

·         Drive value for Disney through groundbreaking research and innovation

·         Lead a research group with post-doctoral researchers,
interns, and external collaborators

·         Publish results and patent inventions in multimodal interaction

·         Participate in conferences, workshops and academic-industrial events

·         Develop a strong network of business partners within the company

Required Qualifications:

·         Ph.D. in Computer Science or equivalent

·         Proven track record of developing autonomous, integrated
agents with real-time NL components.

·         Experience with both symbolic and statistical machine
learning methods as applied to modeling semantics, action, or behavior

·         Possess strong technical presentation skills and able to
clearly communicate with technical and non-technical audiences

Desired Qualifications:

·         Experience in interaction design for entertainment

·         Background in NLP (e.g., relationship extraction, word sense
disambiguation, narrative generation) desirable



To apply:

Please email careers@disneyresearch.com. Please use DRP-RS-NLP-2017 in
your subject line. If you're interested in the position or for any
further information, please contact Jill Lehman
(jill.lehman@disneyresearch.com).

Top

6-10(2017-05-23) Lead Speech Recognition Engineer, Cambridge, UK

Lead Speech Recognition Engineer

Location: Cambridge, UK

Contact: careers@speechmatics.com

Background

Speechmatics is a leader in automatic speech recognition (ASR). Using proprietary technology, we have built one of the most accurate ASR systems in the world, with a vision to power a voice-enabled economy. We are already working at a time when the global economy is actively adopting all types of speech-related technologies. In developing our technology we combine our years of experience, the latest developments in the field and our own focus on cutting-edge research to produce a world-class service.

In the office, we pride ourselves on a relaxed but productive environment whilst we stay in touch with the progress of others by attending both academic and commercial conferences and have fun together with regular outings (in the past we have been punting, go-karting, attended a cooking workshop and played bubble football...).

We are expanding rapidly and are seeking more people in the coming months to help us keep pushing the boundaries of speech recognition. This is an opportunity to join a high growth team and form a major part of its future direction.

The Opportunity

We are looking for a talented speech scientist to help us build the best speech technology for anybody, anywhere, in any language. You will be a part of a team that is working on our core ASR capabilities to improve our speed and accuracy and develop novel features so that we can support all languages. Your work will feed into ‘Auto-Auto’, our ground-breaking framework to support the building of ASR models, and hence the delivery of every language pack published by the company. You will be responsible for keeping our system the most accurate and useful commercial speech recognition available.

Because you will be joining a small team, you will need to be a team player who thrives in a fast paced environment, with a focus on rapidly moving research developments into products. Bringing skills to the team is as important as a can-do attitude. We strongly encourage versatility and knowledge transfer within the team, so we can share efficiently what needs to be done to meet our commitments to the rest of the company.

Key Responsibilities

  • Ensuring that our speech recognition meets or exceeds that published by others

  • Improving our core modelling (acoustic, pronunciation, language)

  • Leading the extension of our ML framework so that we can build any language



Experience

Essential

  • MSc, PhD or equivalent experience in the academic aspects of speech recognition

  • Several years practical experience in speech recognition, covering all aspects (acoustic, pronunciation and language modelling as well as decoders/search)

  • Experience working with standard speech and ML toolkits, e.g. Kaldi, KenLM, TensorFlow, etc.

  • Solid programming skills with Python and / or C/C++

  • Experience using Unix/Linux for big data

Desirable

  • PhD degree

  • Experience of team leadership and line management

  • Experience of working in an Agile framework

  • Expertise in modern speech recognition, including WFSTs, lattice processing, neural net (RNN / DNN / LSTM), acoustic and language models, Viterbi decoding

  • Comprehensive knowledge of machine learning and statistical modelling

  • Experience in deep machine learning and related toolkits, e.g. Theano, Torch, etc.

  • Deep expertise in Python and/or C++ software development

  • Experience working effectively with software engineering teams or as a Software Engineer

Salary

We offer a competitive salary, bonus scheme, pension contribution matching (up to 5%) and a generous EMI share option scheme. We also have several additional benefits including holiday purchase, massages, fully stocked beer fridge, Cyclescheme, fruit boxes and many more.

The overall package will depend on your motivations and level of experience.

Top

6-11(2017-05-23) Software Development Engineer, Cambridge, UK

Software Development Engineer

Location: Cambridge, UK

Contact: careers@speechmatics.com

Background

Speechmatics is a leader in automatic speech recognition (ASR). Using proprietary technology, we have built one of the most accurate ASR systems in the world, with a vision to power a voice-enabled economy. We are already working in the world at a time when the global economy is actively adopting all types of speech-related technologies. In developing our technology we combine our years of experience, the latest developments in the field and our own focus on cutting-edge research to produce a world-class service.

In the office, we pride ourselves on a relaxed but productive environment whilst we stay in touch with the field by attending both academic and commercial conferences and have fun together with regular team events (in the past we have been punting, go-karting, attended a cooking workshop and played bubble football...).

We are expanding rapidly and are seeking more people in the coming months to help us keep pushing the boundaries of speech recognition. This is an opportunity to join a high growth team and form a major part of its future direction.

The Opportunity

You will be joining the ‘Languages’ team within Speechmatics, focussing on two key goals. We maintain and develop Auto-Auto, our ground-breaking framework to support the building of languages for use in ASR. And we use it to build new language models.

We are looking for an experienced Software Development Engineer to join us. As a member of the team, you will be working on the development, maintenance and expansion of our pipeline, and participating in building and solving the challenges of a growing language portfolio. You will have significant influence on implementing or integrating new features, drive the system architecture, and spearhead the best practices that enable a quality product.

Auto-Auto is core to our business and by working on it you will have a chance to build something that will be used in businesses and homes worldwide. Working in a rapidly growing start-up also means opportunities to contribute to other projects, depending on the candidate’s background and skills.

If you are a talented, detail-oriented engineer with a solid software development foundation and a commitment to deliver the best possible technology solutions, then we want to hear from you!

Key Responsibilities

  • Delivering high quality, maintainable and robust code on time, as part of a team.

  • Executing projects and developing against an outlined design.

  • Developing pragmatic solutions and building flexible systems without over-engineering.

  • Involvement at all stages of the software development cycle, including designing and developing new architectural systems and improvements, and QA processes.

  • Participation in estimation and sprint planning in an agile environment.

  • Participation in delivering new language models for the ASR engine.

  • Working closely with other technical teams and product team to deliver on the company’s technical vision.

Experience

Essential

  • Bachelor's Degree in Computer Science or related field.

  • Professional experience in software development.

  • Computer Science fundamentals in object-oriented design, data structures, algorithm design, problem solving, and complexity analysis.

  • Knowledge of professional software engineering practices & best practices for the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations.

  • Excellent Python skills.

  • Good Linux skills.

  • Experience of working within a team to deliver and run high quality systems.

Desirable

  • Master's degree in Computer Science or related field.

  • Demonstrable professional experience in software development.

  • Proficiency in C and C++ (ideally with strong STL and Boost experience).

  • Strong skills and experience in cloud-based software development, preferably AWS:

    • Working with distributed and/or clustered systems.

    • Building and running horizontally scaling architectures.

    • Using cloud-based queueing, messaging, monitoring and storage techniques.

  • Experience in flow-based programming.

  • Familiarity with statistical models and data mining algorithms.

  • Analytical with a data-driven approach to making decisions and attention to detail.

  • Previous experience with Natural Language Processing techniques.

  • Comfortable collaborating with teams with very different technical skills, and non-technical teams.

Salary

We offer a competitive salary, bonus scheme, pension contribution matching (up to 5%) and a generous EMI share option scheme. We also have several additional benefits including holiday purchase, massages, fully stocked beer fridge, Cyclescheme, fruit boxes and many more.

Top

6-12(2017-05-29) PhD & Post-Doc Research positions in Speech Signal Processing and Electronic Design, Autonomous University of Zacatecas, Zacatecas, Mexico

PhD & Post-Doc Research positions in Speech Signal Processing and Electronic Design


Place:        Autonomous University of Zacatecas, Zacatecas, Mexico

Duration:  PhD (3 years) / Post-Doc (1 year)

Start:          PhD / Post-Doc (January 10th, 2018)

Benefits: - Economical support according to experience - Health insurance from the Mexican Social Security Institute - Round-trip international airfare at the beginning and end.


Position description: Department of Signal Processing and Acoustics, Autonomous University of Zacatecas, is looking for candidates for: fully-funded PhD and Post-doc positions in Signal Processing,  Filtering  Design, Embedded Systems, Speech Recognition and Synthesis. The signal processing group (led by  Dr. Hamurabi GamboaRosales) at Autonomous University of Zacatecas works on algorithm designing, signal processing, electronic design, machine learning, probabilistic modeling in speech recognition and synthesis. The group belongs also to the National Laboratory in embedded systems, advanced electronic design and Microsystems. We are looking for outstanding candidates to join our research group as PhD students and Post-doc researchers to work on any of our research themes, for example: • Digital Signal processing • Optimal Filtering • FPGA’s • Microsystems • large-vocabulary speech recognition and text- to-speech synthesis • ASR in noisy environments


Candidate Profile for PhD / Post-Doc:    The candidate will have:  Master’s / PhD degree, as required by the program for which is requested, in digital signal processing, electronic design, speech signal processing, acoustics, machine learning, computer science, electrical engineering, psychology or a related discipline.  Background in signal processing or electronic design (FPGAs).  Good programming skills in Java, C/C++, Python or Matlab.


Contact: Interested applicants can contact PhD Hamurabi Gamboa-Rosales for more information or directly email a candidacy letter including Curriculum Vitae, a list of publications and a statement of research interests. Email: hamurabigr@uaz.edu.mx  ; hamurabigr@hotmail.com  Telephone MX: +52 (1) 492-121-6787

Top

6-13(2017-06-01) Appel à chercheurs 2017-2018 à l'INA Paris France

Appel à chercheurs 2017-2018

Nouveaux dispositifs de soutien à la recherche à l’Ina :

Chercheurs associés et bourses de recherche


Afin d’encourager le développement de travaux scientifiques menés à partir de ses fonds et des outils d’analyse qu’il développe, l'Ina a décidé de créer en 2017 deux nouveaux dispositifs de soutien à la recherche et à la valorisation scientifique de ses collections :

  • l’octroi d’un statut de chercheur associé à l’Ina
  • l’attribution de bourses de recherche

 

Par ces dispositifs, l’Ina entend accompagner des doctorants et des chercheurs dans la réalisation de projets de recherche originaux et innovants portant sur (ou faisant appel à) ses collections, ou portant sur l’analyse ou le traitement des images et/ou des sons et/ou de données associées.

 

L’Institut offre aux chercheurs sélectionnés un accueil privilégié, assorti de divers soutiens matériels.

Ces nouveaux dispositifs sont complémentaires des prix de l’Inathèque créés en 1997, et ajoutent un nouveau volet à la politique scientifique de l’Institut.

 

Le règlement de l’appel est disponible sur le site de l’Inathèque : http://www.inatheque.fr/actualites/2017/mai-2017/appel-chercheurs-2017-2018.html

Top

6-14(2017-06-023) Vacataires à la police technique et scientifique, Ecully (Lyon), France

Le service audio de la Police Technique et Scientifique (Ecully, près de Lyon, France) recherche des vacataires pour effectuer un travail de segmentation et de correction d'alignement automatique dans le cadre d'études phonétiques.
Le profil suivant est recherché:

- un intérêt pour la linguistique ou pour les langues
- une bonne maitrise de l'informatique et des nouvelles technologiques
- une connaissance du logiciel Praat sera appréciée
Les vacations peuvent commencer dès que possible et peuvent se poursuivre jusqu'en octobre. 
Pour plus d'informations, merci d'envoyer un mail à l'adresse suivante ptsvox@gmail.com, avec vos coordonnées.
Top

6-15(2017-06-06) Post-doctoral Research Associate in Advanced Deep Neural network Architectures for ASR , Univ. of Crete, Greece

Department of Computer Science, University of Crete, Greece
Post-doctoral Research Associate in Advanced Deep Neural network Architectures for ASR  (Fixed Term)
 
SALARY: €24000-€28000 per year
CLOSING DATE: 30 June 2017
REFERENCE: ASR1
TO APPLY: Send detailed CV, a motivation letter and 3 major publications to yannis@csd.uoc.gr
 
In the past few years, Deep Neural Networks (DNNs) have achieved tremendous success for many supervised machine learning tasks, including acoustic modelling for Automatic Speech Recognition (ASR). Advanced models such as Convolutional Neural Networks (CNNs) and Long Short Term Recurrent Neural Networks (LSTMs) have contributed to recent empirical breakthroughs. Network depth has played perhaps the most important role in these successes. However, increased depth represents challenges in the optimization of the network and despite the efforts to overcome these challenges some of the optimization issues are still important resistant. Advanced networks such as highway networks and (wide) residual networks seems to offer solutions to these issues.
This position represents an ideal opportunity to work in or move into advanced deep neural networks, as it will involve collaborating widely across academia and industry, and working on one of the most pressing research areas of machine learning for the development of robust ASR systems.
Based in Heraklion Crete the post will be with Prof. Yannis Stylianou and Dr. Vassilis Tsiaras as part of the speech processing group within the Department of Computer Science at the University of Crete. You will explore a rich set of network architectures and thoroughly examine how several different aspects affect the accuracy of ASR. The work will be performed within the framework of advanced deep neural network architectures for various signal processing tasks including 1D and 2D signals. The focus of the post will be to perform various experiments with well-known architectures, explore and suggest modifications, process and reshape knowledge from various signal processing/classification tasks towards speech processing for the purpose of ASR. Outcomes will directly feed into improvements of ASR systems in-house working with state-of-the art ASR tasks (i.e., CHiME4, REVERB, etc) and of our industrial partners using real-life data.
The post involves travel to international conferences and project meetings with our academic and industrial partners. There will be the possibility to co-advise doctoral students and potentially other teaching opportunities.
Applicants should have a doctorate in speech signal processing area for ASR, computer science, applied mathematics or related field and ideally a strong background in deep learning and mathematics. Knowledge of deep learning systems such as Tensorflow or Theano etc and ASR systems like Kaldi are an advantage. Proficiency in computer programming in C and/or Python are expected.
Informal inquiries should be directed to Prof. Yannis Stylianou by email, yannis@csd.uoc.gr
Fixed term: In the first instance, the funding supporting the post is for two years. We are expecting project extension which will provide funding for a further 7-12 months for this post.
Interviews are expected to take place the week commencing 10th July 2017. Expected start date: September 2017, however earlier and later start dates will be considered.
 
To apply, please send detailed CV, a motivation letter and 3 major publications of yours to: yannis@csd.uoc.gr (Prof. Yannis Stylianou)

Top

6-16(2017-06-06) Post-doctoral Research Associate in Data Augmentation in the context of Deep Neural network ASR, Univ.of Crete, Greece

 

Department of Computer Science, University of Crete, Greece

Post-doctoral Research Associate in Data Augmentation in the context of Deep Neural network ASR

(Fixed Term)

SALARY: €24000-€28000 per year

CLOSING DATE: 30 June 2017

REFERENCE: ASR2

TO APPLY: Send detailed CV, a motivation letter and 3 major publications to yannis@csd.uoc.gr

In the past few years, Deep Neural Networks (DNNs) have achieved tremendous success for many supervised machine learning tasks, including acoustic modelling for Automatic Speech Recognition (ASR). Advanced models such as Convolutional Neural Networks (CNNs) and Long Short Term Recurrent Neural Networks (LSTMs) have contributed to recent empirical breakthroughs. However, deep learning methods are quite demanding in the amount of data for training an acoustic model for ASR and as a result significant amounts of transcribed data has become available for training use. But data transcription is a quite expensive and time consuming process. On the other hand, just adding data recorded in real-world conditions puts serious constraints on the efficient training of the acoustic models. Various works on data augmentation show that word error rate (WER) can be significantly reduced if proper augmented data are processed.

This position represents an ideal opportunity to work in or move into data augmentation research area in the context of advanced deep neural networks for ASR, as it will involve collaborating widely across academia and industry, and working on one of the most pressing research areas of machine learning for the development of robust ASR systems.

Based in Heraklion Crete the post will be with Prof. Yannis Stylianou and Dr. George Kafentzis as part of the speech processing group within the Department of Computer Science at the University of Crete. You will design and develop smart approaches for spoken data augmentation for the purpose of multi-condition training of deep learning-based ASR systems. The work will be performed within the framework of advanced deep neural network architectures for various ASR tasks. The focus of the post will be to perform various experiments with spoken data generation, explore and suggest modifications, process and reshape knowledge from various signal processing for the purpose of ASR. Outcomes will directly feed into improvements of ASR systems in-house working with state-of-the art ASR tasks (i.e., AURORA-4, CHiME4, REVERB, etc) and of our industrial partners using real-life data.

The post involves travel to international conferences and project meetings with our academic and industrial partners. There will be the possibility to co-advise doctoral students and potentially other teaching opportunities.

Applicants should have a doctorate in speech signal processing area for ASR, statistical speech synthesis and voice conversion, audio signal processing, computer science, applied mathematics or related field and ideally a strong background in deep learning and mathematics. Knowledge of deep

learning systems such as Tensorflow or Theano etc and ASR systems like Kaldi are an advantage. Proficiency in computer programming in C and/or Python are expected.

Informal inquiries should be directed to Prof. Yannis Stylianou by email, yannis@csd.uoc.gr

Fixed term: In the first instance, the funding supporting the post is for two years. We are expecting project extension which will provide funding for a further 7-12 months for this post.

Interviews are expected to take place the week commencing 10th July 2017.

Expected start date: September 2017, however earlier and later start dates will be considered.

To apply, please send detailed CV, a motivation letter and 3 major publications of yours to: yannis@csd.uoc.gr (Prof. Yannis Stylianou)

Top

6-17(2017-06-18) 2 (W/M) researcher positions at IRCAM, Paris, France

Position: 2 (W/M) researcher positions at IRCAM

Starting: September 1st, 2017

Duration: 18 months

Deadline for application: July, 1st, 2017

Offre d’emploi: 2 poste de chercheur (H/F) à l’IRCAM

Démarrage: 1er Septembre 2017

Durée: 18 mois

Date limite pour candidature: 1er Juillet 2017

IRCAM is proposing 2 researcher positions related to the European

H2020 Project ABC_Dj and Future Pulse.

The goal of the first position is to develop robust algorithms for

automatic melody extraction (AME) and use them to characterize the

dominant melodic profiles.

The goal of the second position is to develop robust machine

learning algorithms for the automatic recognition of music style/mood

in live performances.

Candidates should have

High Skill in audio signal processing (spectral analysis, audiofeature

extraction, parameter estimation) (the candidate

should preferably hold a PHD in this field)

High skill in machine learning (the candidate should preferably

hold a PHD in this field)

High-skill in Matlab/Python programming, skills in C/C++

programming

Good knowledge of Linux, Windows, Mac-OS environments

High productivity, methodical works, excellent programming

style.

The hired Researchers will also collaborate with the development

team and participate in the project activities (evaluation of technologies, meetings,

specifications, reports)

 

Introduction to IRCAM:

IRCAM is a leading non-profit organization associated to Centre

Pompidou, dedicated to music production, R&D and education in

sound and music technologies. It hosts composers, researchers and

students from many countries cooperating in contemporary music

production, scientific and applied research. The main topics

addressed in its R&D department include acoustics, audio signal

processing, computer music, interaction technologies and

musicology. Ircam is located in the centre of Paris near the Centre

Pompidou, at 1, Place Igor Stravinsky 75004 Paris.

Salary:

According to background and experience

Applications:

Please send an application letter with the reference 201706RES

together with your resume and any suitable information addressing

the above issues preferably by email to: peeters at ircam dot fr with

cc to vinet at ircam dot fr, roebel at Ircam dot fr.

 

Top

6-18(2017-06-18) Proposition de thèse CIFRE en Informatique, traitement automatique du langage naturel

Proposition de thèse CIFRE en Informatique, traitement automatique du langage naturel

Société Calystene et Laboratoire d'Informatique de Grenoble

Calystene propose une thèse CIFRE dans le cadre d'une collaboration avec l'équipe GETALP du laboratoire d'Informatique de Grenoble.

Début de la thèse : Dès que possible.
Localisation : le poste est basé à Grenoble - Eybens
Le salaire est de 30 000 keuros brut annuel environ.

Description du sujet :

Avec l'évolution des systèmes d'information des établissements de santé, les praticiens sont incités à saisir de plus en plus d'informations de manière numérique à travers différents logiciels spécialisés. La prise en main de ces logiciels implique toujours une étape de formation et d?adaptation qui est problématique pour certains clients de Calystene. Ainsi, Calystene souhaite rendre naturelle la saisie de prescription médicamenteuse sur son logiciel FUTURA SMART DESIGN® pour les praticiens, notamment ceux extérieurs aux établissements de santé. En effet, ces intervenants, précieux mais ponctuels, n'ont pas le temps de se former aux interfaces des logiciels et n'ont pas d'accès immédiat à un poste de travail (cas notamment des médecins de ville qui interviennent en maison de retraite). Des prescriptions sont donc régulièrement rédigées sur papier et n'entrent pas dans le système d'information, ce qui est dommageable pour la gestion des soins.

Pour répondre à ce problème, ce doctorat vise à définir des modèles d'interaction en langage naturel et leur mise en ?uvre dans la plateforme FUTURA SMART DESIGN® de Calystene, afin de proposer une saisie naturelle d'informations par écrit et/ou orale de prescriptions médicales sur un terminal mobile et pour la résolution intuitive et intelligente d'erreurs ou d'information manquante. Le prototype, baptisé « Intelligent Prescription Completion » permettra de se rapprocher le plus possible du langage de prescription des médecins. Les lignes d?ordonnance seront saisies soit par dictée vocale, reconnaissance d?écriture sur tablette ou saisie classique via le clavier. L?ordonnance sera saisie en langage quasi-naturel (le langage des prescriptions médicales) et la structuration et validation des données (nécessaire aux contrôles et à la gestion) sera réalisée en temps réel par un algorithme développé par Calystene.

Pour atteindre ces objectifs, la solution s?appuiera sur une approche de type dialogue [1] où la sémantique de l'information est d'abord extraite de l?énoncé [2] --- qu'il soit écrit ou acquis à partir de la parole [3] --- puis analysée par la logique métier afin de valider, suggérer des modifications ou provoquer des demandes de complément d?information à la manière d'un chatbot. La thématique du doctorat se situera donc dans le domaine de la compréhension automatique du langage naturel et de la parole et du raisonnement automatique avec un fort aspect prototypage et validation. Le/la candidat(e) sera encouragé(e) à publier ses progrès dans les grandes conférences du domaine en TALN (ACL,Interspeech) et en IA appliquée à la médecine (AIME, AMIA).

[1] S. Young, M. Ga?i?, S. Keizer, F. Mairesse, J. Schatzmann, B. Thomson, and K. Yu, ?The hidden information state model: A practical framework for pomdp-based spoken dialogue management,? Computer Speech & Language, vol. 24, no. 2, pp. 150?174, 2010
[2] Xu H, Stenner SP, Doan S, Johnson KB, Waitman LR, Denny JC. MedEx: a medication information extraction system for clinical narratives. Journal of the American Medical Informatics Association?: JAMIA. 2010;17(1):19-24.
[3] M. Vacher, S. Caffiau, F. Portet, B. Meillon, C. Roux, E. Elias, B. Lecouteux, P. Chahuara. Evaluation of a context-aware voice interface for Ambient Assisted Living: qualitative user study vs. quantitative system evaluation ACM - Transactions on Speech and Language Processing, Association for Computing Machinery, 2015, Special Issue on Speech and Language Processing for AT (Part 3), 7 (issue 2), pp.5:1-5:36.

Profil du candidat recherché :
Possédant ou terminant un Master 2 Recherche en informatique ou en TALN, vous souhaitez préparer un doctorat CIFRE en entreprise en liaison avec un laboratoire de recherche. Vous êtes passionné par les technologies de l?information et de la communication, vous avez de bonnes connaissances en développement Android/iPhone. Vous avez une formation et une expérience dans l'étude et/ou le développement de traitement automatique du langage naturel. Des connaissances en apprentissage automatique et en acquisition de corpus seraient un plus.

Merci d'envoyer un CV + une lettre de motivation + lettres de recommandation à Jean-Marc BABOUCHKINE (jm.babouchkine@calystene.com) et François Portet (francois.portet@imag.fr)

Calystene : http://www.calystene.com/
Getalp : http://lig-getalp.imag.fr/

Top

6-19(2017-06-19) PhD position in Computational Linguistics for Ambient Intelligence, Grenoble, France

Keywords: Natural language understanding, decision support system, smart home

The Laboratoire d'Informatique de Grenoble (LIG) of the University Grenoble Alpes, Grenoble, France invites applications for a PhD position in Computational Linguistics for Ambient Intelligence.

University of Grenoble Alpes is situated in a high-tech city located at the heart of the Alps, in outstanding scientific and natural surroundings. It is 3h by train from Paris ; 2h from Geneva and is less than 1h from Lyon international airport.

The position starts in September 2017 and ends in July 2020 and is proposed in the context of the national project Vocadom (http://vocadom.imag.fr/) whose aim is to build technologies that make natural hand-free speech interaction with a home automation system possible from anywhere in the home even in adverse conditions [Vacher2015].

The aim of the PhD will be to build a new generation of situated spoken human machine interaction where uttered sentences by a human are understood within the context of the interaction in the home. The targeted application is a distant speech hand free and ubiquitous voice user interface to make the home automation system react to voice commands [Chahuara2017]. The system should be able to process possibly erroneous outputs from an ASR system (Automatic Speech Recognition) to extract meaning related to a voice command and to decide about which command to execute or to send a relevant feed-back to the user. The challenge will be to constantly adapt the system to new lexical phrases (no a priori grammar), new situations (e.g., unseen user, context) and change in the house (e.g., new device, device out of order). In this work, we propose to extend classical S/NLU (Natural Language Understanding) approaches by including non-linguistic contextual information in the NLU process to tackle the ambiguity and borrow zero-shot learning techniques [Ferreira2015] to extend the lexical space on-line. Reinforcement learning is targeted to adapt the models to the user(s) all along the use of the system [Mnih2015].
The candidate will be strongly encouraged to publish their progress to the main events of the field (ACL, Interspeech, Ubicomp).
The PhD candidate will also be involved in experiments including real smart-home and real users (elderly people and people with visual impairment) [Vacher2015].


REFERENCES :
[Mnih2015] Mnih, Kavukcuoglu et al.  Human-level control through deep reinforcement learning. Nature 518,529?533
[Chahuara2017] Chahuara, F. Portet, M. Vacher Context-aware decision making under uncertainty for voice-based control of smart home Expert Systems with Applications, Elsevier, 2017, 75, pp.63-79.
[Ferreira2015] E Ferreira, B Jabaian, F Lefevre Online adaptative zero-shot learning spoken language understanding using word-embedding Acoustics, Speech and Signal Processing (ICASSP), 2015
[Vacher2015] M. Vacher, S. Caffiau, F. Portet, B. Meillon, C. Roux, E. Elias, B. Lecouteux, P. Chahuara. Evaluation of a context-aware voice interface for Ambient Assisted Living: qualitative user study vs. quantitative system evaluation. ACM - Transactions on Speech and Language Processing, Association for Computing Machinery, 2015, pp.5:1-5:36.

JOB REQUIREMENTS AND QUALIFICATIONS

- Master?s degree in Computational Linguistics or Artificial Intelligence (Computer Science can also be considered)
- Solid programming skills,
- Good background in machine learning,
- Excellent English communication and writing skills,
- Good command of French (mandatory),
- Experience in experimentation involving human participants would be a plus
- Experience in dialogue systems would be a plus plus

Applications should include:

- Cover letter outlining interest in the position
- Names of two referees
- Curriculum Vitae (CV) (with publications if applicable)
- Copy of the university marks (grade list)

and be sent to michel.vacher@imag.fr and francois.portet@imag.fr



Research Group Website : http://getalp.imag.fr
Research project website : http://vocadom.imag.fr/
Top

6-20(2017-06-20) Full-time post-Doctoral researcher position at LORIA Nancy, France

Loria a computer science lab in Nancy - France, has 12 months funded full-time post-Doctoral researcher position starting on October 2017.
The post-doctoral position is funded by AMIS (Access Multilingual Information OpinionS), a Chist-Era project (http://deustotechlife.deusto.es/amis/).

The topic of the post-doc is the automatic comparison of multilingual opinions in videos. Two videos in two different languages concerning the
same topic have to be compared. One of the videos is summarized and translated to the language of the second one. This last one is summarized
and then the opinions of the two original videos are compared in terms of emotional labels such as: anger, disgust, fear, joy, sadness, surprise...
They should be compared also in terms of basic sentiments.

 

Social network will be used in order to reinforce the analysis of the contents in terms of opinions and sentiments.

 

AMIS group will make available the summary of videos in terms of text. The candidate will work on NLP, but skills in video analysis will
be appreciated.

 

The applicant will contribute also to other tasks in collaboration with other partners of AMIS project.

 

The successful candidate will join the SMarT research team, he will be supervised by Prof. Kamel Smaïli, Dr D. Langlois and Dr D. Jouvet.
The applicant will work also with Dr O. Mella and Dr D. Fohr.

 

Location: Loria - Nancy (France) 
Duration: October 2017 ? September 2018 
Net salary: from 1800 Euros to 2400 Euros per month.

The ideal applicant should have: 

 

* A PhD in NLP, opinion and sentiment mining or other strongly related discipline.
* A very solid background in statistical machine learning. 
* Strong publications. 
* Solid programming skills to conduct experiments. 
* Excellent level in English. 

Applicants should send to smaili@loria.fr: 

* A CV
* A cover letter outlining the motivation
* Three most representative papers 

 

 

 

Delete | Reply | Reply to List | Reply to All | Forward | Redirect | View Thread | Blacklist | Whitelist | Message Source | Save as | Print
Move | Copy
Top

6-21(2017-06-20) Thèse CIFRE at Orange Labs, Lannion, France
Orange Labs propose une thèse CIFRE en Informatique en lien avec les domaines suivants : Apprentissage automatique, Prédiction structurée et Traitement automatique du langage naturel.
 
Cette thèse se place dans le cadre d'une collaboration avec l?équipe Expression de l?IRISA à Lannion. 
 
La description du sujet est disponible à cet emplacement :
 
 
Profil des candidats : Les candidat(e)s doivent être titulaire d'un Master recherche en informatique, statistique, traitement du signal. Les formations mixtes mathématique/informatique sont privilégiées. 
Excellent niveau en anglais requis.
Date limite de candidature : 31 juil. 2017
Localisation : Lannion
Top

6-22(2017-06-25) Two positions are available for internships at FBK, Trento, Italy
Two positions are available for internships at FBK, Trento, Italy

Title: Deep machine learning for speaker diarization
Duration: Jan 1 - Oct 31, 2018
Url:https://hr.fbk.eu/en/jobs


Title: DNN adaptation for acoustic modeling in speech recognition
Duration: Jan 1 - Oct 31, 2018
Url:https://hr.fbk.eu/en/jobs
 
Application deadline 15th of september 2017
---------
 
Top

6-23(2017-06-26) Language Resources Project Manager -Junior position at ELDA Paris France

The European Language resources Distribution Agency (ELDA), a company specialized in Human Language Technologies within an international context is currently seeking to fill an immediate vacancy for a Language Resources Project Manager ? Junior position. This yields excellent opportunities for young, creative, and motivated candidates wishing to participate actively to the Language Engineering field.

Language Resources Project Manager - Junior (m/f)

Under the supervision of the Language Resources Sales Manager, the Language Resources Project Manager ? Junior will be in charge of the identification of Language Resources (LRs) and the negotiation of rights in relation with their distribution.

The position includes, but is not limited to, the responsibility of the following tasks:

  • Identification of LRs and Cataloguing.
  • Negotiation of distribution rights, including interaction with LR providers, drafting of distribution agreements, definition of prices of language resources to be integrated in the ELRA catalogue.
  • LR Packaging and Archiving.
  • Designing and evaluating workflows for IPR clearance in the digital environment.

Profile:

  • Master?s degree or equivalent in Law and Computer science, with an awareness of Intellectual Property Rights and Data Protection issues in the digital environment.
  • The ideal candidate will have experience in computational linguistics, information science, knowledge management or similar fields.
  • Experience in project management and participation in European projects, as well as practice in contract and partnership negotiation at an international level, would be a plus.
  • Dynamic and communicative, flexible to combine and work on different tasks.
  • Ability to work independently and as part of a team.
  • Proficiency in English, with strong writing and documentation skills. Communication skills required in a French-speaking working environment.
  • Citizenship of (or residency papers) a European Union country.

All positions are based in Paris. Applications will be considered until the position is filled.

Salary is commensurate with qualifications and experience.
Applicants should email a cover letter addressing the points listed above together with a curriculum vitae to:

ELDA
9, rue des Cordelières
75013 Paris
FRANCE
Fax : 01 43 13 33 30
Mail: job@elda.org

ELDA is acting as the distribution agency of the European Language Resources Association (ELRA). ELRA was established in February 1995, with the support of the European Commission, to promote the development and exploitation of Language Resources (LRs). Language Resources include all data necessary for language engineering, such as monolingual and multilingual lexica, text corpora, speech databases and terminology. The role of this non-profit membership Association is to promote the production of LRs, to collect and to validate them and, foremost, make them available to users. The association also gathers information on market needs and trends.

For further information about ELDA and ELRA, visit:
http://www.elra.info

Top

6-24(2017-06-27) Thèse de doctorat en traitement de la parole Parkinsonienne à INRIA Bordeaux, France

nceThèse de doctorat en traitement de la parole Parkinsonienne à INRIA Bordeaux

Sujet : Traitement non-linéaire de la parole pour l'analyse et la classification de voix

Parkinsoniennes

Contexte scientifique :

La maladie de Parkinson (MP) est la maladie neurodégénerative la plus répandue après la maladie

d'Alzheimer. Elle touche 1.5% de la population âgée de plus de 65 ans et 143000 français. L'atrophie

multi-systématisée (AMS) est une maladie neurodégénérative rare et sporadique d'évolution

progressive et d’étiologie inconnue. Elle a une prévalence de 2 à 5/100000 et n'a pas de traitement

effectif. AMS appartient au groupe des troubles parkinsoniens atypiques et est responsable d’un

pronostic péjoratif. Dans les premiers stades de la maladie, les symptômes de MP et AMS sont très

similaires, surtout pour AMS-P où le syndrome parkinsonien prédomine. Le diagnostic différentiel

entre AMP-P et MP peut être très difficile dans les stades précoces de la maladie, tandis que la certitude

de diagnostic précoce est important pour le patient en raison du pronostic divergent. En effet, malgré

des efforts récents, aucun marqueur objectif valide n'est actuellement disponible pour guider le

clinicien dans ce diagnostic différentiel. La nécessité de ces marqueurs est donc très élevé dans la

communauté de la neurologie, en particulier compte tenu de la gravité du pronostic de AMS-P.

Les troubles de la parole et de la voix, communément appelés dysarthrie [1,2], dans la maladie de

Parkinson sont un marqueur clinique qui coïncide avec une déficience motrice et l'apparition d'une

déficience cognitive. Comme les patients MP, en fonction des zones du cerveau qui sont endommagées,

les personnes souffrant d'AMS peuvent également avoir des troubles de la parole: difficultés

d'articulation, rythme staccato, voix grinçante ou silencieuse. La dysarthrie dans l'AMS est plus sévère

et plus précoce dans le sens où elle nécessite plus de rééducation précoce par rapport à la MP.

Comme les troubles de la parole sont un symptôme précoce commun aux deux maladies et d'origine

différente. Notre approche consiste à utiliser la dysarthrie, grâce à un traitement numérique des

enregistrements vocaux des patients, comme un vecteur pour l'aide au diagnostic différentiel entre MP

et AMS-P dans les stades précoces de la maladie.

Objectif de la thèse :

Les voix pathologiques, telles que la MP et la MSA, présentent généralement une forte non-linéarité et

une turbulence élevée. Les phénomènes non-linéaires/turbulents ne sont pas naturellement bien décrits

par le traitement linéaire du signal. Ce dernier règle cependant actuellement sur la technologie de la

parole. Ainsi, du point de vue méthodologique, l'objectif de cette thèse est d'étudier la parole

Parkinsonienne dans le cadre des signaux et systèmes non linéaires et turbulents [3] . Ce cadre est en

effet mieux adapté à l'analyse de la gamme des phénomènes non linéaires et turbulents observés dans

les voix pathologiques en général, et dans la voix MP et MSA en particulier. Nous adopterons

notamment une approche basée sur de nouveaux algorithmes d'analyse non-linéaire de la parole

récemment développés dans l'équipe Gestation [4]. L'objectif est d'extraire les caractéristiques de la

parole pertinentes pour concevoir de nouvelles mesures de dysarthrie qui permettent une discrimination

précise entre les voix de MP et de MSA. Cela nécessitera également l'utilisation de méthodes

d'apprentissage statistique (Machine learning) afin de développer des classificateurs robustes (pour

discriminer les voix de MP et MSA) et d'établir des correspondances (régression) entre des mesures de

la parole et les scores cliniques standard quantifiant la sévérité de la maladie.

Les partenaires cliniques de ce projet sont des centres du CHU-Bordeaux et du CHU-Toulouse de

renommée internationale sur MP et AMS. Les partenaires académiques sont l'équipe Samova de l’IRIT,

qui a une grande expertise en traitement (linéaire) de la parole, et l'Institut Mathématique de Toulouse

(IMT) pour les aspects Machine learning.

Le doctorant participera ainsi activement à la collecte de données, en coordination avec les neurologues

et phoniatres du CHU-Bordeaux et CHU-Toulouse. Ces données consisteront en l'enregistrement des

voix des patients à l'aide d'un enregistreur numérique et du dispositif EVA2 (http://www.sqlab.fr/), ainsi

que de signaux électroglottographiques (EGG).

Références:

[1] Freed, D. Motor speech disorders. Thomson Learning Eds. 2000.

[2] Auzou, P.; Rolland, V.; Pinto, S., Ozsancak C. (eds.). Les dysarthries. Editions Solal. 2007.

[3] Kantz, H. and T. Schreiber, Nonlinear time series analysis. 2nd ed. 2004, Cambridge; New York: Cambridge

University Press.

[4] PhD thesis of Vahid Khanagha. GeoStat team, INRIA Bordeaux-Sud Ouest. January 2013.

http://geostat.bordeaux.inria.fr/images/vahid%20khanagha%204737.pdf

Directeur : Dr. Khalid Daoudi, équipe Gestat (khalid.daoudi@inria.fr).

Lieu : INRIA- Bordeaux Sud Ouest (http://www.inria.fr/bordeaux). Bordeaux, France.

Financement : Projet ANR (Voice4PD-MSA)

Début de la thèse : entre Le 1er Octobre et le 31 décembre 2017

Durée : 3 ans

Rémunération : ~1600€ net/mois, incluant la couverture sociale et médicale.

Compétences requises : De très bonnes connaissances en traitement de la parole/signal ainsi qu'en

programmation C++/Python et Matlab sont nécessaires. Des connaissances en apprentissage statistique

(Machine learning) seraient un grand plus.

Les candidatures doivent être adressées à khalid.daoudi@inria.fr

Top

6-25(2017-07-12) Open PhD and postdoc positions at LIMSI - CNRS, Orsay, France
Open PhD and postdoc positions at LIMSI - CNRS, Orsay, France
Automatic enrichment of TV series and movies transcripts
Keywords : natural language processing, speech processing, machine learning, deep learning

The goal of this project is to fully exploit the audio stream to automatically enrich speech transcripts and subtitles of TV series and movies with the name and position of the characters.

speaker A  ? 'Nice to meet you, I am Leonard, and this is Sheldon. We live across the hall.'
speaker B ? 'Oh. Hi. I?m Penny.'

speaker A ? 'Sheldon, what the hell are you doing?'
speaker C ? I am not quite sure yet. Do you know where Howard lives?

Just looking at these two short conversations, a human can easily infer that 'speaker A' is actually 'Leonard', 'speaker B' is Penny and 'speaker C' is Sheldon. The objective of this project is to combine natural language processing, speech processing, and computer vision to do the same automatically.
 
Top

6-26(2017-07-25) Post-doc à Paris et Saclay , France

eurosPost-doc à Paris et Saclay : Apprentissage semi-supervisé et apprentissage profond pour une mesure de l'expérience émotionnelle du client dans les interactions orales


[1] Telecom ParisTech, 46 rue Barrault, 75013 Paris

[2] EDF Lab Paris Saclay, 7 boulevard Gaspard Monge 91120 Palaiseau

Durée :  1 an

Début :  Novembre 2017

Salaire : en fonction de l'expérience et à partir de 2300 euros/mois

       

*Description du sujet de post-doctorat*

 

Avec l'engouement récent du Big Data pour le Feel data, EDF souhaite développer des méthodes d'analyse automatique de l'expérience émotionnelle du client dans ses interactions orales. Les interactions orales considérées sont les données des centres d'appels et données d'interaction avec des assistants virtuels vocaux (de type Alexa, Siri). L'objectif du post-doctorat est de mettre en place desméthodes d'apprentissage semi-supervisées, combinées à de l'apprentissage profond (deep learning) pour l'analyse automatique de l'expérience émotionnelle du client. Les méthodes reposeront sur des caractéristiques (features) linguistiques et acoustiques extraites des interactions orales.

 

Le chercheur post-doctorant prendra part à une collaboration entre EDF Lab et Telecom-ParisTech et son travail de recherche reposera sur les étapes suivantes :

- la mise en place de schéma d?annotation de l?expérience émotionnelle de l?utilisateur sur les données collectées à EDF Lab (données des centres d'appels et données d'interaction avec des assistants virtuels vocaux (de type Alexa, Siri)

- l?extraction d?indices acoustiques caractéristiques des réalisations non verbales de l?expérience émotionnelle de l?utilisateur

- l?extraction d?indices linguistiques caractéristiques des réalisations verbales de l?expérience émotionnelle de l?utilisateur

-  la mise en place de stratégies d?apprentissage permettant de mixer des approches non supervisées et des approches d?apprentissage profond

Le post-doc aura lieu au sein des deux centres de recherche : EDFLab [2] et Telecom-Paristech (département IDS image and signal processing department of Telecom ParisTech [1], équipe S2a, thème Social computing).


* Profil du candidat*

 

Le candidat devra posséder a minima les compétences suivantes :

 

?    un diplôme de doctorat

?   des recherches et publications dans au moins l?une des thématiques suivantes : machine learning, traitement de la parole, affective computing. Traitement automatique des langues

?    d?excellentes compétences en programmation  (avec une préférence pour le python)

?    excellente maîtrise du français et bonne maîtrise de l?anglais

 

Les compétences ci-dessous, seront également un atout :

?        apprentissage semi-supervisé

?        Deep learning

?        analyse acoustique des émotions

 

-- Informations pratiques

Lieu du post-doc: Paris (Telecom-ParisTech) et Palaiseau (EDF Lab Paris-Saclay)

Encadrement : Delphine Lagarde, Aurélie Dano (EDF Lab) et Chloé Clavel (Telecom-ParisTech)

 

-- Comment candidater :

Les candidatures sont à envoyer à Chloé Clavel (chloe.clavel@telecom-paristech.fr)

 

Elles doivent être compilées ** en un seul fichier pdf ** et inclure :

?    Un CV complet et détaillé incluant la liste des publications

?    Une lettre de motivation spécifique au sujet du post-doc

?   Les noms et les adresses de 2 référents.

 

[1]http://www.tsi.telecom-paristech.fr

[2]htttps://www.edf.fr/groupe-edf/qui-sommes-nous/activites/recherche-et-developpement/nos-centres-de-recherche/le-centre-de-recherche-edf-lab-paris-saclay-souffle-sa-premiere-bougie  

Top

6-27(2017-07-26) DSP engineer positions, Proactivaudio, Vienna, Austria
Several  positions related to DSP engineering for speech and audio applications are open at Proactivsaudio  in Vienna,Austria. More details can be found here http://www.proactivaudio.pro/?page_id=2136.


1.Software Engineer (f/m) – Digital Audio 
 
Location Vienna, Austria.

Description: We are looking for a full-time software engineer with experience and passion for software development of audio systems. The successful candidate has the unique opportunity to grow along with the Start-up. The main roles are to port and optimize advanced audio processing algorithms in a variety of platforms and architectures (Windows, macOS, iOS, Android), as well as the development of GUI-based host applications interfacing with heterogeneous audio devices.

Qualifications:
– 3 years of experience in industry or as academic post-graduate.
– Excellent software design/programming skills in C/C++ and Objective-C.
– Solid knowledge of multi-threaded programming and debugging.
– Be able to multi-task and learn new technologies quickly.
– Thorough understanding of real-time digital audio.

Desired Skills:
– Working experience with Audio Units, VST, and Core Audio APIs.
– Experience with WebRTC as committer.

Education Bachelor or (preferrably) Master degree in Software Engineering, Electrical/Computer Engineering, or equivalent.

Conditions
Full-time employment in accordance to the Austrian regulations. A gross annual salary of EUR 39.550 can be expected. Overpayment is possible depending on qualifications and work experience. Workplace has very good connections with public transport.

How to Apply
Send your application by e-mail to contact@proactivaudio.pro with a single attachment (CV and proof documents) in PDF format.

About
proactivaudio is a young Austrian startup with an innovative patented technology for acoustic echo reduction (AER) and noise reduction. Our AER solution is fully operative under any hostile acoustic scenario, such as under permanent double talk, background noise and changes in the acoustic room. Altogether.

2. Software Engineer (f/m) – Embedded DSP
 
 
Location Vienna, Austria.

Description:
We are looking for a full-time software engineer with experience and passion for software development of audio systems. The successful candidate has the unique opportunity to grow along with the Start-up. The main roles are to port and optimize advanced audio processing algorithms in a variety of embedded systems, from Texas Instruments, Analog Devices and ARM Cortex.

Qualifications:
– 3 years of experience in industry or as academic post-graduate.
– Excellent software design/programming skills in C/C++ and SIMD instructions. – Solid knowledge of real-time embedded debugging.
– Be able to multi-task and learn new technologies quickly.
– Good knowledge of digital signal processing.

Desired Skills:
– Working experience with Code Composer Studio.
– Working experience with VisualDSP++ and CrossCore Embedded Studio.

Education Bachelor or (preferrably) Master degree in Software Engineering, Electrical Engineering, or equivalent.

Conditions:
Full-time employment in accordance to the Austrian regulations. A gross annual salary of EUR 39.550 can be expected. Overpayment is possible depending on qualifications and work experience. Workplace has very good connections with public transport.

How to Apply
Send your application by e-mail to contact@proactivaudio.pro with a single attachment (CV and proof documents) in PDF format.

About
proactivaudio is a young Austrian startup with an innovative patented technology for acoustic echo reduction (AER) and noise reduction. Our AER solution is fully operative under any hostile acoustic scenario, such as under permanent double talk, background noise and changes in the acoustic room. Altogether.
Top

6-28(2017-07-28) R&D Engineer/Scientist at VOCAPIA Research, Orsay, France

Vocapia Research is hiring an R&D Engineer/Scientist, to build and improve multilingual language processing systems relying on machine learning techniques. We are seeking candidates with substantial programming experience who are able to carry out careful experimental work, paying attention to details. We are looking for highly-motivated, goal-driven individuals who also desire to be team players.

Main qualifications

  • Master's, PhD or Engineering degree in Computer Science or Electrical Engineering
  • Experience in language processing, speech recognition or machine learning
  • Strong programming skills using C/C++ and scripting languages in a UNIX/Linux environment
  • Good knowledge of written and spoken English. Fluency in one or more other languages is a strong plus (in particular Arabic, Dutch, German, Italian, Japanese, Mandarin, Polish, Portuguese, Romanian, Russian, Spanish and Turkish)

Preference will be given to candidates who have experience in speech and language processing.

Location: Orsay, France (about 25km out of Paris )

To apply please submit your CV by email (including the job reference and your contact information) to recruit(at)vocapia.com or directly fill in the online application form (http://www.vocapia.com/applyforjob.html?VR1706IRD)

PDF or PS file, please.

 
Top

6-29(2017-07-31) Postdoctoral researcher in Cryptography, Intelligent Voice Ltd, London City, UK

Intelligent Voice Ltd, London City, UK
Postdoctoral researcher in Cryptography

In the framework of a H2020 program of the European Commission, Intelligent Voice Ltd is receiving financial support to hire a postdoctoral researcher for one year on the following topic :

The cloud offers an ideal opportunity for storing large volumes of data. However, the storage of sensitive data such as speech in plain text format on the cloud is not permitted in many industry sectors such as finance, health care etc. Hence speech data should be encrypted before storage on the cloud, and because it contains biometric identifiers it must remain encrypted. The challenge then is to search over large amounts of encrypted speech and return encrypted search results that can be decrypted by the user only. Intelligent Voice are providers of the world's fastest speech to text engine, and we are looking for a talented researcher in semantic security and searchable encryption to join our research team. This post builds on existing research within Intelligent Voice on Searchable and Homomorphic cryptographic protocols for speech processing.

Applicants should have already completed, or be close to completing, a PhD in computer science, mathematics, or a related discipline. Applicants should have an excellent research track record demonstrated by publications at major cryptography/security venues, and should have significant experience in the design and deployment of cryptographic protocols.

To apply please send your CV (with publication list), a 1-page cover letter, and the names of at least two people who can provide reference letters (e-mail).

Contact: Gérard Chollet, Head of Research, Intelligent Voice Ltd

St Clare House, 30-33 Minories, London EC3N 1BP

gerard.chollet(at)intelligentvoice.com

Phone: +44 20 3627 2670

More Information: http://www.intelligentvoice.com/https://www.slideshare.net/cholletge/ppsp-icassp17v10-72961572



Closing Date for Applications: 2017-08-31

 

 

Top

6-30(2017-08-03) AI-NLP Scientist at Sparted, France


 
 
AI – NLP Researcher
 
 
 
COMPANY: SPARTED is an innovative and disruptive startup that is changing the way people learn. We offer companies and organizations a unique and scalable game platform for micro learning on mobile devices. www.sparted.com
 
MISSION:  In the context of ambitious and strategic project targeting to automatically generate questions from descriptive texts in a variety of semantical contexts, the mission consists in: • Establishing a state of the art of capabilities of NLP regarding the project • Designing and identifying the main milestones of a long-term research plan working collaboratively with a research lab  • Contributing to the development of proofs of concepts with our team • Spreading Machine Learning knowledge inside the team and train our engineers to Natural Language Processing specificities • Designing and administrating flexible, scalable datasets • Contributing to the company vision
 
REQUIRED SKILLS: • Depth and breadth of knowledge in Machine Learning and NLP • Experience solving analytical problems using quantitative approaches • Ability to manipulate and analyze data from varying sources • Knowledge in some programming language
PROFILE: You have or you’re about to defend a PhD in Machine learning, AI, Applied Mathematics, Data Science, Statistics , Computer Science or related technical field with a strong focus on Natural
Language Processing and related technologies such as pattern recognition, sequence-to-sequence models, word2vec, etc. You like taking up challenges, teamwork and building awesome products.
 
BONUS SKILLS THAT WOULD BE GREAT: • French language  • Experience in Text Mining • Beer pong master • Demonstrated experience with Natural Language Technologies and engines  • Familiarity with Semantic Technologies • White Russian expertise • Extreme skiing or Wingsuit practice
 
REMUNERATION: Depending on your status, talent and experience (€ 30K - xxK) Internship, PhD project (CIFRE) or other
 
LOCATION: The position is based in Paris, avenue Kléber, next to the Place de l’Etoile, in our so cool top floor premises with an exceptional view on Paris and the Eiffel tower.
 
JOIN THE FUN DISRUPTION: SPARTED is at war. The fun rebellion is running against the empire of boredom. SPARTED is strengthening, growing and completing its forces by recruiting superheroes, in addition to an exceptional 20 people team of world-class best developers, project managers, designers and other performers.
 
Fun disruptors are men and women, enthusiastic personalities involved in the future, smart cookies and bold people, wanting to continuously evolve, improve themselves and change the world. If you’re made of the right stuff, this position is the opportunity to join a daring project in a successful start-up with a strong identity and mindset, with great career prospects.
 
APPLICATION: Say us hello at start@sparted.com +33 6 52 14 86 9

Top

6-31(2017-08-10) 3 job offers at Laboratoire de Phonétique et Phonologie, Paris, France (1phD and 2 postdocs)

Three job offers at Laboratoire de Phonétique et Phonologie, Paris, France (1phD and 2 postdocs)

 

In the context of the MoSpeeDi project, funded by the FNS and run in collaboration with the University of Geneva, the HUG and IDIAP,  the Laboratoire de Phonétique et Phonologie* (LPP ? CNRS/Sorbonne Nouvelle) in Paris is offering 3 positions (1 phD and 2 postdocs) starting November 2017 or later.

 

The general goal of the project is to better understand the processes and representations at play during speech production, focusing on the latest stages of the process where an encoded linguistic message is transformed into articulated speech. At the interface between linguistic and motor processes, these stages are also associated with different breakdowns considered as Speech Motor Disorders (dysarthria and apraxia of speech).

 

Articulatory, acoustic and speech behaviour data will be experimentally collected and analysed for both healthy speakers and speakers with speech motor disorders in order to (a) improve the characterization of phonetic speech planning and motor speech programming, (b) to identify phonetic and speech behavior markers of these processes, and (c) to better isolate and classify speech alterations in speech motor disorders. 

 

Within this Swiss-French collaborative project, the LPP is recruiting 1 doctoral student and 2 post-docs to join the team under the supervision of Cécile Fougeron (DR CNRS) and in close collaboration with Simone Falk (MC Sorbonne Nouvelle), and Leonardo Lancia (CR CNRS). See job description and application procedure below.

 

 

Contact: cecile.fougeron@univ-paris3.fr

 

 

 

* The Laboratoire de Phonétique et Phonologie (LPP) is a joint CNRS and Université Sorbonne Nouvelle structure.  It is located in the center of Paris (19 rue des Bernardins, 7500 Paris).

More information on http://lpp.in2p3.fr/Presentation-du-LPP

Top

6-32(2017-08-16) 2 Post-Doc/Research positions in Audio Music Content Analysis at IRCAM, Paris, France

2 Post-Doc/Research positions in Audio Music Content Analysis
Project: 'Dig that Lick: Analyzing large-scale data for melodic patterns in jazz performances'
Place: IRCAM & L2S, Paris, France
Duration: 2 years and 1 year
Start: October 1st, 2017
Salary: according to background and experience


IRCAM ( www.ircam.fr ) and L2S at University Paris Saclay ( www.l2s.centralesupelec.fr ) are jointly offering two new PostDoc positions for our current international project Dig that Lick: Analyzing large-scale data for melodic patterns in jazz performances ( dig-that-lick.eecs.qmul.ac.uk ). The project gathers six different universities across four countries (USA, UK, Germany, France).

The planned dates for the project are: 1 Oct 2017 - 30 Sep 2019.

* About the 'Dig that Lick' project: *
The recorded legacy of jazz spans a century and provides a vast corpus of data documenting its development. Recent advances in digital signal processing and data analysis technologies enable automatic recognition of musical structures and their linkage through metadata to historical and social context. Automatic metadata extraction and aggregation give unprecedented access to large collections, fostering new interdisciplinary research opportunities. This project aims to develop innovative technological and music-analytical methods to gain fresh insight into jazz history by bringing together renowned scholars and results from several high-profile projects. Musicologists and computer scientists will together create a deeper and more comprehensive understanding of jazz in its social and cultural context. We exemplify our methods via a full cycle of analysis of melodic patterns, or licks, from audio recordings to an aesthetically contextualised and historically situated understanding.
More information can be found at: dig-that-lick.eecs.qmul.ac.uk


* Position description: *
Dig that Lick relies on audio music content analysis algorithms. IRCAM & L2S are looking for 2 Post-Doctoral Research assistants (starting Oct. 1st, 2017) for the development of MIR algorithms:
? one 24-month post-doctoral position to develop robust algorithms for automatic melody extraction (AME) from audio signals
? one 12-month post-doctoral position to develop robust algorithms for automatic analysis of harmonic and metrical structure from audio signals

* Required profiles: *
? High skills in audio signal processing and in machine learning (the candidate should preferably hold a PHD in one of these fields)
? High skills in Matlab and/or Python programming
? Good knowledge of Linux, Windows, Mac-OS environments
? High productivity, methodical works, ability to work independently and in a creative way, excellent programming style.
? Strong communication skills in English (French is not required)
? Interest/background in Jazz music will be appreciated

The hired researchers will collaborate with the development team and participate in the project activities (evaluation of technologies, meetings, specifications, reports). Strong interactions with teams from the other participating universities are expected.

* Application Procedure / More information: *
Applications must include a detailed CV with a list of publications and a motivation letter. Applications and any suitable information are to be emailed to: H. Papadopoulos [ircam.l2s.digthatlick@gmail.com] and G. Peeters [peeters at ircam dot fr]





Top

6-33(2017-08_29) Several post-doctoral positions at IDIAP, Martigny, Switzerland

Amongst several openings at Idiap in general, we have *three* post-doctoral positions in
the areas of multilingual speech processing and cross-lingual indexing:
 http://www.idiap.ch/en/join-us/job-opportunities
They are funded by EU and US projects, so there will be plenty of opportunity to
collaborate with partners in the EU and US.

Idiap has well established speech and NLP groups.  We are located in a wine region in the
Swiss Alps, a couple of hours out of Geneva.  The lab functions in English, the locality
in French.

--

Top

6-34(2017-08-31)Trois offres d'emploi au Laboratoire de Phonétique de l'Université de Mons, Belgique
'***************************************************************************
Trois offres d'emploi sont disponibles au Laboratoire de Phonétique de
l'Université de Mons, Belgique.
(1) Assistant(e) sous mandat
(2) Orthophoniste
(3) Chercheur (doctoral et post-doctoral)
 
***************************************************************************
 
(1) Offre d?emploi : « Assistant(e) sous mandat »
 
Le service de métrologie et sciences du langage (laboratoire de
phonétique) de l'Université de Mons recherche des candidats à l'engagement
dans un poste d'assistant sous mandat à partir du début de l'année
académique 2017-2018.
 
PROFIL DU CANDIDAT (M/F):
 
Les candidats (M/F) seront détenteurs d'un diplôme de niveau Master
(obtenu à l'issue d'études comptant pour un minimum de 300 crédits au sens
ECTS) ou jugé équivalent. Ils seront en mesure de se prévaloir:
· de compétences en rapport avec les thématiques d'enseignement du
Service (Voir : 
age/Pages/default.aspx)
· d'une formation initiale susceptible de leur permettre l?accès aux
études doctorales organisées par la Faculté de psychologie et des sciences
de l?éducation (accès direct aux diplômés du domaine Sciences
psychologiques et de l'éducation, accès sur dossier aux autres diplômés) ;
le sujet précis de la thèse de doctorat (centrée sur les sciences de la
parole) sera arrêté de commun accord avec le Service.
 
Les candidats se caractériseront, complémentairement, par:
· de bonnes aptitudes au travail en équipe, de la créativité, de
l?autonomie, une curiosité scientifique aigue,
· une bonne maîtrise de l?outil informatique (tableurs, gestion de bases
de données, traitements de texte et autres éléments de suites
bureautiques),
· une maîtrise suffisante de la langue française à titre de langue de
travail au quotidien,
· une maîtrise de l?anglais permettant les échanges scientifiques à l?oral
en contexte international et la production d?écrits scientifiques.
 
Des connaissances dans le domaine du traitement des phénomènes
acoustiques, une formation préalable en phonétique, la maîtrise d?outils
d?analyse de la parole (tels Praat, Childes,?), une sensibilité à la
problématique des langues étrangères, la maîtrise d?outils statistiques,
des compétences en programmation, la possession d?un permis B et d?un
moyen de transport personnel constituent des atouts complémentaires.
 
OPERATIONS DE RECRUTEMENT:
 
Les personnes intéressées sont priées d?adresser, pour le 11 septembre
2017 au plus tard, un dossier comportant :
· une lettre de motivation,
· un curriculum vitae (comportant adresse mail et numéro de téléphone
susceptibles d?être utilisés en vue d?une éventuelle prise de contact),
· les relevés de notes de chaque année d?études supérieures,
· tout document jugé utile.
au format pdf (exclusivement)
 
Après une première phase d?évaluation des candidatures sur dossier, un
sous-ensemble des candidats sera retenu en vue d?une seconde phase
procédant par entretien de sélection :
· Les candidats retenus pour la seconde phase en seront prévenus par mail
et/ou téléphone le 12
septembre 2017.
· Les entretiens se dérouleront le 15 septembre 2017.
 
Prise de fonction : possible à partir d?octobre 2017.
 
DESCRIPTION DE FONCTION:
 
Type  de contrat : Mandat de 2 ans renouvelable 2 fois avec possibilité
éventuelle, sur justification expresse, de mandats exceptionnels
ultérieurs (maximum 4) d?une durée d?un an chacun.
 
Participation aux activités du service dans ses missions :
· d?enseignement
· de recherche
· de service
Préparation d?une thèse de doctorat
 
***************************************************************************
 
(2) Offre d?emploi : « Orthophoniste »
 
 
Poste de spécialiste de l?étude clinique et du traitement des troubles du
langage
 
Le service de métrologie et sciences du langage (laboratoire de
phonétique) de l?Université de Mons recherche, en vue de l?engagement dans
un poste à mi-temps intitulé « logopède », un spécialiste (M/F) de l?étude
clinique et du traitement des troubles du langage.
 
PROFIL DU POSTE:
 
La personne recrutée prend part aux travaux en clinique du langage que
développe
le Laboratoire. Ceux-ci peuvent être de nature :
· pédagogique (travaux pratiques au sein de cours centrés sur le langage :
sesspécificités, ses difficultés, ses troubles, les actions évaluatives et
curatives qui le concernent)
· scientifique (participation aux recherches du service)
· clinique (impliquant l?action directe au profit de patients dans une
relation à caractère évaluatif et/ou curatif).
 
La possession d?un permis B et d?un moyen de transport personnel
constituent des atouts complémentaires.
 
Compétences :
· Aptitude au travail en équipe, créativité, autonomie, curiosité
scientifique.
· Bonne rmaîtrise de l?outil informatique (tableurs, gestion de bases de
données,
traitements de texte et autres éléments de suites bureautiques).
· Maîtrise native de la langue française à l?écrit et à l?oral.
· Maîtrise au moins passive de l?anglais scientifique au moins à l?écrit.
 
Diplômes requis :
Secteur de formation initiale :
Sciences du langage (orthophonie, logopédie, psychologie du langage,?) à
titre de formation de base ou à tout le moins de formation complémentaire
Niveau à l?entrée :
· soit niveau « bachelier » professionnalisant (bac+3, 180 crédits au sens
ECTS),
· soit niveau « master » (bac+5, 300 crédits au sens ECTS).
 
Expérience professionnelle requise :
Une expérience clinique, des connaissances en phonétique, la maîtrise
d?outil d?analyse de la parole (tels Praat, Childes,?), une sensibilité à
la problématique des langues étrangères.
 
Type de contrat : CDD de 12 mois avec possibilité de renouvellement en CDI
Régime horaire : Mi-temps (19h/semaine)
Grade de recrutement : 1er agent spécialisé (si bachelier) / Attaché (si
master) 
Salaire brut minimum proposé : 1er agent spécialisé : 1119,36 ? / Attaché
: 1487,44 ?
Date de prise de fonction : Possible à partir d?octobre 2017
 
DEPOT DES CANDIDATURES:
 
Les personnes intéressées sont priées d?adresser, pour le 11 septembre 2017
au plus tard, un dossier comportant :
· une lettre de motivation,
· un curriculum vitae (comportant adresse mail et numéro de téléphone
susceptibles d?être utilisés en vue d?une éventuelle prise de contact),
· les relevés de notes de chaque année d?études supérieures,
· tout document jugé utile. au format pdf (EXCLUSIVEMENT) à l?adresse :
 
Procédure de recrutement :
Après une première phase d?évaluation des candidatures sur dossier, un
sous-ensemble des candidats sera retenu en vue d?une seconde phase
procédant par entretien de sélection :
· Les candidats retenus pour la seconde phase en
seront prévenus par mail et/ou téléphone le 12 septembre 2017.
· Les entretiens se dérouleront le 15 septembre
2017.
 
 
***************************************************************************
 
(3) Chercheurs
 
Le service de métrologie et sciences du langage (laboratoire de phonétique
de l?Université de Mons), recherche un spécialiste (M/F) de l?étude de la
parole humaine désireux d?apporter son concours au développement du projet
BIOVOC, dans sa partie centrée sur les effets sur le langage et la parole
de la pression situationnelle (charge cognitive, fatigue, émotions, etc.)
exercée sur le sujet humain par les tâches de conduite de processus
complexes (conduite de véhicules terrestres, pilotage d?aéronefs, gestion
de flux aéronautiques, etc.).
 
Deux profils de poste sont disponibles dans ce contexte.
 
**********PROFIL 1 : Chercheur doctorant***********
 
PROFIL DU CANDIDAT (M/F) :
 
Secteur de formation initiale : Sciences du langage (linguistique,
logopédie, psychologie du langage,?) à titre de formation de base ou à
tout le moins de formation complémentaire approfondie.
 
Niveau à l?entrée : Au moins niveau « master » (« bac+5 », 300 crédits au
sens ECTS).
 
Compétences transversales:
·      Aptitude au travail en équipe, créativité, autonomie, curiosité
scientifique.
·      Bonne maîtrise de l?outil informatique (tableurs, gestion de bases
de données, traitements de texte).
·      maîtrise suffisante de la langue française à titre de langue de
travail au quotidien,
·      maîtrise de l?anglais permettant les échanges scientifiques à
l?oral en contexte international et la production d?écrits scientifiques.
 
Une sensibilité à la problématique des langues étrangères, la maîtrise
d?autres langues, des connaissances en phonétique (en particulier
l?analyse objective de la parole), des connaissances et des compétences en
mathématiques appliquées (en particulier en statistiques), constituent des
atouts complémentaires.
 
Track record:
La personne recrutée peut se prévaloir de mentions égales ou supérieures à
la « grande distinction » en bachelier et en master.
 
Profil fonctionnel:
La personne recrutée se voit octroyer une bourse de doctorat d?un an.
Elle s?engage à tenter d?obtenir, durant ce terme, un financement plus
étendu en concourant pour des postes tels que chercheur FRIA, FRESH et/ou
aspirant FNRS.
Au terme de la première année, en cas d?absence du financement alternatif
ainsi recherché, la personne est évaluée par le comité d?accompagnement.
Sur cette base, elle peut se voir octroyer ?ou non- un terme additionnel
d?un an. Durant ce nouveau terme, elle s?engage à poursuivre sa recherche
de financements extérieurs de son doctorat.
En aucun cas, la durée cumulée des bourses de recherche doctorale ne peut
excéder 4 années.
Prise de fonctions : au plus tôt.
 
 
*******PROFIL 2 : Chercheur en séjour post-doctoral******
 
PROFIL DU CANDIDAT (M/F) :
Secteur de formation initiale : Sciences du langage (linguistique,
logopédie, psychologie du langage,?) à titre de formation de base ou à
tout le moins de formation complémentaire approfondie.
 
Niveau à l?entrée : Doctorat
 
Compétences transversales:
·      Aptitude au travail en équipe, créativité, autonomie, curiosité
scientifique.
·      Bonne maîtrise de l?outil informatique (tableurs, gestion de bases
de données, traitements de texte).
·      Maîtrise suffisante de la langue française à titre de langue de
travail au quotidien.
·      Maîtrise de l?anglais permettant les échanges scientifiques à
l?oral en contexte international et la production d?écrits scientifiques.
 
Une sensibilité à la problématique des langues étrangères, la maîtrise
d?autres langues, des
connaissances en phonétique (en particulier l?analyse objective de la
parole), des connaissances et des compétences en mathématiques appliquées
(en particulier en statistiques), constituent des atouts complémentaires.
 
Track record:
La personne recrutée peut se prévaloir de hautes mentions en bachelier et
en master et, le cas échéant, d?une mention ou d?une évaluation chiffrée
élevées pour le doctorat. Son rapport de thèse atteste ses qualités
scientifiques de haut niveau. Elle a à son actif, un nombre jugé
significatif de publications scientifiques.
 
Profil fonctionnel:
La personne retenue est engagée pour une durée maximale de post-doc de 18
mois.
Durant son séjour post-doc, le chercheur (M/F) travaille à la co-rédaction
d?articles scientifiques susceptibles d?être publiés dans des revues du
domaine à haut facteur d?impact.
Les articles ainsi produits sont signés par le chercheur en premier
auteur, le chef de service en dernier auteur, et les autres publiants en
position intermédiaire, dans un ordre consensuellement déterminé.
Un minimum de 4 publications est attendu pour un séjour correspondant à la
durée maximale.
Un premier contrat à durée déterminée de six mois est établi. Moyennant
évaluation satisfaisante par le Service, ce contrat peut être prolongé.
Durant son séjour post-doctoral, le chercheur peut, moyennant accord du
service, entamer des démarches visant à son maintien au sein du service
après la fin de son séjour post-doctoral ; il bénéficie alors du soutien
du service dans ses démarches auprès des bailleurs de fonds scientifiques
(par exemple, au FNRS : chercheur post-doc, chargé de recherche, chercheur
qualifié, etc.).
 
Prise de fonctions : au plus tôt.
 
 
OPERATIONS DE RECRUTEMENT
 
Les personnes intéressées sont priées d?adresser, pour le 11 septembre
2017 au plus tard, un dossier comportant :
· une lettre de motivation,
· un curriculum vitae (comportant adresse mail et numéro de téléphone
susceptibles d?être utilisés en vue d?une éventuelle prise de contact),
· le rapport de soutenance de thèse,
· les publications jugées comme les plus significatives
· les relevés de notes de chaque année d?études supérieures,
· tout document jugé utile au format pdf (exclusivement) à l?adresse :
 
Après une première phase d?évaluation des candidatures sur dossier, un
sous-ensemble des candidats sera retenu en vue d?une seconde phase
procédant par entretien de sélection :
·      les candidats retenus pour la seconde phase en seront prévenus par
mail et/ou
téléphone le 12 septembre 2017.
·      les entretiens se dérouleront le 15 septembre 2017.
 
PROJET SCIENTIFIQUE
 
Des concepts tels que fatigue ou  stress sont fréquemment évoqués tant en
sciences de la vie qu?en sciences humaines. Ils se caractérisent non
seulement par le fait que leur portée s?étend aussi bien à la biochimie
qu?au psychisme humains, mais surtout par l?idée qu?une action sur
l?esprit peut ici avoir des répercussions sur le corps et vice-versa. Ces
notions demeurent pourtant variablement définies et diversement
objectivées, comme l?est aussi, dans ce contexte, l?interaction entre
physiologie et psychisme.
 
Le projet BIOVOC vise à élucider ces relations complexes en étudiant
l'évolution conjointe, dans une approche intra-sujets, de trois types de
variables: (i) des variables situationnelles (tant à variation invoquée
qu?à variation provoquée); (ii) des marqueurs biologiques de l'état du
sujet humain (approche métabonomique et bio-marqueurs spécifiques
enregistrés au sein de divers biofluides); (iii) les mesures révélatrices
du traitement du langage par le sujet (gestion de la parole en émission et
en réception). 
Les contextes dans lesquels sont recueillies les observations sont ceux du
contrôle par le sujet humain de processus complexes, spécialement en
aéronautique, domaine hautement générateur de « situations-problèmes »
potentiellement suscitatrices de phénomènes.
 
Les recherches à mener dans le cadre du poste offert se centrent plus
particulièrement sur les variables situationnelles et cible spécifiquement
celles qui sont liées aux langues utilisées par le sujet. On sait
aujourd?hui que la réalité physique des sons de parole est notamment
influencée par divers facteurs qui s?y rapportent. Ceux-ci peuvent être
liés à l?inscription communautaire du sujet (on peut par exemple citer les
variabilités diatopique, diastratique, ou diachronique), voire aux actions
exogènes visant délibérément à modifier les caractéristiques phoniques des
sons de parole, par exemple dans les contextes de
l?enseignement/apprentissage (et/ou de l?utilisation) de langues
non-maternelles. D?autres déterminants, de nature endogène au sujet,
peuvent également se manifester, qu?ils trouvent leur origine dans la
sphère cognitive (maîtrise des langues, expertise multilingue, etc.), soit
dans la sphère affective (posture personnelle par rapport aux langues
utilisées). 
 
Ces travaux qui, à des titres divers, démontrent l?action de ces facteurs
endogènes sur les productions vocales, ouvrent la voie à un positionnement
moins descriptif mais plus centré sur la valeur indicielle des
observations effectuées: puisque ces facteurs ont un effet sur le signal
vocal, la détection de leurs marques dans le signal ouvre la voie d?une
caractérisation, par la seule analyse des productions phoniques, de l?état
du locuteur. Le secteur des transports, et en particulier celui de
l?aéronautique, s?est montré graduellement plus intéressé par ces
perspectives lors des dernières décennies ; à un moment où nombre
d?incidents ou d?accidents y sont aujourd?hui imputables au facteur humain
plus qu?à des défauts techniques, le développement de recherches
susceptibles de contribuer à l?élaboration de systèmes d?alerte propres à
détecter des altérations de la fonctionnalité du pilote et ce, sur la base
de variations du seul signal vocal, fait figure de défi stratégique. Si
plusieurs recherches ont certes démontré l?intérêt de ces perspectives,
force est cependant de constater que les résultats en sont extrêmement
diversifiés, voire parfois contradictoires. Ceci
s?explique probablement d?une part par l?insuffisance du volume de données
global recueilli et d?autre part par l?importante diversité méthodologique
qui caractérise le champ. De ce point de vue, trois dimensions
apparaissent nécessiter une attention particulière. D?une part, ces
recherches, sont d?ordinaire restreintes à des productions vocales
anglophones, laissent dans l?ombre les autres langues de communication
aéronautique et, ipso facto, négligent les interactions possibles entre le
facteur langue et les divers facteurs étudiés ; peu, par ailleurs,
prennent en considération le caractère fréquemment multilingue des
communications aéronautiques et le fait que, souvent, les agents sont
amenés à s?exprimer dans une langue non maternelle ; aucune, par ailleurs,
n?interroge le problème de la perte différentielle de compétence phonique
en L2 et en L1 sous l?effet des conditions adverses de communication.
 
Les opérations de recherce à mener dans le cadre du poste offert se
centreront en conséquence tant sur les effets exercés par les variables
situationnelles liées aux situations de contrôle de processus complexes
sur la performance multilingue que sur les effets de divers types de
multilinguismes sur l?efficacité du contrôle de processus complexes gérés
en contextes multilingues.
 
 
***************************************************************************
Dr. Véronique Delvaux
Chercheur qualifié FNRS
Chargée de cours UMONS
Service de Métrologie et Sciences du Langage
Local ?1.7, Place du Parc, 18, 7000 Mons
+3265373140
age/Pages/VeroniqueDelvaux.aspx
Top

6-35(2017-09-01) TheVoice project - PhD Offers at IRCAM, Paris, France

TheVoice project - PhD Offers

TheVoice project (2017-2021) funded by the French Research National Agency (ANR) proposes two PhD thesis. TheVoice project aims to create voices for audiovisual production in the field of the creative, cultural, and entertainment industry. The scientific objective of the project is to study the voices of professional actors, naturally expressive, in order to create innovative voice design solutions. The consortium, composed of recognized laboratories and industrialists (Ircam, LIA, Dubbing Brothers), aims to consolidate a position of excellence for “Made-in-France” research and digital technologies, and to promote the French culture all over the the world.

Deep learning for voice recommendation

The objective of the thesis is to create a voice recommendation system based on deep neural networks, by exploiting the entire “vocal palette” of professional actors and integrating information related to acoustics, perception (what the listener perceives, without context), and reception (what the spectator perceives “in situation” in a movie depending on his social and cultural expectations).

 

Contacts: jean-francois.bonastre@univ-avignon.fr, Nicolas.Obin@ircam.fr

Expressive voice identity conversion

 

The objective of the thesis is to create a voice identity conversion system able to reproduce the voice of professional actors from naturally expressive and real acted conditions, by exploiting the audio tracks of movies, series, etc. The thesis will be based on the long-term experience in voice analysis and transformation and the existing voice conversion system developed at Ircam, currently used for professional productions.

Contact: Nicolas.Obin@ircam.fr



Candidates must have a master degree in computer science (or equivalent) with skills in audio signal processing, machine learning, and informatics (python, C++). A preliminary experience in speech processing would be greatly appreciated.

Applications (CV + motivation letters) must be sent before: 15/09/2017

Top

6-36(2017-09-09)Positions at Reykjavik University's School of Science and Engineering, Iceland

Applications are invited for a research position in text to speech systems at the Language and Voice Laboratory (lvl.ru.is) at Reykjavik University's School of Science and Engineering. The position is sponsored by the Icelandic Language Technology fund grant 'Environment for building text-to-speech synthesis for Icelandic.' The main aim of the work will be to set up and advance research on speech synthesis back-end architectures for parametric speech synthesis. The successful candidate will work closely with the other members of the lab who focus on the language specific problems such as text normalization, phonemic analysis and phrasing. Even though the main objective of the work will be on Icelandic, working with other languages will be welcome. The successful candidate will contribute to the academic output of the lab as well as the publication of an open TTS environment for Icelandic.
 
Skills and qualifications
        ? MSc/PhD degree in applied mathematic, statistics, computational engineering or computer science is preferable
        ? Knowledge of mathematical modeling is preferrable (e.g. differential equations, neural networks, linear systems)
        ? Good skills in applying and analysing algorithms
        ? Good knowledge of Linux is preferable
        ? Ability to use shell scripts is advantageous (bash, awk, sed)
        ? Good programming skills are necessary (e.g. C++, Java or Python)
 
Fixed-term
The funds for this post are available for 24 months in the first instance.
 
Further information regarding the position provides Jón Guðnason (jg@ru.is), associate professor at School of Science and Engineering, and Anna Björk Nikulásdóttir, research specialist at School of Science and Engineering. Interviews will start September 14th, but applications received after that date will be taken into consideration if the position has not been filled by December 1st 2017. Applications must be submitted on the Reykjavík University website, here below. All inquiries and applications are treated as confidential.

http://radningar.hr.is/storf/ViewJobOnWeb.aspx?jobid=3005

Top

6-37(2017-09-09) PhD position in Computational Linguistics for Ambient Intelligence, University Grenoble Alpes, France

Keywords: Natural language understanding, decision support system, smart
home

The Laboratoire d'Informatique de Grenoble (LIG) of the University
Grenoble Alpes, Grenoble, France invites applications for a PhD position
in Computational Linguistics for Ambient Intelligence.

University of Grenoble Alpes is situated in a high-tech city located at
the heart of the Alps, in outstanding scientific and natural
surroundings. It is 3h by train from Paris ; 2h from Geneva and is less
than 1h from Lyon international airport.

The position starts in September 2017 and ends in July 2020 and is
proposed in the context of the national project Vocadom
(http://vocadom.imag.fr/) whose aim is to build technologies that make
natural hand-free speech interaction with a home automation system
possible from anywhere in the home even in adverse conditions
[Vacher2015].

The aim of the PhD will be to build a new generation of situated spoken
human machine interaction where uttered sentences by a human are
understood within the context of the interaction in the home. The
targeted application is a distant speech hand free and ubiquitous voice
user interface to make the home automation system react to voice
commands [Chahuara2017]. The system should be able to process possibly
erroneous outputs from an ASR system (Automatic Speech Recognition) to
extract meaning related to a voice command and to decide about which
command to execute or to send a relevant feed-back to the user. The
challenge will be to constantly adapt the system to new lexical phrases
(no a priori grammar), new situations (e.g., unseen user, context) and
change in the house (e.g., new device, device out of order). In this
work, we propose to extend classical S/NLU (Natural Language
Understanding) approaches by including non-linguistic contextual
information in the NLU process to tackle the ambiguity and borrow
zero-shot learning techniques [Ferreira2015] to extend the lexical space
on-line. Reinforcement learning is targeted to adapt the models to the
user(s) all along the use of the system [Mnih2015].  The candidate will
be strongly encouraged to publish their progress to the main events of
the field (ACL, Interspeech, Ubicomp).  The PhD candidate will also be
involved in experiments including real smart-home and real users
(elderly people and people with visual impairment) [Vacher2015].

REFERENCES :
[Mnih2015] Mnih, Kavukcuoglu et al.  Human-level control through deep
reinforcement learning. Nature 518, 529?533

[Chahuara2017] Chahuara, F. Portet, M. Vacher Context-aware decision
making under uncertainty for voice-based control of smart home Expert
Systems with Applications, Elsevier, 2017, 75, pp.63-79.

[Ferreira2015] E Ferreira, B Jabaian, F Lefevre Online adaptative
zero-shot learning spoken language understanding using word-embedding
Acoustics, Speech and Signal Processing (ICASSP), 2015

[Vacher2015] M. Vacher, S. Caffiau, F. Portet, B. Meillon, C. Roux, E.
Elias, B. Lecouteux, P. Chahuara. Evaluation of a context-aware voice
interface for Ambient Assisted Living: qualitative user study vs.
quantitative system evaluation. ACM - Transactions on Speech and
Language Processing, Association for Computing Machinery, 2015,
pp.5:1-5:36.

JOB REQUIREMENTS AND QUALIFICATIONS

- Master?s degree in Computational Linguistics or Artificial
Intelligence (Computer Science can also be considered)
- Solid programming skills,
- Good background in machine learning,
- Excellent English communication and writing skills,
- Good command of French (mandatory),
- Experience in experimentation involving human participants would be a
   plus
- Experience in dialogue systems would be a plus plus

Applications should include:

- Cover letter outlining interest in the position
- Names of two referees
- Curriculum Vitae (CV) (with publications if applicable)
- Copy of the university marks (grade list)

and be sent to michel.vacher@imag.fr and francois.portet@imag.fr

Research Group Website : http://getalp.imag.fr
Research project website : http://vocadom.imag.fr/

Top

6-38(2017-09-09) new PostDoc positions for our current international project Dig that Lick: Analyzing large-scale data for melodic patterns in jazz performances ( dig-that-lick.eecs.qmul.ac.uk ).

IRCAM ( www.ircam.fr ) and L2S at University Paris Saclay ( www.l2s.centralesupelec.fr ) are jointly offering two new PostDoc positions for our current international project Dig that Lick: Analyzing large-scale data for melodic patterns in jazz performances ( dig-that-lick.eecs.qmul.ac.uk ). The project gathers six different universities across four countries (USA, UK, Germany, France).

The planned dates for the project are: 1 Oct 2017 - 30 Sep 2019.

* About the 'Dig that Lick' project: *
The recorded legacy of jazz spans a century and provides a vast corpus of data documenting its development. Recent advances in digital signal processing and data analysis technologies enable automatic recognition of musical structures and their linkage through metadata to historical and social context. Automatic metadata extraction and aggregation give unprecedented access to large collections, fostering new interdisciplinary research opportunities. This project aims to develop innovative technological and music-analytical methods to gain fresh insight into jazz history by bringing together renowned scholars and results from several high-profile projects. Musicologists and computer scientists will together create a deeper and more comprehensive understanding of jazz in its social and cultural context. We exemplify our methods via a full cycle of analysis of melodic patterns, or licks, from audio recordings to an aesthetically contextualised and historically situated understanding.
More information can be found at: dig-that-lick.eecs.qmul.ac.uk


* Position description: *
Dig that Lick relies on audio music content analysis algorithms. IRCAM & L2S are looking for 2 Post-Doctoral Research assistants (starting Oct. 1st, 2017) for the development of MIR algorithms:
? one 24-month post-doctoral position to develop robust algorithms for automatic melody extraction (AME) from audio signals
? one 12-month post-doctoral position to develop robust algorithms for automatic analysis of harmonic and metrical structure from audio signals

* Required profiles: *
? High skills in audio signal processing and in machine learning (the candidate should preferably hold a PHD in one of these fields)
? High skills in Matlab and/or Python programming
? Good knowledge of Linux, Windows, Mac-OS environments
? High productivity, methodical works, ability to work independently and in a creative way, excellent programming style.
? Strong communication skills in English (French is not required)
? Interest/background in Jazz music will be appreciated

The hired researchers will collaborate with the development team and participate in the project activities (evaluation of technologies, meetings, specifications, reports). Strong interactions with teams from the other participating universities are expected.

* Application Procedure / More information: *
Applications must include a detailed CV with a list of publications and a motivation letter. Applications and any suitable information are to be emailed to: H. Papadopoulos [ircam.l2s.digthatlick@gmail.com] and G. Peeters [peeters at ircam dot fr]

Top



 Organisation  Events   Membership   Help 
 > Board  > Interspeech  > Join - renew  > Sitemap
 > Legal documents  > Workshops  > Membership directory  > Contact
 > Logos      > FAQ
       > Privacy policy

© Copyright 2024 - ISCA International Speech Communication Association - All right reserved.

Powered by ISCA