ISCA - International Speech
Communication Association


ISCApad Archive  »  2022  »  ISCApad #289  »  Jobs

ISCApad #289

Sunday, July 10, 2022 by Chris Wellekens

6 Jobs
6-1(2022-06-15) Ingénieur.e science des données et corpus – Laboratoire d’Informatique de Grenoble, France

Ingénieur.e science des données et corpus – Laboratoire d’Informatique de Grenoble

 

Analyse, conception, mise en forme et diffusion des corpus vocaux et multimodaux du LIG et du LIDILEM

 

Poste à pourvoir : ingénieur - CDD

Durée : 1 an (possibilité de prolongation)

Début : à partir du 1er septembre 2022

Date limite de candidature : 30 juin 2022

Lieu : Laboratoire d’informatique de Grenoble – Équipe Getalp

Domaine : Traitement Automatique des Langues et de la Parole
 
Profil : Master 2 informatique ou doctorat en informatique linguistique

 

Contexte

Le poste à pouvoir est soutenu par la Chaire Artificial Intelligence & Language de l'Institut MIAI Grenoble Alpes. MIAI est un centre d’excellence en intelligence artificielle qui vise à conduire des recherches au plus haut niveau, à proposer des enseignements attractifs pour les étudiant.e.s et les professionnel.le.s de tous les niveaux, à soutenir l'innovation dans les grandes entreprises, les PMEs et les startups et enfin à informer et interagir avec les citoyen.ne.s sur tous les aspects de l'IA. La personne recrutée sera hébergée au sein de l'équipe GETALP du Laboratoire d'Informatique de Grenoble (LIG), qui offre un cadre dynamique, international et stimulant pour mener des recherches pluridisciplinaires de haut niveau. L'équipe GETALP est hébergée dans un bâtiment moderne (IMAG) situé sur un campus paysager de 175 hectares qui a été classé huitième plus beau campus d'Europe par le magazine Times Higher Education en 2018.

 

Missions confiées

  • Organiser des corpus contenant des données multimodales (audio, texte, vidéo).

  • Traiter et transformer les données en format d’usage pour faciliter les traitements et la reproductibilité.

  • Développer des scripts pour la transformation, le formatage et le test des données (Python, Bash, Java).

  • Superviser des campagnes d’annotation de données (Elan, doccano, Brat).

  • Diffuser ces corpus sur des plateformes ouvertes (ORTOLANG, Zenodo, ELRA) et faciliter leur exploitation.

  • Participer à la rédaction de documents scientifiques et techniques.

  • Assister la mise en œuvre et gérer divers pipelines logiciels pour soutenir l'analyse de données et l'exploration de textes.

  • Aider les autres membres de l'équipe à réaliser des expériences concernant les données.

  • Documenter le cycle de vie des données et mettre à jour le plan de gestion des données.

Vous travaillerez en étroite collaboration avec des doctorants, des stagiaires et des chercheurs du bassin Grenoblois de l’institut MIAI.

Vous bénéficierez également des compétences et de l'environnement de recherche de 2 unités de recherche : le LIG (https://www.liglab.fr) et le LIDILEM (https://lidilem.univ-grenoble-alpes.fr/).

 

Compétences

  • Master en data science, humanités numériques ou sciences sociales computationnelles ;

  • Maîtrise de l’anglais technique et scientifique ;

  • Excellent relationnel ;

  • Savoir travailler en équipe pluridisciplinaire ;

  • Savoir s’adapter au contexte projet ;

  • Être autonome dans son organisation personnelle et le reporting ;

  • Avoir une bonne communication écrite et orale en français ;

  • Maîtrise de langages de scripts (Python, bash, Perl, PhP) ;

  • Connaissance des outils d’annotations (Elan, Praat) ;

  • Expérience en outils de linguistique de corpus, en recherche sur corpus, en analyse quantitative et qualitative des données.

  • Une expérience en traitement du langage naturel, traitement de la parole ou en linguistique computationnelle sont jugées comme un plus.

 

Instructions pour postuler

Les candidatures sont attendues jusqu'au 30 juin 2022.

Veuillez envoyer votre CV + une lettre/message de motivation + les notes de vos études antérieures + des références pour une ou plusieurs lettres de recommandation potentielles à :

Francois.portet@imag.fr

  


 


Back  Top

6-2(2022-02-01) Two post-docs at ADAPT, Dublin, Ireland
We are looking to recruit two Post-Doctoral Researchers to join the UCD team in the ADAPT Research Centre (www.adaptcentre.ie).  These projects are part of the ADAPT Digital Content Transformation Strand.
 
1) Harnessing Speech for Social Inclusion
Rather than focusing on accuracy and error rates, evaluation of speech recognition systems should be contextualized with respect to how well they perform in situations where the interlocutor is not the ‘typical’ native speaker (e.g. a senior citizen, a citizen with disabilities or a non-native speaker). Through publicly engaged research, data will be collected on how well existing speech technology is serving our citizens. A systematic evaluation of the output data produced by existing ASR systems and the interactions that arise when an ASR ‘converses’ with a range of users will enable a categorization of interaction issues and error patterns that need to be accounted for when developing applications which provide interfaces to essential services. This project will involve the development of a system which facilitates diagnostic evaluation of ASRs in a variety of interaction scenarios providing linguistic cues the augmentation of the ASR and building on the low-resource MT technologies being developed in other ADAPT projects.
 
2) Embedding of Multi-level Speech Representations
While deep learning has led to huge performance gains in speech recognition and synthesis, only recently more focus is being placed on what deep learning may be able uncover about the patterns which humans use intuitively when interacting via speech and which distinguish native from non-native speakers. Such patterns are typically the focus of speech perception and experimental phonetic studies. This project aims to build on the notion of multi-linear or multi-tiered representations of speech, creating embeddings of multiple (sub- word) levels of representation – phonetic features, phonemes, syllable pieces and syllables – enabling a closer investigation of systematicity and variability of speech patterns. This research will find application in non-native speech recognition, in speech adaptation/accommodation for native and non-native interactions and in pronunciation training scenarios.
 
ADAPT is the world-leading SFI research centre for AI Driven Digital Content Technology, coordinated by Trinity College Dublin and based within Dublin City University, University College Dublin, Technological University Dublin, Maynooth University, Munster Technological University, Athlone Institute of Technology, and the National University of Ireland Galway. ADAPT’s research vision is to pioneer new forms of proactive, scalable, and integrated AI-driven Digital Content Technology that empower individuals and society to engage in digital experiences with control, inclusion, and accountability with the long-term goal of a balanced digital society by 2030. ADAPT is pioneering new Human Centric AI techniques and technologies including personalisation, natural language processing, data analytics, intelligent machine translation human-computer interaction, as well as setting the standards for data governance, privacy and ethics for digital content.

ADAPT Digital Content Transformation Strand
From the algorithmic perspective, new machine learning techniques will both enable more users to engage meaningfully with the increasing volumes of content globally in a more measurably effective manner, while ensuring the widest linguistic and cultural inclusion. It will enhance effective, robust integrated machine learning algorithms needed to provide multimodal content experiences with new levels of accuracy, multilingualism and explainability.
 
Full details of the positions, requirements and link to submit applications can be found at:
 
 
The closing date is 17:00hrs (local Irish time) on Monday 14th February 2022 and and candidates must apply via https://www.ucd.ie/workatucd/jobs/  
 
The reference is 
 
1) 014028 for Harnessing Speech for Social Inclusion
2) 014079 for Embedding of Multi-level Speech Representations
 
Back  Top

6-3(2022-02-02) Professeur(-e), Sorbonne Université, Paris, France
Un poste de Professeure / Professeur des Universités en Intelligence artificielle : théorie et applications, est à pourvoir à Sorbonne Université avec une affection recherche dans un des laboratoires : ISIR, LIB, LIMICS ou LIP6. 
 
Professeure / Professeur des Universités 
Section 27 – Informatique 
Profil : Intelligence artificielle : théorie et applications

Date limite des candidatures au poste : le 04 mars 2022 à 16h

La personne recrutée contribuera significativement aux enseignements de Licence d’informatique dont les besoins couvrent l’ensemble de la discipline (algorithmique, programmation notamment objet, concurrente, fonctionnelle, web, mathématiques discrètes, structures de données, système, architecture, réseaux, compilation, bases de données, etc.) ainsi qu’au master d’informatique, en particulier pour les parcours ANDROIDE, BIM ou DAC.

Recherche :
Le poste est ouvert à tous les domaines de l’IA et de ses applications. La personne retenue intégrera l’un des laboratoires : ISIR, LIB, LIMICS ou LIP6 selon ses thématiques de recherche, et/ou de projets impliquant plusieurs laboratoires d’accueil au sein de SCAI (Sorbonne Center for Artificial Intelligence). La professeure ou le professeur devra être capable de coordonner des programmes collaboratifs nationaux et internationaux. La participation de la candidate ou du candidat, dans le passé, à des projets multidisciplinaires sera appréciée.

Lien vers la fiche de poste : https://www.galaxie.enseignementsup-recherche.gouv.fr/ensup/ListesPostesPublies/FIDIS/0755890V/FOPC_0755890V_391.pdf
 
 
En vous remerciant d’avance pour votre aide dans le partage de cette offre.
Bien cordialement, 



 
Back  Top

6-4(2022-02-11) 2 research fellowship grant for collaboration in research activities, Kore University of Enna - Enna (Sicily), Italy

A public selection procedure is called, based on qualifications and interview, for the assignment of n. 2 research fellowship grant for collaboration in research activities

 

Project main aim: Multidisciplinary Research on AI for Health.

Location: Kore University of Enna - Enna (Sicily), Italy

Funding Programme: Research Projects of National Relevance - PRIN 2017

 

Description: The project focuses on idiopathic Parkinson’s disease dysarthric speech, produced by speakers of two varieties of Italian that show different segmental (consonantal, vocalic) and prosodic characteristics. The project as a whole aims at: identifying phonetic features that impact on speech intelligibility and accuracy, separating variability due to dysarthria from features due to sociolinguistic variation, and developing perspectives and tools for clinical practice that take variation into account.

 

Duration: 12 months

 

Link to apply: https://unikore.it/index.php/it/contratti-di-ricerca/item/41282-d-p-n-33-2022-2-assegni-di-ricerca-presso-l-universita-degli-studi-di-enna-kore

 

Further information contact Prof. Sabato Marco Siniscalchi, E-mail: marco.siniscalchi-at-unikore.it

 

Back  Top

6-5(2022-02-08) PhD position at Delft University, The Netherlands

Job description

 

One of the most pressing matters that holds back robots from taking on more tasks and reach a widespread deployment in society is their limited ability to understand human communication and take situation-appropriate actions. This PhD positions is dedicated to addressing this gap by developing the underlying data-driven models that enable a robot to engage with humans in a socially aware manner.

This position is specifically targeted at the development of an argumentative dialogue system for human-robot interaction.  The PhD candidate will explore how to fuse multimodal behaviour to infer a person's perspective. The candidate will use, and further develop, reinforcement learning techniques in order to drive the robot's argumentative strategy for deliberating topics of current social importance such as global warming or vaccination.

The ideal candidate will have a keen interest in speech technology and reinforcement learning. He or she has strong interactive system background will design and run the experiments to evaluate the created hybrid-AI models through human-robot interaction.

Topics of interest:

1) long-term human-robot interaction

2) affective computing

2) NLP&argument-mining

 

Requirements

 
  • MSc in Computer Science or related field.
  • At least 3 years of programming experience in python (java  or C++is a plus).
  • Motivation to meet deadlines.
  • Affinitty to design and social science research.
  • Interest in collaborating with colleagues from Industrial Design.
  • Willingness to teach and guide students.
  • The ability to work in a team, take initiative, be results oriented and systematic.

Click here to apply:
Back  Top

6-6(2022-02-17) Postdoctoral position at INRIA, Bordeaux, France

Postdoctoral position in Speech Processing at INRIA, Bordeaux, France

 

Title: Glottal source inverse filtering for the analysis and classification of pathological speech

Keywords: Pathological speech processing, Glottal source estimation, Inverse filtering, Machine learning, Parkinsonian disorders, Respiratory diseases

Contact and Supervisor: Khalid Daoudi (khalid.daoudi@inria.fr)

INRIA team: GEOSTAT (geostat.bordeaux.inria.fr)

Duration: 13 months (could be extended)

Starting date: between 01/04/2022 and 01/06/2022 (depending on the candidate availability)

Application : via https://recrutement.inria.fr/public/classic/en/offres/2022-04481

Salary: 2653€/month (before taxes, net salary 2132€)

Profile: PhD thesis in signal/speech processing (or a solid post-thesis experience in the field) Required Knowledge and background: A solid knowledge in speech/signal processing; Basics of machine learning; Programming in Matlab and Python. Scientific research context During this century, there has been an ever increasing interest in the development of objective vocal biomarkers to assist in diagnosis and monitoring of neurodegenerative diseases and, recently, respiratory diseases because of the Covid-19 pandemic. The literature is now relatively rich in methods for objective analysis of dysarthria, a class of motor speech disorders [1], where most of the effort has been made on speech impaired by Parkinson’s disease. However, relatively few studies have addressed the challenging problem of discrimination between subgroups of Parkinsonian disorders which share similar clinical symptoms, particularly is early disease stages [2]. As for the analysis of speech impaired by respiratory diseases, the field is relatively new (with existing developments in very specialized areas) but is taking a great attention since the beginning of the pandemic. The speech production mechanism is essentially governed by five subsystems: respiratory, phonatory, articulatory, nasalic and prosodic. In the framework of pathological speech, the phonatory subsystem is the most studied one, usually using sustained phonation (prolonged vowels). Phonatory measurements are generally based on perturbations or/and cepstral features. Though these features are widely used and accepted, they are limited by the fact that the produced speech can be a product of some or all the other subsystems. The latter thus all contribute to the phonatory performance. An appealing way to bi-pass this problem is to try to extract the glottal source from speech in order to isolate the phonatory contribution. This framework is known as glottal source inverse filtering (GSIF) [3]. The primary objective of this proposal is to investigate GSIF methods in pathological speech impaired by dysarthria and respiratory deficit. The second objective is to use the resulting glottal parameterizations as inputs to basic machine learning algorithms in order to assist in the discrimination between subgroups of Parkinsonian disorders (Parkinson’s disease, Multiple-System Atrophy, Progressive Supranuclear Palsy) and in the monitoring of respiratory diseases (Covid-19, Asthma, COPD). Both objectives benefit from a rich dataset of speech and other biosignals recently collected in the framework of two clinical studies in partnership with university hospitals in Bordeaux and Toulouse (for Parkinsonian disorders) and in Paris (for respiratory diseases).

Work description GSIF consists in building a model to filter out the effect of the vocal tract and lips radiation from the recorded speech signal. This difficult problem, even in the case of healthy speech, becomes more challenging in the case of pathological speech. We will first investigate time-domain methods for the parameterization of the glottal excitation using glottal opening and closure instants. This implies the development of a robust technique to estimate these critical time-instants from dysarthric speech. We will then explore the alternative approach of learning a parametric model of the entire glottal flow. Finally, we will investigate frequency-domain methods to determine relationships between different spectral measures and the glottal source. These algorithmic developments will be evaluated and validated using a rich set of biosignals obtained from patients with Parkinsonian disorders and from healthy controls. The biosignals are electroglottography and aerodynamic measurements of oral and nasal airflow as well as intra-oral and sub-glottic pressure. After dysarthric speech GIFS analysis, we will study the adaptation/generalization to speech impaired by respiratory deficits. The developments will be evaluated using manual annotations, by an expert phonetician, of speech signals obtained from patients with respiratory deficit and from healthy controls. The second aspect of the work consists in manipulating machine learning algorithms (LDA, logistic regression, decision trees, SVM…) using standard tools (such as Scikit-Learn). The goal here will be to study the discriminative power of the resulting speech features/measures and their complementarity with other features related to different speech subsystems. The ultimate goal being to conceive robust algorithms to assist, first, in the discrimination between Parkinsonian disorders and, second, in the monitoring of respiratory deficit.

Work synergy - The postdoc will interact closely with an engineer who is developing an open-source software architecture dedicated to pathological speech processing. The validated algorithms will be implemented in this architecture by the engineer, under the co-supervision of the postdoc. - Giving the multidisciplinary nature of the proposal, the postdoc will interact with the clinicians participating in the two clinical studies.

References: [1] J. Duffy. Motor Speech Disorders Substrates, Differential Diagnosis, and Management. Elsevier, 2013. [2] J. Rusz et al. Speech disorders reflect differing pathophysiology in Parkinson's disease, progressive supranuclear palsy and multiple system atrophy. Journal of Neurology, 262(4), 2015. [3] P. Alku. Glottal inverse filtering analysis of human voice production – A review of estimation and parameterization methods of the glottal excitation and their applications. Sadhana – Academy Proceedings in Engineering Sciences. Vol. 36, Part 5, pp. 623-650, 2011

Back  Top

6-7(2022-02-15) Thèse CIFRE L3i La Rochelle-EasyChain



La jeune compagnie EasyChain et le laboratoire de recherche L3i de La Rochelle lancent un appel à candidatures pour un poste de doctorant.e CIFRE dans le domaine de développement des agents conversationnels.

Les détails de l'offre sont disponibles à cette adresse :
Dynamic Human-Agent Interactions adapted to users? profiles.

Le.la candidat.e retenu.e devra être titulaire d'un master ou d'un diplôme équivalent en informatique ou en traitement automatique des langues. Un solide bagage en apprentissage automatique et une bonne communication en anglais sont requis.

La thèse se déroulera principalement à Niort chez EasyChain, dans un environnement francophone.

Si vous êtes intéressé.e par ce poste, veuillez envoyer les informations suivantes à Antoine Doucet (antoine.doucet@univ-lr.fr),  et Ahmed Hamdi (ahmed.hamdi@univ-lr.fr):
 * CV détaillé
 * Diplômes de licence et de master.
* Lettres de recommandation

Les candidatures seront étudiées jusqu'au 1 mars 2022.

Bien cordialement

Antoine Doucet

Back  Top

6-8(2022-04-05) Junior professor position at Université du Mans, France
L?Université du Mans ouvre une Chaire Professeur Junior en traitement du langage multimodal.
 
Les candidatures sont ouvertes sur Galaxie à déposer avant le 2 mai 2022.
 

Projet de recherche / Description of the research project

L?objectif principal est de développer une IA de traitement du langage multimodale et multilingue qui repose sur un espace de représentation commun pour les modalités parole et texte dans différentes langues. Le ou la candidat.e devra développer ses activités de recherche afin de renforcer le caractère transverse de ces représentations à travers une combinaison pertinente de modalités (par ex. : vidéo et texte ou texte et parole), de tâches (par ex. : caractérisation du locuteur et synthèse de la parole, compréhension de la parole et traduction automatique, reconnaissance de la parole et synthèse de résumé?automatique) et de langues. Ses travaux de recherche tendront à?développer des systèmes automatiques intégrant l?humain au c?ur du traitement en utilisant des approches d?apprentissage actif et en explorant les problématiques d?expliquabilité?et d?interpretabilité afin de permettre à l?utilisateur naïf d?enseigner au système automatique ou d?en extraire des éléments compréhensibles. Ce projet visera également le renforcement de collaborations existantes (Facebook, Orange, Airbus) ou la création de nouveaux partenariats (Oracle, HuggingFace?). 

The research project should take place in the LST team goal that aims at developping a multimodal and multilingual representation space for speech and text modalities. The Junior Professor is expected to develop his/her own research diretions between the topics already existing in the LST team and to develop hybrid approaches by mixing for instance speaker characterization and speech synthesis or speech translation and speech understanding. He/She should also integrate the strategy of the team to involve the human in the loop for deep learning systems and work towards a better explainability/interpretability of speech processing algorithms.

Projet d'enseignement / Description of the teaching project 

Le ou la candidat.e intégrera l?équipe pédagogique du Master en intelligence artificielle du département d?informatique de l?UFR Sciences et Techniques de l?Université? du Mans. Son implication aura pour but de renforcer les compétences en apprentissage profond (apprentissage auto-supervisé, GANs, Transfomer, méthodologies et protocoles pour l?IA?) mais également dans les infrastructures dédiées à l?apprentissage automatique et aux sciences des données (calcul distribué, SLURM, MPI), l?utilisation d?un cluster de calcul (ssh, temux, jupyter-lab, conda) ou le cloud computing. Fort de compétences reconnues en traitement automatique du langage et de la parole l?équipe pédagogique souhaite élargir son offre de formation en adaptant les contenus à d?autres types de données (images, séquences temporelles générées par différents types de capteurs, graphes?) afin de répondre aux besoins spécifiques du tissu industriel local et régional en apprentissage automatique. Cette action s?inscrira dans la volonté de l?équipe pédagogique de développer l?apprentissage et la formation continue en lien avec les partenaires industriels mais également à destination d?un public académique de chercheurs et enseignant chercheurs non-informaticiens souhaitant développer des compétences en apprentissage automatique.

Teaching activities will take place within the Master of Computer Sciences and Artificial Intelligence from Le Mans University. The candidate is expected to strengthen the teaching on deep learning (self-supervised training, GANs, Transformers, machine learning methodology and protocols?) but also teach tools for distributed learning (SLURM, MPI, ssh, temux, jupyter-lab, conda?) and cloud computing. In mid terms, the candidate will contribute to the development of a  continuing learning in artificial intelligence adapted to the need of local companies and industry but also for researchers non-specialist in computer sciences. 

Conditions de candidature / Application requirements

être titulaire d'un doctorat  / hold a PhD

Pour les candidats exerçant ou ayant cessé d'exercer depuis moins de dix-huit mois une fonction l'enseignant-chercheur, d'un niveau équivalent à celui de l'emploi à pourvoir, dans un établissement d'enseignement supérieur d'un État autre que la France:  titres, travaux et tout élément permettant d'apprécier le niveau de fonction permettant d'accorder une dispense de doctorat. 

For candidates exercising or having ceased to exercise for less than eighteen months a function of teacher-researcher, of a level equivalent to that of the position to be filled, in a higher education establishment of a State other than France: titles, works and any element allowing to appreciate the level of function allowing to grant a dispence of doctorate.

Contact 

Antoine LAURENT

Antoine.laurent@univ-lemans.fr

Anthony LARCHER

Anthony.larcher@univ-lemans.fr

Back  Top

6-9(2022-03-17) un poste de doctorant(e) dans le domaine de la détection multimodale de deep fakes, IRISA, Lannion, France
L?équipe EXPRESSION de l?IRISA lance un appel à candidatures pour un poste de doctorant ou doctorante dans le domaine de la détection multimodale de deep fakes.

Les détails de l'offre sont disponibles à cette adresse : 
MUDEEFA - MUltimodal DeEEp Fake detection using Text-To-Speech Synthesis, Voice Conversion and Lips Reading
 
Le candidat ou la candidate devra mener des recherches appliquées de pointe dans un ou plusieurs des domaines suivants : traitement du signal, apprentissage automatique statistique, reconnaissance de la parole et des gestes. Il/elle devra posséder d'excellentes compétences en programmation informatique (par exemple C/C++, Python/Perl, etc.), et des connaissances en apprentissage automatique, en traitement du signal ou en interaction homme-machine.
Le poste nécessite d'être titulaire d'un master en informatique ou d'un diplôme d'ingénieur donnant le titre de master en informatique.

La thèse se déroulera à Lannion, dans les Côtes d?Armor, au sein de l?équipe EXPRESSION.

Merci d'envoyer un CV détaillé, une lettre de motivation, une ou plusieurs lettres de référence et les résultats académiques du diplôme précédent (Master ou Ingénieur donnant le titre de Master) à tous les contacts indiqués dans le sujet avant le vendredi 8 avril 2022, limite stricte.

Bien cordialement

Arnaud Delhay-Lorrain

 

Arnaud Delhay-Lorrain - Associate Professor
IRISA - Université de Rennes 1 
IUT de Lannion - Département Informatique
Rue Edouard Branly - BP 30219
F-22 302 LANNION Cedex
Back  Top

6-10(2022-03-18) 3 speech-to-speech translation positions available at Meta/Facebook FAIR
We are seeking research scientists, research engineers and postdoctoral researchers with expertise on speech translation and related fields to join our team.

FAIR?s mission is to advance the state-of-the-art in artificial intelligence through open research for the benefit of all. As part of this mission, our goal is to provide real-time, natural-sounding translations at near-human level quality. The technology we develop will enable multilingual live communication. We aim for our technology to be inclusive: it should support both written and unwritten languages. Finally, in order to preserve the authenticity of the original content, especially for more creativity related content, we aim to preserve non-lexical elements in the generated audio translations. Ideal candidates will have expertise on speech translation or related fields such as speech recognition, machine translation or speech synthesis. Please send email to juancarabina@fb.com with a CV if you are interested in applying.
Back  Top

6-11(2022-03-21) Thèse ou postdoc at Laboratoire d'Informatique de Grenoble, France
Sujet de thèse ou de postdoctorat dans le cadre du projet  Popcorn (projet collaboratif avec deux entreprises)
encadrée par Benjamin Lecouteux, Gilles Sérasset et Didier Schwab (Laboratoire d?Informatique de Grenoble, Groupe d?Étude en Traduction Automatique/Traitement Automatisé des Langues et de la Parole)


Titre : Peuplement OPérationnel de bases de COnnaissances et Réseaux Neuronaux


Le projet aborde le problème de l?enrichissement semi-automatisé d?une base de connaissance au travers de l?analyse automatique de textes. Afin d?obtenir une innovation de rupture dans le domaine du Traitement Automatique du Langage Naturel (TALN) pour les clients sécurité et défense, le projet se focalise sur le traitement du français (même si les approches retenues seront par la suite généralisables à d?autres langues). Les travaux aborderont différents aspects :
? L?annotation automatique de documents textuels par la détection de mentions d?entités présentes dans la base de connaissance et leurs désambiguïsation sémantique (polysémie, homonymie) ;
? La découverte de nouvelles entités (personnes, organisations, équipements, événements, lieux), de leurs attributs (âge d?une personne, numéro de référence d?un équipement, etc.), et des relations entre entités (une personne travaille pour une organisation, des personnes impliquées dans un événement, ...). Une attention particulière sera donnée au fait de pouvoir s?adapter souplement à des évolutions de l?ontologie, la prise en compte de la place de l?utilisateur et de l?analyste pour la validation/capitalisation des extractions effectuées.
Le projet se focalise autour des trois axes de recherches suivants :
? Génération de données synthétiques textuelles à partir de textes de référence ;
? La reconnaissance des entités d?intérêt, des attributs associés et des relations entre les entités.
? La désambiguisation sémantique des entités (en cas d?homonymie par exemple)


Profil recherché:
    - Solide expérience en programmation & machine learning pour le Traitement Automatique de Langues (TAL), notamment l?apprentissage profond
    - Master/Doctorat Machine Learning ou informatique, une composante TAL ou linguistique computationnelle sera un plus apprécié
    - Bonne connaissance du français


Détails pratiques:
    - Début de la thèse rentrée 2022
    - Contrat doctoral à temps plein au LIG (équipe Getalp) pour 3 ans (salaire: min 1768? brut mensuel)
    - ou Contrat postdoctoral à temps plein au LIG (équipe Getalp) pour 20 mois (salaire: min 2395? brut mensuel)



 

Environnement scientifique:


  • Le doctorat ou le postdoctorat sera mené au sein de l'équipe Getalp du laboratoire LIG  (https://lig-getalp.imag.fr/).
  • La personne recrutée sera accueillie au sein de l?équipe qui offre un cadre de travail stimulant, multinational  et agréable. 
  • Les moyens pour mener à bien le (post)doctorat seront assurés tant en ce qui concerne les missions en France et à l?étranger qu?en ce qui concerne le matériel (ordinateur personnel, accès aux serveurs GPU du LIG, Grille de calcul Jean Zay du CNRS).


Comment postuler ?

 

  • Pour postuler sur une thèse de doctorat, les candidats doivent être titulaires d'un Master en informatique, en apprentissage machine ou en traitement automatique du langage naturel (obtenu avant le début du contrat doctoral, les étudiants actuellement en master 2 peuvent ainsi postuler).
  • Pour postuler sur un postdoctorat, les candidats doivent être titulaires d'une thèse de doctorat en informatique,  en apprentissage machine ou en traitement automatique du langage naturel (obtenu avant le début du contrat doctoral, les étudiants dont la soutenance est prévue avant fin septembre 2022 peuvent ainsi postuler).
  • Ils doivent avoir une bonne connaissance des méthodes d?apprentissage automatique et idéalement une expérience en collecte et gestion de corpus.
  • Ils doivent également avoir une bonne connaissance de la langue française. 
Les candidatures doivent contenir : CV + lettre/message de motivation + notes de master + lettre(s) de recommandations; et être adressées à Benjamin Lecouteux (benjamin.lecouteux@univ-grenoble-alpes.fr), Gilles Sérasset (gilles.serasset@univ-grenoble-alpes.fr) et Didier Schwab (Didier.Schwab@univ-grenoble-alpes.fr
Back  Top

6-12(2022-04-04) PhD position at INRIA-LORIA, Nancy, France

2022-04676 - PhD Position F/M Nongaussian models for deep learning based audio signal processing Level of qualifications required :Graduate degree or equivalent Fonction : PhD Position Context The PhD student will join the Multispeech team of Inria,that is the largest French research group in the field of speech processing. He/she will benefit from the research environment and the expertise in audio signal processing and machine learning of the team, which includes many researchers, PhD students, post-docs, and software engineers working in this field. He/she will be supervised by Emmanuel Vincent (Senior Researcher, Inria), and Paul Magron (Researcher, Inria). Assignment Audio signal processing and machine listeningsystems have achieved considerable progress over the past years, notably thanks to the advent of deep learning. Such systems usually process a timefrequency representation of the data, such as a magnitude spectrogram, and model its structure using a deep neural network (DNN). Generally speaking, these systems implicitly rely on the local Gaussian model [1],that is an elementary statistical model for the data. Even though it is convenient to manipulate, this model builds upon several hypotheses which are limiting in practice: (i) circular symmetry, which boils down t o discarding the phase information (= the argument of the complex-valued time-frequency coefficients); (ii) independence of the coefficients, which ignores the inherent structure of audio signals (temporal dynamics, frequency dependencies); and (iii)Gaussian density, which is not observed in practice. Statistical audio signal modeling is an active research field. However, recent advances in this field are usually not leveraged in deep learning-based approaches, thus their potential is currently underexploited. Besides, some of these advances are not mature enough to be fully deployed yet. Therefore, the objective of this PhD is to design advanced statistical signal models for audio which overcome the limitations of the local Gaussian model, while combining them with DNN-based spectrogram modeling. The developed approaches will be applied to audio source separation and speech enhancement. Main activities The main objectives of the PhD student will be: 1. To develop structured statistical models for audio signals, which alleviate the limitations of the local Gaussian model. In particular, t he PhD student will focus on designing models by leveraging properties that originate from signal analysis, such as the temporal continuity [2] or the consistency of the representation [3], in order to favor interpretability and meaningfulness of the models. For instance, alpha-stable distributions have been exploited in audio for their robustness [4]. Anisotropic models are an interesting research direction since they overcome the circular symmetry assumption, while enabling an interpretable parametrization of the statistical moments [5]. Finally, a careful design of the covariance matrix allows for explicitly incorporating time and frequency dependencies [6]. 2. To combine these statistical models withDNNs. This raises several technical difficulties regarding the design of, e.g., the neural architecture, the loss function, and the inference algorithm. The student will exploit and adapt the formalism developed in Bayesian deep learning, notably the variational autoencoding framework [7], as well as the inference procedures developed in DNN-free nongaussian models [8]. 3. To validate experimentally these methods on realistic sound datasets. To that end, the PhD student will use public datasets such as LibriMix (speech) and MUSDB (music), which are reference datasets for source separation and speech enhancement. The PhD student will disseminate his/her research results in international peer-reviewed journals and conferences. In order to promote reproducible research, these publications will be self-archived at each step of the publication lifecycle, and accessible through open access repositories (e.g., arXiv, HAL). The code will be integrated to Asteroid, that is the reference soDware for source separation and speech enhancement developed by Multispeech. Bibliography [1] E. Vincent, M. Jafari, S. Abdallah, M. Plumbley, M. Davies,Probabilistic modeling paradigms for audio source separation, Machine Audition: Principles, Algorithms and Systems, p.162–185, 2010. [2] T. Virtanen, Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria, IEEE/ACM Transactions on Audio, Speech and Language Processing, Vol. 15, no. 3, pp.1066-1074, 2007. [3] J. Le Roux, N. Ono, S. Sagayama, Explicit consistency constraints for STFT spectrograms and their application to phase reconstruction, Proc. SAPA, 2008. [4] S. Leglaive, U. Şimşekli, A. Liutkus, R. Badeau and G. Richard,Alpha-stable multichannel audio source separation, Proc. IEEE ICASSP, 2017. [5] P. Magron, R. Badeau, B. David, Phase-dependent anisotropic Gaussian model for audio source separation, Proc. IEEE ICASSP, 2017. [6] M. Pariente, Implicit and explicit phase modeling in deep learning-based source separation, PhD thesis - Université de Lorraine, 2021. [7] L. Girin, S. Leglaive, X. Bie,J. Diard, T. Hueber, X. Alameda-Pineda,Dynamical variational autoencoders: A comprehensive review, Foundations and Trends in Machine Learning, vol. 15, no. 1-2, 2021. General Information Theme/Domain : Language, Speech and Audio Town/city : Villers lès Nancy Inria Center : CRI Nancy - Grand Est Starting date : 2022-10-01 Duration of contract : 3 years Deadline to apply : 2022-05-02 Contacts Inria Team : MULTISPEECH PhD Supervisor : Magron Paul / paul.magron@inria.fr About Inria Inria is the French national research institute dedicated to digital science and technology. It employs 2,600 people. Its 200 agile project teams, generally run jointly with academic partners, include more than 3,500 scientists and engineers working to meet the challenges of digital technology, oDen at the interface with other disciplines. The Institute also employs numerous talents in over forty different professions. 900 research support staff contribute to the preparation and development of scientific and entrepreneurial projects that have a worldwide impact. The keys to success Upload your complete application data. Applications will be assessed on a rolling basis, thus it is advised to apply as soon as possible. Instruction to apply Defence Security : This position is likely to be situated in a restricted area (ZRR), as defined in Decree No. 2011-1425 relating to the protection of national scientific and technical potential (PPST).Authorisation to enter an area is granted by the director of the unit, following a favourable Ministerial decision, as defined in the decree of 3 July 2012 relating to the PPST. An unfavourable Ministerial decision in respect of a position situated in a ZRR would result in the cancellation of the appointment. Recruitment Policy : As part of its diversity policy, all Inria positions are accessible to people with disabilities. Warning : you must enter your e-mail address in order to save your application to Inria. Applications must be submitted online on the Inria website. Processing of applications sent from other channels is not guaranteed. [8] P. Magron, T. Virtanen, Complex ISNMF: a phase-aware model for monaural audio source separation, IEEE/ACM Transactions on Audio, Speech and Language Processing, Vol. 27, no. 1, pp. 20-31, 2019. Skills Master or engineering degree in computer science, data science, signal processing, or machine learning. Professional capacity in English (spoken, read, and written). Some programming experience in Python andin somedeep learning framework (e.g., PyTorch). Previous experience and/or interest for speech and audio processing is a plus. Benefits package Restauration subventionnée Transports publics remboursés partiellement Congés: 7 semaines de congés annuels + 10 jours de RTT (base temps plein) + possibilité d'autorisations d'absence exceptionnelle (ex : enfants malades, déménagement) Possibilité de télétravail (après 6 mois d'ancienneté) et aménagement du temps de travail Équipements professionnels à disposition (visioconférence, prêts de matériels informatiques, etc.) Prestations sociales, culturelles et sportives (Association de gestion des œuvres sociales d'Inria) Accès à la formation professionnelle Sécurité sociale Remuneration Salary: 1982€ gross/month for 1st and 2 year. 2085€ gross/month for 3rd year. Monthly salary after taxes : around 1594€ for 1st and 2 year. 1677€ for 3rd year

Back  Top

6-13(2022-04-06) Ph.D. Thesis position and Post-doc position at Loria-INRIA, Nancy, France
Ph.D. Thesis position and Post-doc position at Loria-INRIA, Mutlispeech team, Nancy (France)
 
Multimodal automatic hate speech detection
 
https://jobs.inria.fr/public/classic/fr/offres/2022-04660
https://team.inria.fr/multispeech/fr/category/job-offers/
 
Hate speech expresses antisocial behavior. In many countries, online hate 
speech is punishable by the law. Manual analysis of such content and its moderation 
are impossible. An effective solution to this problem would be the automatic 
detection of hate comments. Until now, for hate speech detection, only text 
documents have been used. We would like to advance the knowledge about hate speech 
detection by exploring a new type of document: audio documents.
 
We would like to develop a new methodology to automatically detect hate speech, 
based on Machine Learning and Deep Neural Networks using text and audio.
 
Required Skills: The candidate should have theoretical and a moderate practical experience
with Deep Learning, including a good practice in Python and an understanding of deep
learning libraries like Pytorch. The knowledge of NLP or signal processing will be helpful. 
 
Supervisors:
Irina Illina, Associate Professor, HDR, Lorrain University
Dominique Fohr, Senior Researcher, CNRS
https://members.loria.fr/DFohr/    dominique.fohr@loria.fr
 
 
MULTISPEECH is a joint research team between the Université of Lorraine, Inria, 
and CNRS. It is part of department D4 “Natural language and knowledge processing” 
of LORIA. Its research focuses on speech processing, with particular emphasis to 
multisource (source separation, robust speech recognition), multilingual (computer 
assisted language learning), and multimodal aspects.


Back  Top

6-14(2022-04-07) Postes de traducteur peul-français (H/F), ELDA, Paris, France
Contexte
ELDA (Evaluation and Language resources Distribution Agency, www.elda.org) a pour activités principales la distribution et la production de ressources linguistiques, ainsi que l’évaluation de technologies de la langue (traduction automatique, reconnaissance de la parole, etc.).

Dans le cadre de ses activités de production, ELDA offre plusieurs postes de traducteur peul-français (H/F) à temps plein ou partiel pour la constitution d’un corpus de traduction de la langue.

Mission
Il s’agira de traduire à partir des documents audios et de leurs transcriptions, du peul vers le français afin de fournir les données nécessaires au développement et à l'évaluation de technologies de la langue peul. Le travail sera effectué selon des conventions de traduction sur lesquelles les candidats seront formés.

Profil recherché
• Natif de la langue peul, plus précisément de Mâssina
• Une première expérience en traduction du peul vers le français est souhaitable
• Excellente maîtrise du français écrit (orthographe, grammaire, syntaxe) pour la traduction et la compréhension des conventions de traduction
• Bonne maîtrise de l’outil informatique
• Capacité à intégrer des règles (de traduction) et à les suivre scrupuleusement et avec constance
 
Durée
Temps plein ou mi-temps, pour une durée de 5 mois minimum

Salaire
Salaire mensuel brut: 1925€

Les candidatures (CV, lettre de motivation) doivent être adressées à lucille@elda.org



Back  Top

6-15(2022-04-07) Postes de transcripteur peul-français (H/F), ELDA, Paris, France

Contexte
ELDA (Evaluation and Language resources Distribution Agency, www.elda.org) a pour activités principales la distribution et la production de ressources linguistiques, ainsi que l’évaluation de technologies de la langue (traduction automatique, reconnaissance de la parole, etc.).

Dans le cadre de ses activités de production, ELDA offre plusieurs postes de traducteur peul-français (H/F) à temps plein ou partiel pour la constitution d’un corpus de traduction de la langue.

Mission
Il s’agira de traduire à partir des documents audios et de leurs transcriptions, du peul vers le français afin de fournir les données nécessaires au développement et à l'évaluation de technologies de la langue peul. Le travail sera effectué selon des conventions de traduction sur lesquelles les candidats seront formés.

Profil recherché
• Natif de la langue peul, plus précisément de Mâssina
• Une première expérience en traduction du peul vers le français est souhaitable
• Excellente maîtrise du français écrit (orthographe, grammaire, syntaxe) pour la traduction et la compréhension des conventions de traduction
• Bonne maîtrise de l’outil informatique
• Capacité à intégrer des règles (de traduction) et à les suivre scrupuleusement et avec constance
 
Durée
Temps plein ou mi-temps, pour une durée de 5 mois minimum

Salaire
Salaire mensuel brut: 1604€

Les candidatures (CV, lettre de motivation) doivent être adressées à lucille@elda.org

Back  Top

6-16(2022-04-07) Contrat doctoral au Collegium Musicæ de Sorbonne Université, Paris, France

Le Collegium Musicæ de Sorbonne Université propose un contrat doctoral sur le style vocal :

 Analyse par la Synthèse performative du style vocal - Porteurs de projet : Christophe d'Alessandro  et Céline  Chabot-Canet

Le propos de cette thèse est l’étude du style vocal par le paradigme d’analyse par la synthèse
performative. Ce sujet associe recherche musicologique sur le style vocal et la musicologie de la
performance, recherche musicale sur les instruments chanteurs, et recherche en informatique musicale
sur les nouvelles interfaces pour l’expression musicale et les synthétiseurs vocaux temps-réel.

des détails se trouvent ici (aller sur l'onglet Collegium Musicæ de la page) :

https://www.sorbonne-universite.fr/projets-proposes-en-2022-programme-instituts-et-initiatives

Contact:

Christophe d'Alessandro : christophe.dalessandro@sorbonne-universite.fr

Back  Top

6-17(2022-04-08) Postdocs at IMT Atlantique, Brest, France

L'équipe RAMBO de IMT Atlantique, en collaboration avec Smart Macadam, recherche des candidats pour

deux postdocs de 18-24 mois basés à Nantes sur les sujets suivants:

1. Postdoctorat sur la reconnaissance de parole (Nantes & Brest, France)
https://institutminestelecom.recruitee.com/l/fr/o/postdoctorante-ou-postdoctorant-reconnaissance-

automatique-de-la-parole-intelligence-artificielle-cdd-18-mois

2. Postdoctorat sur la reconnaissance de sons (Nantes & Brest, France)
https://institutminestelecom.recruitee.com/l/fr/o/postdoctorante-ou-postdoctorant-reconnaissance-de-sons-

intelligence-artificielle-cdd-18-mois

Date limite pour soumettre votre candidature (actualisée): 30 avril 2022

Nous vous encourageons de postuler si vous êtes intéressés !
N'hésitez pas de me contacter si vous avez des questions concernant ces positions.

Cordialement,

Mihai ANDRIES

Enseignant-chercheur
Équipe RAMBO
IMT Atlantique
Brest, France
Back  Top

6-18(2022-04-10) PhD thesis positions, LaBRI, Bordeaux, France

Vocal biomarkers collected through conversational agents for diagnosis assistance and follow-up of

sleep and mental disorders 

Location : Bordeaux, France

Supervisor : Jean-Luc Rouas, CR CNRS HDR, LaBRI
Co-supervisor : Pierre Philip, PU-PH, Sanpsy

Duration : 36 mois
Starting date  : 01/10/2022
Gross salary : 2 135,00 € / month

https://emploi.cnrs.fr/Offres/Doctorant/UMR5800-MAGHIN-017/Default.aspx

Detailed subject: 


The SANPSY and the Labri teams have demonstrated their ability to identify new vocal biomarkers to measure excessive daytime sleepiness both subjectively and objectively in patients suffering from sleep disorders [1]–[5]. SANPSY demonstrated the validity of autonomous numeric solutions (I.e. smartphone based virtual agents) to diagnose sleep/mental disorders in the general population [6]– [10]. We plan now to develop new virtual agents collecting biomarkers (i.e. from speech) in our healthy subjects and patients cohorts for diagnostic, treatment and follow-up (ADDICTAQUI, KANOPEE and AUTONOMHEALTH(PEPR) projects).

The PhD thesis project “Vocal biomarkers collected through conversational agents for diagnosis assistance and follow-up of sleep and mental disorders“ relies on 4 stages:
1) developing new virtual agents to collect vocal markers:
The objective is to design new scenarios targeting behavioral interventions to improve fatigue, mood and excessive daytime sleepiness. Moreover, the scenarios will be designed so that the agent interacts with the subject in order to engage a discussion (spontaneous speech). This will lead to more ecological conditions that should increase the acceptability.
2) switching from high-quality controlled recordings made at the hospital to in-the-field unsupervised recordings using smartphones.
Our current vocal biomarkers are defined using a reading task and using high-quality microphones. The new interaction scenarios from task 1) will lead us to record spontaneous speech with smartphone microphones. This task will tackle the differences in recording conditions and their impact on our feature extraction pipeline.
3) verifying the relevance of the existing vocal markers when used with the new data and propose new features that could be used as high-level biomarkers such as lexical, syntactic and semantic cues.
Our features will have to be adapted to consider the versatile nature of spontaneous discourse which is a completely different speaking style from read speech. Spontaneous speech will however provide additional cues that could be used as high-level biomarkers such as lexical, syntactic and semantic markers.
4) studying the sensitivity and specificity of the selected biomarkers on diagnostic and follow up of symptoms and disorders with respect to other medical measures.
This final part of the PhD project will be addressed jointly by LaBRI and SANPSY and includes the clinical validation of the proposed approaches.

References:

[1]  V. P. Martin, G. Chapouthier, M. Rieant, J.-L. Rouas, and P. Philip, ‘Using reading mistakes as features for sleepiness detection in speech’, in 10th international conference on speech prosody 2020, Tokyo, Japan, May 2020, pp. 985–989. [Online]. Available: https://hal.archives- ouvertes.fr/hal-02495149
[2]  V. P. Martin, J.-L. Rouas, J.-A. Micoulaud-Franchi, and P. Philip, ‘The objective and subjective sleepiness voice corpora’, in 12th edition of its language resources and evaluation conference., Marseille, France, May 2020, pp. 6525–6533. [Online]. Available: https://hal.archives- ouvertes.fr/hal-02489433
[3]  V. P. Martin, J.-L. Rouas, and P. Philip, ‘Détection de la somnolence dans la voix : nouveaux marqueurs et nouvelles stratégies’, Trait. Autom. Lang., vol. 61, no. 2, p. 24, 2020.
[4]  V. P. Martin, J.-L. Rouas, F. Boyer, and P. Philip, ‘Automatic Speech Recognition Systems Errors for Objective Sleepiness Detection Through Voice’, in Interspeech 2021, Aug. 2021, pp. 2476– 2480. doi: 10.21437/Interspeech.2021-291.
[5]  V. P. Martin, J.-L. Rouas, J.-A. Micoulaud-Franchi, P. Philip, and J. Krajewski, ‘How to Design a Relevant Corpus for Sleepiness Detection Through Voice?’, Front. Digit. Health, vol. 3, p. 124, 2021, doi: 10.3389/fdgth.2021.686068.
[6]  L. Dupuy, J.-A. Micoulaud-Franchi, and P. Philip, ‘Acceptance of virtual agents in a homecare context: Evaluation of excessive daytime sleepiness in apneic patients during interventions by continuous positive airway pressure (CPAP) providers’, J. Sleep Res., vol. n/a, no. n/a, p. e13094, 2020, doi: https://doi.org/10.1111/jsr.13094.
[7]  L. Dupuy et al., ‘Smartphone-based virtual agents and insomnia management: A proof-of-concept study for new methods of autonomous screening and management of insomnia symptoms in the general population’, J. Sleep Res., p. e13489, Sep. 2021, doi: 10.1111/jsr.13489.
[8]  P. Philip et al., ‘Trust and acceptance of a virtual psychiatric interview between embodied conversational agents and outpatients’, Npj Digit. Med., vol. 3, no. 1, Art. no. 1, Jan. 2020, doi: 10.1038/s41746-019-0213-y.
[9]  P. Philip et al., ‘Virtual human as a new diagnostic tool, a proof of concept study in the field of major depressive disorders’, Sci. Rep., vol. 7, 2017.
[10]  P. Philip, S. Bioulac, A. Sauteraud, C. Chaufton, and J. Olive, ‘Could a virtual human be used to explore excessive daytime sleepiness in patients?’, Presence Teleoperators Virtual Environ., vol. 23, no. 4, pp. 369–376, 2014. 


Work environment: 
The PhD student will be hosted at LaBRI in the Image and Sound (I&S) department with frequent visits to SANPSY where he/she will interact with the clinicians and the designers of the virtual agents. 

The I&S department conducts research in acquisition, processing, analysis, modeling, synthesis and interaction of audiovisual media. It works on the entire acquisition chain from data collection to information extraction or restitution of digital data with the user at the center of the chain. The spectrum of manipulated data is very wide: 2D and 3D images, video, speech, music, 3D data, EEG, pysiological data, etc. The different steps of the processing chain integrate modeling phases for analysis or synthesis. The targeted application domains are: health, medical, education, gaming, etc.

The SANPSY unit has a recognized expertise in sleep restriction studies and in the evaluation of countermeasures to sleep deprivation. The team is also specialized in sleep disorders, especially obstructive sleep apnea diagnostic and treatment. The SANPSY unit is located on the neuro-psychopharmacological research platform (PRNPP). This platform is recognized nationally and internationally for its expertise in clinical research, simulation and virtual reality. It has been labeled IBISA in 2015. In 2011, SANPSY obtained an EquipEx project (PHENOVIRT) that aimed to improve phenotyping using simulation and virtual reality technologies. Part of this project, SANPSY has initiated, in particular, the development of Embodied Conversational Agents (virtual doctors and patients). Several scenarios for the diagnosis of drowsiness, depression and addiction to tobacco and alcohol have already been developed and tested in patients. 

Université de Bordeaux CNRS
Jean-Luc ROUAS 
CNRS Researcher
Bordeaux Computer Science Research Laboratory (LaBRI)
351 Cours de la libération - 33405 Talence Cedex - France
T. +33 (0) 5 40 00 35 28
www.labri.fr/~rouas
Back  Top

6-19(2022-04-15) Doc ou Postdoc au LIG, Grenoble, France
Date limite de candidature : 30 avril 2022

Sujet de thèse ou de postdoctorat dans le cadre du projet  Popcorn (projet collaboratif avec deux entreprises)
encadrée par Benjamin Lecouteux, Gilles Sérasset et Didier Schwab (Laboratoire d’Informatique de Grenoble, Groupe d’Étude en Traduction Automatique/Traitement Automatisé des Langues et de la Parole)

 

Titre : Peuplement OPérationnel de bases de COnnaissances et Réseaux Neuronaux

 

Le projet aborde le problème de l’enrichissement semi-automatisé d’une base de connaissance au travers de l’analyse automatique de textes. Afin d’obtenir une innovation de rupture dans le domaine du Traitement Automatique du Langage Naturel (TALN) pour les clients sécurité et défense, le projet se focalise sur le traitement du français (même si les approches retenues seront par la suite généralisables à d’autres langues). Les travaux aborderont différents aspects :
● L’annotation automatique de documents textuels par la détection de mentions d’entités présentes dans la base de connaissance et leurs désambiguïsation sémantique (polysémie, homonymie) ;
● La découverte de nouvelles entités (personnes, organisations, équipements, événements, lieux), de leurs attributs (âge d’une personne, numéro de référence d’un équipement, etc.), et des relations entre entités (une personne travaille pour une organisation, des personnes impliquées dans un événement, ...). Une attention particulière sera donnée au fait de pouvoir s’adapter souplement à des évolutions de l’ontologie, la prise en compte de la place de l’utilisateur et de l’analyste pour la validation/capitalisation des extractions effectuées.
Le projet se focalise autour des trois axes de recherches suivants :
● Génération de données synthétiques textuelles à partir de textes de référence ;
● La reconnaissance des entités d’intérêt, des attributs associés et des relations entre les entités.
● La désambiguisation sémantique des entités (en cas d’homonymie par exemple)

 

Profil recherché:
    - Solide expérience en programmation & machine learning pour le Traitement Automatique de Langues (TAL), notamment l’apprentissage profond
    - Master/Doctorat Machine Learning ou informatique, une composante TAL ou linguistique computationnelle sera un plus apprécié
    - Bonne connaissance du français

 

Détails pratiques:
    - Début de la thèse rentrée 2022
    - Contrat doctoral à temps plein au LIG (équipe Getalp) pour 3 ans (salaire: min 1768€ brut mensuel)
    - ou Contrat postdoctoral à temps plein au LIG (équipe Getalp) pour 20 mois (salaire: min 2395€ brut mensuel)



 

Environnement scientifique:

 

  • Le doctorat ou le postdoctorat sera mené au sein de l'équipe Getalp du laboratoire LIG  (https://lig-getalp.imag.fr/).
  • La personne recrutée sera accueillie au sein de l’équipe qui offre un cadre de travail stimulant, multinational  et agréable. 
  • Les moyens pour mener à bien le (post)doctorat seront assurés tant en ce qui concerne les missions en France et à l’étranger qu’en ce qui concerne le matériel (ordinateur personnel, accès aux serveurs GPU du LIG, Grille de calcul Jean Zay du CNRS).

 

Comment postuler ?

 

  • Pour postuler sur une thèse de doctorat, les candidats doivent être titulaires d'un Master en informatique, en apprentissage machine ou en traitement automatique du langage naturel (obtenu avant le début du contrat doctoral, les étudiants actuellement en master 2 peuvent ainsi postuler).
  • Pour postuler sur un postdoctorat, les candidats doivent être titulaires d'une thèse de doctorat en informatique,  en apprentissage machine ou en traitement automatique du langage naturel (obtenu avant le début du contrat doctoral, les étudiants dont la soutenance est prévue avant fin septembre 2022 peuvent ainsi postuler).
  • Ils doivent avoir une bonne connaissance des méthodes d’apprentissage automatique et idéalement une expérience en collecte et gestion de corpus.
  • Ils doivent également avoir une bonne connaissance de la langue française. 
Les candidatures doivent contenir : CV + lettre/message de motivation + notes de master + lettre(s) de recommandations; et être adressées à Benjamin Lecouteux (benjamin.lecouteux@univ-grenoble-alpes.fr), Gilles Sérasset (gilles.serasset@univ-grenoble-alpes.fr) et Didier Schwab (Didier.Schwab@univ-grenoble-alpes.fr
Back  Top

6-20(2022-04-18) PhD studentship at the University of Edinburgh,UK
Hi all,
Here is an offer for a PhD studentship in modelling the articulation of spoken utterances at the University of Edinburgh
The PhD work will be to implement computational models of human speech articulation planning. It will involve the development of software for testing theoretical assumptions, along with tests of software output. The work combines phonetic and phonology aspects, speech technology and motor control theory with programming and software development.
The deadline to apply is 15th May 2022.
 
Information about eligibility and the application process may be found at:
https://www.ed.ac.uk/ppls/linguistics-and-english-language/prospective/postgraduate/funding-research-students/erc-phd-studentship-articulation-spoken-utterances
 
Contacts:
- Alice Turk: a.turk@ed.ac.uk
- Benjamin Elie: benjamin.elie@ed.ac.uk
Back  Top

6-21(2022-04-15) PhD chez Orange, France

Orange recrute un.e doctorant.e sur le sujet 'Deep learning pour le traitement conjoint du langage naturel et des connaissances'.


L’objectif de la thèse est de proposer des solutions pour mutualiser le traitement de tâches de compréhension et génération du langage naturel. Il s’agira ainsi d’étudier la fusion progressive de diverses tâches mêlant langage naturel et langage(s) formel(s) de représentation ou manipulation de connaissances. Le contexte d’application sera tout d’abord celui d’énoncés isolés, puis celui de dialogues humain-machine où l’historique de discussion doit être pris en compte.

 

Détails et candidature via Orange Jobs : https://orange.jobs/jobs/offer.do?joid=111967&lang=FR

Back  Top

6-22(2022-04-17) PhD at University of Zurich, Switzerland

Human knowledge is inherently multi-modal, and it is more than just a collection of isolated pieces of information, irrespective of the form of expression. Instead, it emerges from the interconnectedness of all of these information fragments. Knowledge graphs are a powerful way of capturing such interconnected knowledge. Such graphs are effective for storing and relating information that can easily be expressed in textual form, by assigning a simple text label to every node in a graph or relating them to literals represented using strings or blobs. However, they so far fail to capture the richness of information that is not easily expressed as a short piece of text.

The ‘MediaGraph' project aims to extend the concept of a knowledge graph with all types of information which are currently representable as multimedia to be able to capture the richness of human knowledge. In contrast to a knowledge graph whose nodes are associated with a textual label (with a specification arising from relations to other entities and labels), the nodes in a media graph will be able to represent and interrelate any part of any multimedia document. The resulting graph will not only describe the semantic but also the stylistic and technical relations between the documents and their components and form the basis for novel media interaction paradigms.

For this project, we are seeking a motivated PhD student to help make MediaGraph a reality. Requirements include an MSc in Computer Science or a related discipline, a background in both theoretical and applied aspects of computer science as well as a passion for discovering new things. Experience in the areas of databases, data management, semantic web technologies, multimedia processing, multimedia analysis, machine learning, and/or signal processing is considered a plus. The PhD Student will be in charge of the development of manual as well as automated construction methods for MediaGraphs and will define and own some of the practical use-cases to which MediaGraph will be applied in practice. They will also contribute to the design and implementation of representation, querying, and evaluation mechanisms for the graphs.

To apply, please gather your curriculum vitae, all grade transcripts, selected publications (if available), a list of at least three references, and your BSc/MSc theses as PDF files and go to https://www.apply.dsi.uzh.ch/position/5996546.

The University of Zurich is committed to enhancing the number of women in scientific positions and, therefore, particularly invites women to apply. Women who are as qualified for the position in question as male applicants will be given priority.

For more information on the project and the research group, visit https://www.ifi.uzh.ch/en/ddis.html.

Back  Top

6-23(2022-04-19) Postdoctoral Position at Columbia University in The City of New York, NY, USA

Postdoctoral Position - Machine Learning and Digital Twins, Columbia University in The City of New York.

  • For candidates with a PhD degree in CS, EE, Data Sciences, or a related major.
  • Two years, starting before September 2022.
  • Mentored by Prof. Zoran Kostic, Columbia University Electrical Engineering Department and Data Sciences Institute.
  • Funded by the NSF Cyber Physical Systems Program.
  • Collaboration across multiple departments at Columbia School of Engineering and Data Sciences Institute.
  • For detailed description and instructions on how to express interest, see https://www.aidl.ee.columbia.edu/postdoc .
Back  Top

6-24(2022-04-20) PhD grant at the University of Edinburg, UK
The University of Edinburghis  looking for a PhD candidate to work on modelling the articulation of spoken utterances as part of Alice Turk's Advanced ERC grant. The 4 year PhD studentship at the University of Edinburgh will be fully funded by the grant. We are looking for candidates with decent programming skills, and an interest in speech analysis and modelling:
 
Back  Top

6-25(2022-04-23) Doctoral position : Acoustic to Articulatory Inversion by using dynamic MRI images, INRIA, Nancy, France

Doctoral position : Acoustic to Articulatory Inversion by using dynamic MRI images

 

Loria “Lorraine Research Laboratory in Computer Science and its Applications” is a research unit common to CNRS, the Université de Lorraine and INRIA. Loria gathers 450 scientists and its missions mainly deal with fundamental and applied research in computer sciences, especially the MultiSpeech Team which focuses automatic speech processing, audiovisual speech and speech production. IADI is a research unit common to Inserm the Université de Lorraine whose specialty is developing various techniques and methods to improve imaging of moving organs via the acquisition of MR images.

 

This PhD project founded by LUE (Lorraine Université d’Excellence) associates the Multispeech team and the IADI laboratory.

 

Start date is (expected to be) 1st October 2022 or as soon as possible thereafter.

 

Supervisors

Yves Laprie, email yves.laprie@loria.fr

Pierre-André Vuissoz, email pa.vuissoz@chru-nancy.fr

 

The project

 

Articulatory synthesis mimics the speech production process by first generating the shape of the vocal tract from the sequence of phonemes to be pronounced, then the acoustic signal by solving the aeroacoustic equations. Compared to other approaches to speech synthesis which offer a very high level of quality, the main interest is to control the whole production process, beyond the acoustic signal alone.

The objective of this PhD is to succeed in the inverse transformation, called acoustic to articulatory inversion, in order to recover the geometric shape of the vocal tract from the acoustic signal. A simple voice recording will allow the dynamics of the different articulators to be followed during the production of the sentence.

Beyond its interest in terms of scientific challenge, articulatory acoustic inversion has many potential applications. Alone, it can be used as a diagnostic tool to evaluate articulatory gestures in an educational or medical context.

 

Description of work

 

The objective is the inversion of the acoustic signal to recover the temporal evolution of the medio-sagittal slice. Indeed, dynamic MRI provides two-dimensional images in the medio-sagittal plane at 50Hz of very good quality and the speech signal acquired with an optical microphone can be very efficiently deconstructed with the algorithms developed in the MultiSpeech team (examples available on https://artspeech.loria.fr/resources/). We plan to use corpora already acquired or in the process of being acquired. These corpora represent a very large volume of data (several hundreds of thousands of images) and an approach for tracking the contours of articulators in MRI images which gives very good results was developed to process corpora. The automatically tracked contours can therefore be used to train the inversion. The goal is to perform the inversion using the LSTM approach on data from a small number of speakers for which sufficient data exists. This approach will have to be adapted to the nature of the data and to be able to identify the contribution of each articulator. In itself, successful inversion to recover the shape of the vocal tract in the medio-sagittal plane will be a remarkable success since the current results only cover a very small part of the vocal tract (a few points on the front part of the vocal tract). However, it is important to be able to transpose this result to any subject, which raises the question of speaker adaptation, which is the second objective of the PhD.

 

What we offer

  • A position funded by LUE (Lorraine Université d’Excellence) at a leading technical university that generates knowledge and skills for a sustainable futur.
  • A very complementary scientific environment of the two teams (MultiSpeech and IADI) in all fields of MRI and anatomy in the IADI laboratory and in deep learning and speec processing in the MultiSpeech team of Lori.

  • Engaged and ambitious colleagues along with a creative, international and dynamic working environmen.

  • At Loria, there are lively research groups in a number of areas, for example natural language processing, deep learning, computer graphics, robotics… At the moment, there are about 150 PhD students at Loria and IADI.

  • Works in the very center of Europe in close proximity to nature.

  • Help to relocate and be settled in France and at Université de Lorraine.

 

Supervisors

Yves Laprie, email yves.laprie@loria.fr

Pierre-André Vuissoz, email pa.vuissoz@chru-nancy.fr

 

Application

Your application including all attachments must be in English and submitted electronically by clicking APPLY NOW below.

Please include

  1. Motivated letter of application (max. one page)

  2. Your motivation for applying for the specific PhD project

  3. Curriculum vitae including information about your education, experience, language skills and other skills relevant for the position

  4. Publication list (if possible)

  5. Reference letters (if available)


The deadline for applications is Friday 13 May 2022, 23:59 GMT +2
.

log into Inria’s recruitment system (https://jobs.inria.fr/public/classic/en/offres/2022-04654in order to apply to this position.

Back  Top

6-26(2022-04-26) Position of University Assistants (prae doc), University of Vienna, Austria


The University of Vienna seeks to fill the position starting Oct 1, 2022
of University Assistants (prae doc)
at the Research Group MIS - Multimedia Information Systems
at the Faculty of Computer Science, University of Vienna
under the supervision of Prof. Klas
Reference number: “#01 PC3: Multimedia Information Systems, Prof. Klas'

Please see the job announcement at
https://docs.univie.ac.at/how-to-apply/pooled-calls/

Please note that good German language skills are required for this position due to the role in teaching.
==
The Faculty of Computer Science of the University of Vienna has world-leading researchers in Computer Science who pursue basic as well as applied research. The UniVie Doctoral School Computer Science (DoCS) builds an essential framework to foster excellence in research and teaching. Its main focus is young prospective researchers eager to make an impact on both basic research as well as applied problems with collaborations across the University and beyond. The DoCS aims to provide these young researchers with the broad knowledge and expertise needed to perform Computer Science research at the highest achievable quality. The Doctoral School trains doctoral candidates in solving basic as well as applied research questions of high relevance. The Research Group Multimedia Information Systems at the Faculty of Computer Science is looking for highly motivated research assistants, Ph.D. candidates. We are offering an excellent working environment in a young, creative, highly motivated, and international team, manifold opportunities for personal development, the possibility to take responsibility in research projects at an early stage, the opportunity for research stays abroad, excellent industrial contacts, and intensive support in the course of a dissertation.

Duration of employment: 4 years (The announcement is made for four years, whereby the employment relationship is initially limited to 1.5 years and is automatically extended to a total of four years unless the employer submits a declaration of non-renewal after a maximum of 12 months.)

Extent of employment: 30.0 hours/week
Job grading in accordance with collective bargaining agreement: §48 VwGr. B1 Grundstufe
(praedoc) with relevant work experience determining the assignment to a particular salary
grade.

Job description
Participation in research, teaching, and administration:
- Participation in research projects/research studies
- Participation in publications/academic articles/presentations
- We expect the successful candidate to sign a doctoral thesis agreement within 12-18 months.
- Participation in teaching and independent teaching of courses as defined by the collective agreement
- Supervision of students
- Involvement in the organisation of meetings, conferences, symposiums
- Involvement in the department administration as well as in teaching and research administration
The research should focus on techniques for the automated detection of misinformation in multimedia content.

Profile
- Master in Computer Science, or Media Technologies, or Business Informatics (or equivalent qualification, with a strong focus on Computer Science) with excellent marks.
- Excellent command of written and spoken German and English.
- High ability to express yourself both orally and in writing
- Strong motivation to work in a team environment.
- Strong motivation to publish at top-refereed conferences/journals.
- Perseverance, focus, integrity, ability to make things done, leadership in organization and partnership/communication with other project teams.
- Comprehensive IT user skills

Desirable qualifications are
- Excellent knowledge in the fields of multimedia systems, digital media technologies, semantic analysis of multimedia content, techniques for the detection of misinformation, user interface, and feedback systems, various programming languages (Java, C, C++, C#, Python, JavaScript), technologies in the area of Internet-of-Things, web-based systems, databases, blockchain systems, cloud computing, formal methods in CS
- Expertise in various application areas of Computer Science
- Proven interest in scientific work and in scientific publishing
- Basic experience in research methods and academic writing
- Knowledge of university processes and structures - Experience in remote teaching using modern IT infrastructures

Application documents
- Curriculum vitae
- Letter of Motivation including ideas for a prospective doctoral project proposal
- Abstract of master thesis
- Degree certificates
- List of publications, evidence of teaching experience (if available)

Applications need to be submitted via the recruiting tool Apply@DoCS | Servicedesk Universität Wien (univie.ac.at), mentioning reference number “#01 PC3: Multimedia Information Systems, Prof. Klas”.
Further details are available here (also how you set up an u:account): https://docs.univie.ac.at/apply.

The University pursues a non-discriminatory employment policy and values equal opportunities, as well as diversity (http://diversity.univie.ac.at/). The University lays special emphasis on increasing the number of women in senior and academic positions. Given equal qualifications, preference will be given to female applicants.

The candidates who are selected for the position join the DoCS as doctoral student members.

Back  Top

6-27(2022-04-30) 3 PhD fellowships at the University of Copenhaguen, Denmark

3 PhD fellowships in applied Machine Learning, Information Retrieval and Natural Language Processing

The Information Retrieval Lab of the Department of Computer Science at the University of Copenhagen (DIKU) is offering 3 fully funded PhD Fellowships in applied Machine Learning, Information Retrieval, and Natural Language Processing, commencing 1 September 2022 or as soon as possible thereafter.

The fellows will conduct research, having as starting point the following broad research areas:
  • a fully-funded PhD in interpretability of applied machine learning;
  • a fully-funded PhD in overparameterization and generalizability in deep neural architectures;
  • a fully-funded PhD in web & information retrieval;
 
We are looking for candidates with a MSc degree in a subject relevant for the research area. The successful candidate is expected to have strong grades in Machine Learning and/or Information Retrieval and/or Natural Language Processing. Successful candidates should have a preliminary research record as witnessed by a master thesis or publications in the area.

The deadline for applications is 19 May 2022, 23:59 GMT +2.
 
Back  Top

6-28(2022-05-06) PhD students, Postdoctoral Researchers and R&D Engineers at Telecom Paris, Palaiseau, France

We have multiple openings for PhD studentsPostdoctoral Researchers and R&D Engineers at Télécom Paris, Institut polytechnique de Paris, in the “Signal, Statistics and Learning (S2A) team. 

 

All positions are located at Telecom Paris, 19 place Marguerite Perey, 91120 Palaiseau, France.

 

Start of the positions: October/November 2022 (for PhDs/Engineer), January 2023 for PostDoc

 

Subject:

The positions will be a part of the ERC Advanced (2022) – HI-Audio (Hybrid and Interpretable Deep neural audio machines) project, which aims at building hybrid deep approaches combining parameter-efficient and interpretable models with modern resource-efficient deep neural architectures with applications in speech/audio scene analysis, music information retrieval and sound transformation and synthesis.

 

The potential topics include (and are not limited to):

- Deep generative models, adversarial learning

- Attention-based models and curriculum learning

- Statistical/deterministic audio models (signal models, sound propagation models,…)

- Music Information Retrieval software platform development (R&D Engineer position)

 

 

Candidate Profile: 

- For the Phd positions: A masters degree in applied mathematics, datascience/computer science or speech/audio/music processing is required.

- For the Postdoc position: PhD degree and publications in theory or applications of machine learning, generative modelling, discrete optimal transport or signal processing, ideally with applications to Speech/Audio/Music signals.

- Master internship positions will also be open in early 2023.

 

Télécom Paris, and the S2A  team:

 

The S2A team gathers 18 permanent faculties covering a wide variety of research topics including Statistics, Probabilistic modeling, Machine learning, Data science, Audio and social signal processing. On the overall, Télécom Paris’ research counts 19 research teams and covers various domains in computer science and networks, applied mathematics, electronics, image, data, signals and economic and social sciences. Télécom Paris (https://www.telecom-paris.fr/en/home) is a member of IMT (Institut Mines-Télécom), and is a founding member of the Institut Polytechnique de Paris (IP Paris, https://www.ip-paris.fr/en), a world-class scientific and technological institution which is a partnership between five prestigious French engineering schools  with HEC as a key partner.

 

Application:
- There is no specific deadline. Applications are welcome until all positions are filled.

- In the application, please send a resume, a motivation letter (and full transcript grades for Phd/Engineer positions) to Gaël Richard, firstname.lastname@telecom-paris.fr. At least one reference letter will be asked in a second step.

 

Back  Top

6-29(2022-05-15) PhD position, Loria, Nancy

Multichannel Speech Enhancement for Patients with Auditory Neuropathy Spectrum Disorders

Location: LORIA, MULTISPEECH team, Nancy

Supervisors: Romain Serizel (Maître de Conférences, Université de Lorraine), Paul Magron (Chargé de Recherche, INRIA).

This PhD fits within the scope of the ANR project 'REFINED' involving the Multispeech research team in (LORIA), Nancy, the Laboratory of Embedded Artificial Intelligence in CEA (List) in Paris, and the Hearing Institute in Paris. Context Worldwide, around 466 million people currently suffer from a hearing loss. To remedy the loss of hearing sensitivity, portable hearing aids have been designed for almost a century. Regardless of the recent advances in audio signal processing integrated in current hearing aids models, people suffering from Auditory Neuropathy Spectrum Disorders enjoy little or no benefit from current hearing aids [1]. Contrary to regular hearing losses, Auditory Neuropathy Spectrum Disorders impair the processing of temporal information without necessarily affecting auditory sensitivity. This can have a particularly dramatic impact in scenarios where the speech of interest is present together with some background noise or with one or several concurrent speaker(s). Current speech enhancement systems are usually trained on generic corpora, and they are designed to optimize some cost between the target (known) speech and the output of the system, which is estimated from the mixture, such as the mean squared error [2] or the speech-to-distortion ratio [3]. The trained system is then evaluated using a criterion that is designed to reflect the speech perception from people without hearing losses [4]. Yet, the main need of subjects with Auditory Neuropathy Spectrum Disorders, shared with ageing subjects who experience central auditoryprocessing difficulties, is not to restore audibility but to improve their speech intelligibility, particularly in noisy environments, by compensating for the deterioration of acoustic cues that rely on temporal precision [5]. Objectives Based on clinical studies performed at the Hearing Institute within the project, the main goal of this PhD is to define new cost functions to be optimized by the speech processing algorithms that are more relevant for subjects with Auditory Neuropathy Spectrum Disorders than generic losses used in current algorithms. We will pay particular attention to the algorithms’ ability to help volunteers in scenarios with multiple potential target sources that are spatially distributed in a room. We will derive the speech enhancement filters aiming to extract not only speech, but also additional cues such as speech contour or timbre. In a latter step, the model will be adapted under light human supervision in order to reduce the burden of the usual iterative “handcrafted” adjustments and repeated visits with a specialist clinician to fit the hearing aid to individual needs.

Profile • Strong background in audio signal processing or machine learning • Excellent programming skills • Excellent English writing and speaking skills Application Upload your application on ADUM (https://www.adum.fr/as/ed/voirproposition.pl? site=adumR&matricule_prop=43498#version) with the following: • CV • Cover letter • Recommendation letter • M1-M2 note transcript • Master thesis, if available

References [1] Berlin, C. I. et al. Multi-site diagnosis and management of 260 patients with auditory neuropathy/dys-synchrony (auditory neuropathy spectrum disorder). Int J Audiol 49, 30-43 (2010). [2] Doclo, S., Spriet, A., Wouters, J. & Moonen, M. Frequency-domain criterion for the speech distortion weighted multichannel Wiener filter for robust noise reduction. Speech Communication 49, 636-656 (2007). [3] Luo, Y., et al. FaSNet: Low-latency adaptive beamforming for multi-microphone audio processing. 2019 IEEE automatic speech recognition and understanding workshop (2019). [4] Vincent, E., Rémi G., and Cédric F. Performance measurement in blind audio source separation. IEEE transactions on audio, speech, and language processing 14.4, 1462-1469 (2006). [5] https://claritychallenge.github.io/clarity_CC_doc/docs/cpc1/cpc1_intro

Back  Top

6-30(2022-05-20) Docteur R&D informatique, NLP - F/H (CDD), Lunii, Paris

Lunii  logotype
LUNII PARIS · TÉLÉTRAVAIL HYBRIDE

Docteur R&D informatique, NLP - F/H (CDD)

 

Vous rejoindrez l’équipe Innovation sous la responsabilité de Samuel, chercheur en informatique, spécialisé dans la synthèse vocale. Vous prendrez en charge les aspects NLP nécessaires au développement d’un système de synthèse vocale narrative.

Vous aurez pour missions de :

  • Mettre en place une stratégie de recherche autour de l'analyse syntaxique et structurelle d’histoires.
  • Définir les procédés NLP (SpaCy, NLTK, Transformers…) qui permettront la constitution automatique d’un corpus à grande échelle destiné à la synthèse vocale narrative.
  • Penser le futur de la génération intelligente d’histoires à partir d’éléments clés, e.g. personnage principal, personnage secondaire, lieu, objet…

Liste non exhaustive.

Lunii recrute et reconnaît tous les talents : nous sommes profondément attaché·e·s à la mixité et à la diversité, on vous attend !


Au moins 7 postes d'ATER en section 27 sont ouverts (ou sur le point de l'être) à Nancy, avec intégration au laboratoire LORIA[*] (www.loria.fr) pour la recherche. Cinq postes sont déjà publiés sur galaxie [*] :

27ATER0817 (Institut des science du Digital, Management et Cognition)
27MCF1050B (POLYTECH-Nancy)
27PR1139 (POLYTECH-Nancy)
27PR1316 (Faculté des Sciences et Technologies)
33MCF0129 (Ecole des Mines-Nancy)

et au moins deux autres sont prévus, l'un à POLYTECH-Nancy et l'autre à l'IUT Nancy-Charlemange.

Tous les détails, y compris les modalités de candidature et les dates limites de candidature, certaines très proches, sont disponibles sur le serveur galaxie [**].

Cordialement,

Slim Ouni.
 
 
[*] La liste des équipes du LORIA : https://www.loria.fr/fr/la-recherche/les-equipes/ 
Back  Top

6-33(2022-06-01) Research positions @ L3i Lab, La Rochelle, France

Cross-lingual and cross-domain terminology alignment

 

Interested in joining a young NLP group of 10+ people located in a historical town by the Atlantic Ocean? And walk 10 minutes from the lab to the beach. We have open positions in the context of recent Horizon 2020 projects: Embeddia and NewsEye as well as related local projects. In the last 2 years, we have among others published long papers in CORE A* and A conferences such as ACL, JCDL, CoNLL, ICDAR, COLING, ICADL, etc.  

Location: L3i laboratory, La Rochelle, France

Duration: 2 years (1+1), with possible further extension

Net salary range: 2100€-2300 € monthly

Context: H2020 Embeddia project and regional project Termitrad

Start: September 2022 (tentatively)

 


Keywords: terminology alignment, cross-lingual word embeddings, named-entity recognition and linking, deep/machine learning, statistical NLP, (text) mining.


Applications are invited for a postdoctoral researcher position around the topic of project Termitrad: keyword and terminology alignment 1) across languages and 2) across domains. In short, the overall objective of the project is to improve the relevance of the keywords describing research papers (and, time allowing, the quality of abstracts). One the one hand (cross-lingual alignment), we will rely on a corpora of journal articles with both French and English keywords and abstracts, both in as written by authors and in versions curated by experts. On the other hand (crossdomain alignment), we will work with use cases provided by researchers from different fields using different terms to describe similar concepts.

To address this very project, the project team will consist of senior staff, 2 post-doctoral researchers and 2-3 PhD students, one of which is jointly supervised in the Józef Stefan Institute in Ljubljana, coordinator of H2020 Embeddia. In this context, you will first be in charge of building a state of the art of existing related approaches, tools and resources, then to conduct further research and experiments, as well as participate in the supervision of PhD students.


Who we search for:

-    - PhD in statistical NLP, IR, or ML, ideally with further postdoctoral experience

-    - proven record of high-level publications in one or more of those fields

-    - fluency in written and spoken English (French language skills are welcome but unnecessary)

 

Applications including a CV and a one-page research statement discussing how the candidate's background fits requirements and topic are to be sent to by email to  antoine.doucet@univ-lr.fr, strictly with the subject 'Embeddia/Termitrad postdoc application'.

Application deadline: 14 June 2022.

Back  Top

6-34(2022-06-01) Post-doc @L3i, La Rochelle, France

-- Post-doctoral research position - L3i - La Rochelle France
---------------------------------------------------------------------------------------------------------------------------

Title : Emotion detection by semantic analysis of the text in comics speech balloons

 

The L3i laboratory has one open post-doc position in computer science, in the specific field of natural language processing in the context of digitised documents.

 

Duration: 12 months (an extension of 12 months will be possible)

Position available from: As soon as possible

Salary: approximately 2150 € / month (net)

Place: L3i lab, University of La Rochelle, France

Specialty: Computer Science/ Document Analysis/ Natural Language Processing

Contact: Jean-Christophe BURIE (jcburie [at] univ-lr.fr) / Antoine Doucet (antoine.doucet [at] univ-lr.fr)

 

Position Description

The L3i is a research lab of the University of La Rochelle. La Rochelle is a city in the south west of France on the Atlantic coast and is one of the most attractive and dynamic cities in France. The L3i works since several years on document analysis and has developed a well-known expertise in ‘Bande dessinée”, manga and comics analysis, indexing and understanding.

The work done by the post-doc will take part in the context of SAiL (Sequential Art Image Laboratory) a joint laboratory involving L3i and a private company. The objective is to create innovative tools to index and interact with digitised comics. The work will be done in a team of 10 researchers and engineers.

The team has developed different methods to extract and recognise the text of the speech balloons. The specific task of the recruited researcher will be to use Natural Language Processing strategies to analyse the text in order to identify emotions expressed by a character (reacting to the utterance of another speaking character) or caused by it (talking to another character). The datasets will be collections of comics in French and English.

 

Qualifications

Candidates must have a completed PhD and a research experience in natural language processing. Some knowledge and experience in deep learning is also recommended.

 

General Qualifications

• Good programming skills mastering at least one programming language like Python, Java, C/C++

• Good teamwork skills

• Good writing skills and proficiency in written and spoken English or French

 

Applications

Candidates should send a CV and a motivation letter to jcburie [at] univ-lr.fr and antoine.doucet [at] univ-lr.fr.

Applications will be considered from 9 June onwards, and until a candidate is hired

Back  Top

6-35(2022-06-07) Postdoctoral fellowship, Northwestern University, USA

Postdoctoral fellowship in corpus phonetics / data science for speech, Northwestern University
Applications are invited for a postdoctoral fellowship (contingent on final approval of funding) for a National Science Foundation grant entitled, Enhancing speech and deep learning research through holistic acoustic analysis. This project will test a novel, unsupervised deep learning approach that discovers a representational space for analysis of acoustic variation. To test this highly general approach, we will assess whether it out-performs a baseline, based on current methods for analyzing individual variation in bilingual speech.
 
The postdoctoral fellow will be co-supervised by Matt Goldrick (PI) and Ann Bradlow (co-I). The fellow will help develop automated workflows for a number of grant components, including: the baseline technique for measuring acoustic variation using current (off-the-shelf) automatic acoustic analysis methods; online intelligibility testing over large numbers (1000+) participants; and online free-classification techniques.
 
The fellow will work in collaboration with research teams at other grant sites: Technion University (lead by Joseph Keshet) and the University of California at San Diego (lead by Tamar H. Gollan). The project will support travel by the fellow to one of these other sites and other opportunities for professional development.
 
Desired qualifications include experience with corpus phonetics techniques including data-frame/corpus design, phonetic processing and scripting in Praat, online speech recognition testing with automatic scoring, and data visualization and analysis with R. 
 
The position is for one year, with the potential for renewal for a second year. The pay rate will follow NIH postdoctoral fellows stipend levels. The position can start as early as September 1, 2022, and ideally would begin before January 1, 2023. Applications should be submitted to matt-goldrick@northwestern.edu. Applications should include a CV (including contact information and links to written work) and the names of two references (letters will be requested after initial review of applications). Application review will begin June 20, 2022.  Email inquiries about the position should be directed to Matt Goldrick (matt-goldrick@northwestern.edu) or Ann Bradlow (abradlow@northwestern.edu).

 

Northwestern University requires all staff and faculty to be vaccinated against COVID-19, subject to limited exceptions. For more information, please visit our COVID-19 and Campus Updates website.

 

The Northwestern University campus sits on the traditional homelands of the people of the Council of Three Fires, the Ojibwe, Potawatomi, and Odawa as well as the Menominee, Miami and Ho-Chunk nations. We acknowledge and honor the original people of the land upon which Northwestern University stands, and the Native people who remain on this land today.

 

Northwestern University is an Equal Opportunity, Affirmative Action Employer of all protected classes, including veterans and individuals with disabilities. Women, racial and ethnic minorities, individuals with disabilities, and veterans are encouraged to apply. Click for information on EEO is the Law.

 

Back  Top

6-36(2022-06-07) PhD grant @ INRIA, France

Inria is opening a fully funded PhD position on multimodal speech
anonymization. For details and to apply, see:
https://jobs.inria.fr/public/classic/en/offres/2022-05013

Applications will be reviewed on a continuous basis until June 30.

Back  Top

6-37(2022-06-10) Associate Teaching Professor @ University of Cambridge, Department of Engineering, Cambridge, UK

Job opportunity: Associate Teaching Professor at the University of Cambridge, Department of Engineering

 

We're advertising for an Associate Teaching Professor who will be the Course Director of the Machine Learning and Machine Intelligence (MLMI) MPhil. The post will involve teaching and the post-holder can be research active e.g. they can start and run their own research group. The main expertise could be in any field related to the MPhil including: machine learning, machine intelligence, speech and language processing, signal processing, control, robotics, human-computer interaction, computer vision, and high performance computing.

 

Advert: https://www.jobs.cam.ac.uk/job/35215/

 

Back  Top

6-38(2022-06-07) Two PhD positions at Quality and Usability Lab, Technical University of Berlin, Germany

Two PhD positions at Quality and Usability Lab, Technical University of Berlin, Germany

 

We are looking to recruit two Doctoral Researchers to join Quality and Usability Lab, at Technical University of Berlin, Germany. One of the positions is a scholarship-based position with a potential continuation as a research assistant (TVL-E13) depending to follow up funding.

The Quality and Usability Lab is part of TU Berlin’s Faculty IV and deals with the design and evaluation of human-machine interaction, in which aspects of human perception, technical systems and the design of interaction are the subject of our research. We focus on self-determined work in an interdisciplinary and international team; for this we offer open and flexible working conditions that promote scientific and personal exchange and are a prerequisite for excellent results.

 

  1. Research Assistant (full-time) Speech Quality

The research is in the area of the assessment of the quality of speech services using a crowdsourcing approach. The aim of the research is to analyze how crowdsourcing-based listening-only and conversational speech quality evaluation experiments can be set up in order to provide valid and reliable results, and how the characteristics of the test participants, the test environment and the playback system can be assessed in online tests. It will be assessed which differences are to be expected between crowdsourcing and laboratory-based speech quality evaluation, and how these differences influence the development of instrumental speech quality prediction models. The results are expected to influence methods for speech quality assessment in crowdsourcing, as they are summarized in ITU-T Recommendation P.808.

This project is funded by the Deutsche Forschungsgemeinschaft, DFG, and is limited to a duration until January 31, 2024 (compensation TVL E13). A subsequent ongoing employment is supported if the PhD cannot be finished in running time of the project

    1. Tasks:

  • Interacting and extending web platforms that are created for conducting and managing listening-only or conversational experiments (Frontend: HTML/JS/CSS, backend for conversation testing: Node.js / express.js, WebRTC)

  • Conducting subjective listening-only and/or conversation tests in the laboratory and via crowdsourcing; Analyzing the results

  • Recording of source speech signals in both laboratory and large-scale crowdsourcing and preparing speech dataset.

  • Enhancing test methods (that we developed for ITU-T Rec. P.808) for screening the participants’ ability, environment and set-up suitability for the speech quality assessment tasks.

  • Processing speech signals collected in a crowdsourcing approach, and applying relevant artificial network degradation conditions (e.g. background noise, clipping, etc.).

  • Benchmarking state-of-the-art instrumental models for predicting speech quality based on their performance on the collected crowdsourcing dataset.

  • Project communication and reporting.

  • Publication and presentation of project and research results in scientific journals, at conferences, at workshops and ITU-T Study Group 12 expert’s meetings. Publication and presentation of project and research results in scientific journals, at conferences, and in workshops as well as standardization meetings of ITU-T

 

    1. Requirements:

  • Successfully completed university degree (Master, Diplom or equivalent) in computer engineering/science, informatics, media informatics, digital media, or information systems (or an equivalent technical background)

  • Deep knowledge, and hands-on experience in one or more general purpose programming languages (recommended is Python)

  • Profound programming skills in front-end (HTML5/CSS3, JS, jQuery, JSON), AND one scripting language for data processing (either MATLAB, Python or R), and ideally backend development skills

  • Knowledge about digital signal processing, beneficial: speech signal processing respectively audio signal processing and acoustics

  • Knowledge about empirical subjective tests and statistical data analysis is appreciated

  • Language skills: English fluent in writing and speaking (B2 level); willingness to learn German is expected

  • Joy of working in an interdisciplinary and international environment

 

  1. Conversation Quality – Scholarship-based position

The position is open to do research in the field of speech signals analysis, and the assessment of speech quality in different (mobile and fixed) networks. Therefore, speech signals are to be analyzed in listening-only as well as conversational situations in order to get indications or the perceived quality. Based on these analysis, signal-based and parametric models for the estimation of speech quality can be extended and integrated. One focus of the present research may be the evaluation of new speech codecs in different network scenarios. The models are to be validated based on subjective listening and conversation tests.

 

The scholarship is limited to 10 months; however, the outcomes of the research should be used to support the preparation of a new project application, and may also become a foundation for a later PhD thesis. A subsequent position as a research assistant from the project funds would be possible if the funds were approved.

 

2.1 Tasks

  • Maintaining and further developing a platform to conduct web-based voice calls

  • Conducting subjective conversation tests in the laboratory and via crowdsourcing

  • Analysis of speech signals

  • Creating and evaluating models for predicting quality aspects using different algorithms (including traditional signal processing methods and state-of-the-art DNNs)

  • Project communication and reporting

  • Publication and presentation of project and research results in scientific journals, at conferences, and in workshops

 

2.2. Requirements

  • Master or diploma in electrical engineering, computer engineering, computer science, media informatics, media technology, information systems management (or an equivalent technical background)

  • Profound knowledge in digital signal processing, beneficial: speech signal processing or audio signal processing, respectively

  • Good programming skills (e.g. MATLAB or Python) and safe handling of web development tools (e.g. HTML5/CSS3, JS, ideally also backend development skills)

  • Interest in running user studies with test participants to determinate speech quality

  • Language skills: English and German fluent in writing and speaking

  • Knowledge about empirical studies and statistical data analysis is appreciated

  • Joy of working in an interdisciplinary and international environment

 

 

Application

For both positions, please send the following documents, bundled in a single PDF file, to Prof. Dr.-Ing. Sebastian Möller bewerbung@qu.tu-berlin.de: Letter of application, curriculum vitae, copies of certificates, job references. Please also specify for which position you are applying.

To ensure equal opportunities between women and men, applications by women with the required qualifications are explicitly desired. Qualified individuals with disabilities will be favored.

 

Back  Top

6-39(2022-06-10) Associate Teaching Professor at the University of Cambridge, Department of Engineering , UK

Job opportunity: Associate Teaching Professor at the University of Cambridge, Department of Engineering

 

We're advertising for an Associate Teaching Professor who will be the Course Director of the Machine Learning and Machine Intelligence (MLMI) MPhil. The post will involve teaching and the post-holder can be research active e.g. they can start and run their own research group. The main expertise could be in any field related to the MPhil including: machine learning, machine intelligence, speech and language processing, signal processing, control, robotics, human-computer interaction, computer vision, and high performance computing.

 

Advert: https://www.jobs.cam.ac.uk/job/35215/

 

Back  Top

6-40(2022-06-11) Chaire de professeur junior au CNRS, France

Le CNRS ouvre un poste attractif en Apprentissage Automatique pour le Traitement Automatique des Langues. Il s'agit d'une 'Chaire Professeur Junior', un type de poste créé cette année, qui offre un accès direct à un poste permanent de Directrice ou Directeur de Recherche CNRS au bout de 3 à 6 ans.

L'objectif affiché de cette campagne est de renforcer un des trois laboratoires d'excellence suivants en TAL : LISN, LIG, LORIA, sur des aspects stratégiques ou souverains : le traitement automatique de la langue française écrite ou orale, des domaines d'application spécifiques ayant peu de données d'apprentissage ou faiblement couverts par les modèles génériques (par exemple, dans les domaines de la sécurité, la défense ou la santé), ou encore en lien avec des applications et plateformes éducatives ou d'assistance aux personnes handicapées. Ce poste inclut une charge d'enseignement de seulement 42 heures par an pendant toute la durée du contrat. Ensuite, la fonction de directrice ou directeur de recherche CNRS n'impose aucune charge d'enseignement obligatoire.

La prise de fonction sur ce poste est accompagnée d'un environnement de 300,000 € (200,000 € de l'ANR plus une bourse de thèse).

La date limite de dépôt des dossiers est le : 31/08/2022
La date prévue d'embauche est le : 01/12/2022

Tous les détails concernant cette offre de poste du CNRS sont donnés ici :

https://www.cnrs.fr/en/cnrsinfo/join-cnrs-25-tenure-track-positions-available

L'offre spécifique en Apprentissage Automatique pour le Traitement Automatique des Langues est détaillée ici :

https://emploi.cnrs.fr/Offres/CPJ/CPJ-2022-020/Default.aspx?Lang=EN


Contacts :

LISN: Gilles Adda (Gilles.Adda@lisn.upsaclay.fr)

LIG: François Portet (francois.portet@imag.fr)

LORIA: Christophe Cerisara (christophe.cerisara@loria.fr)

Les activités d'enseignement prendront part dans les programmes dédiés mis en place par l'Université de Paris Saclay (contact : Dominique Quadri presidence-dept-info.sciences@u-psud.fr), l'Université de Grenoble Alpes (contact : Massih-Reza.Amini@grenoble-inp.fr), ou l'Université de Lorraine (contact : Maxime Amblard maxime.amblard@loria.fr) dans le domaine de l'intelligence artificielle.

Back  Top

6-41(2022-06-12) Postdocs at the Speech Prosody Special Interest Group

To the Speech Prosody Special Interest Group,

 

We are looking for two postdocs (2 years each in the first instance) to work on intonation and intonation pragmatics as part of SPRINT (sprintproject.io). We would appreciate it if you could share the information with your networks, especially because we have a short deadline for applications (19 June).

 

The job descriptions and other details can be found at the links below, but potential applicants can get in touch with me as well (using my Radboud address, amalia.arvaniti@ru.nl)

 

Please note that the previous links we sent don’t work, these are the new links and should work.

 

https://www.ru.nl/english/working-at/vacature/details-vacature/?ruid=1583&pad=%2fenglish&doel=embed&taal=uk

https://www.ru.nl/english/working-at/vacature/details-vacature/?ruid=1585&pad=%2fenglish&doel=embed&taal=uk

 

regards to all,

 

Amalia Arvaniti

Back  Top

6-42(2022-06-14) Multiple faculty positions at National Yang Ming Chiao Tung University (NYCU), Taiwan

National Yang Ming Chiao Tung University (NYCU), one of the top-ranked universities in Taiwan, invites applications for multiple faculty positions (assistant, associate, full, and chair professors), in the Institute of Artificial Intelligence Innovation (IAII). IAII is part of the newly established Industry Academia Innovation School (IAIS) in NYCU, with major focuses on innovations in artificial intelligence (https://iais.nycu.edu.tw/). IAII/NYCU is located in Hsinchu Science Park, Taiwan’s “Silicon Valley”, wherein over two thirds of the CEOs and managers are NYCU graduates.

 

With the determination to facilitate industry–government–academia–research collaboration to drive the next-generation industry development, we are looking for strong candidates in the broader area of artificial intelligence, data science, security, information engineering, broadband communication, and Internet of Things. Applicants are expected to conduct outstanding research and be committed to teaching, in collaboration with world-class ICT industry partners.

 

Applicants should submit the following items: ● Cover letter ● Curriculum Vitae ● Research statement ● Teaching statement ● Publication list ●      Three or more reference letters ● Any other PDF-formatted supporting materials (optional). Please address all inquiries and nominations to Prof. Wen-Huang Cheng, Director of the IAII via email (whcheng@nycu.edu.tw).

 

--

Wen-Huang Cheng (鄭文皇)
Distinguished Professor,
     Department of Electronics Engineering | Institute of Electronics
     College of Electrical and Computer Engineering,
     National Chiao Tung University (NCTU), Taiwan
Director, 
     NCTU Artificial Intelligence Graduate Program
 
Email: whcheng@nctu.edu.tw
Phone: +886-(0)3-5712121 ext 54289
Back  Top

6-43(2022-06-22) Doctoral Researcher, Institute of Linguistics, JWGoethe University, Frankfurt/Main, Germany

The Johann Wolfgang Goethe University Frankfurt am Main is one of the largest universities in Germany with around 48,000 students and with about 5,000 employees. Founded in 1914 by Frankfurt citizens and since 2008 once again proud of its foundation status Goethe University possesses a high degree of autonomy, modernity and professional diversity. As a comprehensive university, the Goethe University offers a total of 16 departments on five campuses and more than 100 degree programs along with an outstanding research reputation.

 

The Institute of Linguistics at the Department of Modern Languages of Goethe University Frankfurt am Main offers a position in cotutelle with the Department of Translation and Language Sciences at Universitat Pompeu Fabra, Barcelona, in the project “Co-Speech Gestures and Prosody as Multimodal Markers of Information Structure” as a

 

Doctoral Researcher

(E13 TV-GU, 65% part-time) starting October 1st 2022

 

funded for 3 years by the German Science Foundation (Deutsche Forschungsgemeinschaft DFG). The salary grade includes social benefits and is based on the job characteristics of the collective agreement applicable to Goethe University (TV-G-U).

We offer a 3-year doctoral co-tutelle position to work within a collaborative team. The main focus of the doctoral research will be to assess the multimodal markers of IS in Catalan. The project will be run in close collaboration with the team assessing the multimodal markers of IS in German. The two teams involved are the Prosodic Studies Group at UPF (IP: Dr. Pilar Prieto)

and the Phonology Lab at the Institute of Linguistics in Frankfurt (PI: Dr. Frank Kügler). The project is part of the DFG Priority Progamme 2329 “Visual Communication” (https://vicom.info).

The ideal candidate has a strong background in linguistics and linguistic experimentation, is highly motivated and interested in the assessment of prosodic and gestural markers in language. Knowledge of Catalan will be required to run the experiments. To qualify for a doctoral position in Linguistics the candidate should hold a master’s degree in Linguistics, Philology, Psychology, or equivalent.


Doctoral students are expected to participate actively in the project and the two department’s activities. The position comes with no teaching obligation.


The application in English should consist of one pdf file containing:

  • A letter of intent describing your research interests and motivation for PhD studies (maximum one page)
  • CV
  • English certificate of your BA and MA degree (or equivalent) with a transcript of records
  • A maximum 3-page research proposal which states a research question related to the project, describes the methodology and work plan, and contextualises the expected results in relation to the state of the art. The actual dissertation project of the successful candidate will be worked out in collaboration with the supervisor.
  • One or two letters of reference
  • Other documents which the applicant would like to include

All required documents should be emailed as a pdf file (preferably as one document) to Frank Kügler (Kuegler@em.uni-frankfurt.de) and Pilar Prieto (pilar.prieto@upf.edu) up to July 15th, 2022.

For more information, please contact Frank Kügler and Pilar Prieto.

 

The Goethe University is committed to a policy of providing equal employment opportunities for both men and women alike, and therefore encourages particularly women to apply for the position/s offered. Individuals with severe disability will be prioritized in case of equal qualification.


--

Prof. Dr. Frank Kügler
Institut für Linguistik
Goethe-Universität Frankfurt
Norbert-Wollheim-Platz 1
D-60323 Frankfurt am Main
Telefon +49 69 798 32217
E-Mail: kuegler@em.uni-frankfurt.de

 

Back  Top

6-44(2022-06-22) 2 postdocs @Radbout University, Nijmegen, The Netherlands

We are looking for two postdocs (2 years each in the first instance) to work on intonation and intonation pragmatics as part of SPRINT (sprintproject.io). We would appreciate it if you could share the information with your networks, especially because we have a short deadline for applications (19 June).

 

The job descriptions and other details can be found at the links below, but potential applicants can get in touch with me as well (using my Radboud address, amalia.arvaniti@ru.nl)

 

Please note that the previous links we sent don’t work, these are the new links and should work.

 

https://www.ru.nl/english/working-at/vacature/details-vacature/?ruid=1583&pad=%2fenglish&doel=embed&taal=uk

https://www.ru.nl/english/working-at/vacature/details-vacature/?ruid=1585&pad=%2fenglish&doel=embed&taal=uk

 

Back  Top

6-45(2022_06-23) Post-Doctoral/PhD position at Telecom-Paris

Post-Doctoral/PhD position at Telecom-Paris on Deep learning approaches for social computing

           

*Place of work* Telecom Paris, Palaiseau (Paris outskirt)

 

*Starting date* From September 2022 (but can start later)

 

*Context*

The PhD student/post-doctoral fellow will take part in the REVITALISE projectfunded by ANR ( viRtual bEhaVioral skIlls TrAining for pubLIc SpEaking). The research activity will bring together the research topics of Prof. Chloé Clavel [Clavel] of the S2a [SSA] team at Telecom-Paris– social computing [SocComp] - and Dr. Mathieu Chollet [Chollet] from University of Glasgow – multimodal systems for social skills training, and Dr Beatrice Biancardi [Biancardi] – Social Behaviour Modelling from CESI Engineering School, Nanterre.

 

Candidate profile*

As a minimum requirement, the successful candidate should have:

• A master degree in one or more of the following areas: human-agent interaction, deep learning, computational linguistics, affective computing, reinforcement learning, natural language processing, speech processing

 Excellent programming skills (preferably in Python)

 Excellent command of English

 

*How to apply*

The application should be formatted as **a single pdf file** and should include:

 A complete and detailed curriculum vitae

 A cover letter

 The contact of two referees

 

For the post-doctoral fellow position, additional documents are required:

 

 The defense and Phd reports

 The contact of two referees

 

The pdf file should be sent to the three supervisors: Chloé Clavel, Beatrice Biancardi and Mathieu Chollet: chloe.clavel@telecom-paris.frbbiancardi@cesi.frmathieu.chollet@glasgow.ac.uk

 

           

Multimodal attention models for assessing and providing feedback on users’ public speaking ability

 

*Keywords* human-machine interaction, attention models, recurrent neural networks, Social Computing, natural language processing, speech processing, non-verbal behavior processing, multimodality, soft skills, public speaking

 

*Supervision* Chloé Clavel, Mathieu Chollet, Beatrice Biancardi

 

*Description* Oral communication skills are essential in many situations and have been identified as core skills of the 21st century. Technological innovations have enabled social skills training applications which hold great training potential: speakers’ behaviors can be automatically measured, and machine learning models can be trained to predict public speaking performance from these measurements and subsequently generate personalized feedback to the trainees.

The REVITALISE project proposes to study explainable machine learning models for the automatic assessment of public speaking and for automatic feedback production to public speaking trainees. In particular, the recruited intern will address the following points:

-   identify relevant datasets for training public speaking and prepare them for model training

-   propose and implement multimodal machine learning models for public speaking assessment and compare them to existing approaches in terms of predictive performance.

-   integrate the public assessment models to produce feedback a public speaking training interface, and evaluate the usefulness and acceptability of the produced feedback in a user study

The results of the project will help to advance the state of the art in social signal processing, and will further our understanding of the performance/explainability trade-off of these models.

 

The compared models will include traditional machine learning models proposed in previous work [Wortwein] and sequential neural approaches (recurrent networks) that integrate attention models as a continuation of the work done in [Hemamou_a], [Hemamou_b][BenYoussef]. The feedback production interface will extend a system developed in previous work [Chollet21].

 

Selected references of the team:

[Hemamou_a] L. Hemamou, G. Felhi, V. Vandenbussche, J.-C. Martin, C. Clavel, HireNet: a Hierarchical Attention Model for the Automatic Analysis of Asynchronous Video Job Interviews.  in AAAI 2019

[Hemamou_b] Leo Hemamou;Arthur Guillon;Jean-Claude Martin;Chloe Clavel, Multimodal Hierarchical Attention Neural Network: Looking for Candidates Behaviour which Impact Recruiter’s Decision, IEEE Trans. of Affective Computing, Sept. 2021

[Ben-Youssef]  Atef Ben-Youssef, Chloé Clavel, Slim Essid, Miriam Bilac, Marine Chamoux, and Angelica Lim.  Ue-hri: a new dataset for the study of user engagement in spontaneous human-robot interactions.  In  Proceedings of the 19th ACM International Conference on Multimodal Interaction, pages 464–472. ACM, 2017.

[Wortwein] Torsten Wörtwein, Mathieu Chollet, Boris Schauerte, Louis-Philippe Morency, Rainer Stiefelhagen, and Stefan Scherer. 2015. Multimodal Public Speaking Performance Assessment. In Proceedings of the 2015 ACM on International Conference on Multimodal Interaction (ICMI '15). Association for Computing Machinery, New York, NY, USA, 43–50.

[Chollet21] Chollet, M., Marsella, S., & Scherer, S. (2021). Training public speaking with virtual social interactions: effectiveness of real-time feedback and delayed feedback. Journal on Multimodal User Interfaces, 1-13.

 

Other references:

[TPT] https://www.telecom-paristech.fr/eng/ 

[IMTA] https://www.imt-atlantique.fr/fr

[SocComp.] https://www.tsi.telecom-paristech.fr/recherche/themes-de-recherche/analyse-automatique-des-donnees-sociales-social-computing/

[SSA] http://www.tsi.telecom-paristech.fr/ssa/#

[PACCE] https://www.ls2n.fr/equipe/pacce/

[Clavel] https://clavel.wp.imt.fr/publications/

[Chollet] https://matchollet.github.io/

[Biancardi] https://sites.google.com/view/beatricebiancardi

-Rasipuram, Sowmya, and Dinesh Babu Jayagopi. 'Automatic multimodal assessment of soft skills in social interactions: a review.' Multimedia Tools and Applications (2020): 1-24.

-Sharma, Rahul, Tanaya Guha, and Gaurav Sharma. 'Multichannel attention network for analyzing visual behavior in public speaking.' 2018 IEEE Winter Conference on Applications of Computer Vision (WACV). IEEE, 2018.

-Acharyya, R., Das, S., Chattoraj, A., & Tanveer, M. I. (2020, April). FairyTED: A Fair Rating Predictor for TED Talk Data. In Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 34, No. 01, pp. 338-345).

Back  Top

6-46(2022-06-27) PhD grant @ LIS, Marseille

 


Candidat(e) pour une thèse de doctorat en informatique sur un projet collaboratif susceptible d'être financé par la DGA. La thèse se déroulera au sein de l'équipe R2I ( Recherche d'informations et interactions) du pôle Sciences des données du LIS (Marseille)


Sujet de thèse de doctorat

Titre : Génération automatique de résumés fluides de textes en français par apprentissage profond

Encadrement : Prof. Patrice BELLOT (https://cv.archives-ouvertes.fr/patrice-bellot ; Université d’Aix-Marseille CNRS, LIS), Adrian CHIFU (https://adrianchifu.com ; Université d’Aix-Marseille CNRS, LIS) 

Période : octobre 2022 - septembre 2025

Mots clés : résumé automatique, fluidification textuelle, recherche d’information, traitement automatique des langues, apprentissage automatique, réseaux neuronaux

Contexte : Projet collaboratif susceptible d’être soutenu par la DGA entre : 

Description du sujet :

Le contexte du projet

Devant la croissance exponentielle des volumes de données et particulièrement de la documentation de type texte (manuels, publications, sites internet, etc.), une solution est de permettre d’accéder facilement aux éléments essentiels, au travers de résumés des textes les plus pertinents dans le contexte utilisateur. Or à ce jour les résumés automatiques restent perfectibles, aussi bien du point de vue de la couverture informationnelle que de leur susceptibilité à créer de fausses informations ou encore de leur fluidité de la lecture, critère qui est la cible première de cette thèse.

Le but du projet RAFFAL est d’améliorer les technologies automatiques (par IA) de résumés de documents en français selon l’angle des métriques qui les régissent en tant que fonction objective (apprentissage automatique de modèles) et mesure d’évaluation humaine Par ailleurs, les algorithmes, modèles et jeux de données de nouvelle génération basés sur les technologies les plus récentes de d’apprentissage profond (notamment de type Transformeret modèles séquence à séquence) sont pratiquement exclusivement en langue anglaise et doivent être testées et adaptées au français.

Le domaine du résumé automatique est confronté depuis longtemps au manque de métriques d’évaluationautomatique de la qualité des résumés fournis suffisamment fiables ; ce manque de métriques d’évaluation est un frein majeur à l’industrialisation et au déploiement des technologies de résumés automatiques pour lesquels des critères de confiance et de pilotage sont indispensables.

Plan de travail
Le plan de travail comprend deux volets majeurs. Le premier correspond à une étude des propriétés et des limites des métriques existantes et à leur adaptation au français. Le second correspond à la modification des fonctions objectives utilisées pour l’entraînement des modèles selon les métriques adaptées et de nouvelles métriques.

La thèse que nous proposons attaquera tout d’abord la définition de la fluidité. Les mesures de fluidité et de qualité d’un résumé existantes, généralement pour l’anglais, seront étudiées et adaptées à la langue française. Il s’agit par exemple de revisiter le lien entre les mesures existantes, les différentes dimensions qualitatives d’un résumé et leur implémentation au sein d’une architecture neuronale notamment de type séquence à séquence (profondeur des représentations et niveaux d’abstraction, mécanismes attentionnels...). Les ressources linguistiques et les corpus de textes utiles devront être identifiés.

Des évaluateurs humains pourront être impliqués et nous devons à la fois étudier des mesures d’accord inter-annotateurs et analyser leurs profils, selon leur niveau de connaissance de la thématique du résumé par exemple. Une évaluation en ligne pourrait permettre d’identifier les points complexifiant la lecture et conduire à de nouvelles métriques qui influeront à leur tour la création dynamique d’un résumé (approche par renforcement, réécriture alternative, complétion informationnelle par extraction d’information ou annotation sémantique).

La fluidité sera étudiée en tant que fonction objectif pour l’optimisation du « compromis » entre la perte informationnelle et les phénomènes d’hallucination (collaboration avec une autre thèse effectuée en parallèle au sein du laboratoire ISIR de Paris Sorbonne Université). Nous allons étudier l’équilibre entre la fluidité, d’une part, et la qualité et la complétude informationnelles, d’autre part (ex. : le « compromis » entre la précision et le rappel, pour les résultats d’un moteur de recherche). Cette phase nécessitera l’identification des informations essentielles, des éléments textuels centraux des textes à résumer et pourra être approchée par le biais de systèmes questions-réponses.  

Enfin, la fluidité d’un résumé étant dépendante du contexte, il est nécessaire d’étudier son caractère subjectif, notamment en tenant compte des types de texte (actualités, prises de position, interviews avec dialogues, articles scientifiques...et des priorités du résumé (couverture des  points de vue et des opinions sur un sujet sans perte de l’identification des sources, synthèse factuelle autour d’un événement...).

Chaque étape fera l’objet d’expérimentations sur des données et problématiques réelles, en collaboration avec le partenaire industriel du projet. Les propositions de la thèse s’inscriront dans le cadre de la science ouverte (publications, données et modèles lorsque cela est possible, codes source). 

Profil de candidature :

Parcours antérieur : Master 2 Informatique orienté Recherche en IA ou en TAL ou équivalent

Langue : Français (niveau minimum C1)

Langage de programmation : Python

Connaissances et compétences souhaitées : 

- apprentissage automatique statistique, architectures neuronales, transformeurs

- classification automatique de documents

- annotation de corpus

- outils et ressources du Traitement Automatique des Langues

- modèles de langue et représentations textuelles

- résumé automatique, génération de textes, simplification de textes

- recherche d’information et questions-réponses

Back  Top

6-47(2022-07-07) Two internship positions @ Naver Labs Europe
Naver Labs Europe (https://europe.naverlabs.com/)  is currently offering 2 internship positions related to Speech Processing.
More details on both job offers can be found here:
 
 
 
 
Back  Top



 Organisation  Events   Membership   Help 
 > Board  > Interspeech  > Join - renew  > Sitemap
 > Legal documents  > Workshops  > Membership directory  > Contact
 > Logos      > FAQ
       > Privacy policy

© Copyright 2024 - ISCA International Speech Communication Association - All right reserved.

Powered by ISCA