ISCA - International Speech
Communication Association

ISCApad Archive  »  2016  »  ISCApad #214  »  Resources  »  Books

ISCApad #214

Monday, April 11, 2016 by Chris Wellekens

5-1 Books
5-1-1Niebuhr, Olivier, Understanding Prosody:The Role of Context, Function and Communication

Understanding Prosody: The Role of Context, Function and Communication

Ed. by Niebuhr, Oliver

Series:Language, Context and Cognition 13,   De Gruyter or

The volume represents a state-of-the-art snapshot of the research on prosody for phoneticians, linguists and speech technologists. It covers well-known models and languages. How are prosodies linked to speech sounds? What are the relations between prosody and grammar? What does speech perception tell us about prosody, particularly about the constituting elements of intonation and rhythm? The papers of the volume address questions like these with a special focus on how the notion of context-based coding, the knowledge of prosodic functions and the communicative embedding of prosodic elements can advance our understanding of prosody.



5-1-2Albert Di Cristo: « La Prosodie de la Parole : Une Introduction », Editions de Boeck-Solal (296 p)
 Albert Di Cristo: « La Prosodie de la Parole : Une Introduction », Editions de Boeck-Solal (296 p). 
Sommaire : 
Avant –propos, Introduction, ;
 Ch.1 : Eléments de définition ; 
 Ch 2. Situation de la prosodie dans le champ des sciences du langage et dans l’étude de la communication ; 
Ch 3. La prosodie sur les deux versants de la communication orale interindividuelle (production et compréhension) ; 
Ch 4. La prosodie et le cerveau ;
 Ch 5. La matérialité de la prosodie ; 
Ch 6. Les niveau d’analyse et de représentation de la prosodie ; 
Ch 7. Les théories, les modèles de la prosodie et leurs appareils formels ;
 Ch 8 La fonctionnalité plurielle de la prosodie ; 
Ch 9. Les relations de la prosodie avec les sens ; 
Suggestions de lecture ;
 Index des termes ; 
Index des noms propres.

5-1-3Pierre-Yves Oudeyer, 'Aux sources de la parole: auto-organisation et évolution', Odile Jacob
Pierre-Yves Oudeyer, dir. rech. Inria, vient de publier 'Aux sources de la parole: auto-organisation et évolution', chez Odile Jacob (Sept. 2013).
Il discute de la question de l'évolution et de l'acquisition de la parole, chez l'enfant et chez les robots.
En faisant dialoguer biologie, linguistique, neurosciences et expériences robotiques, 
ce livre étudie en particulier les phénomènes d'auto-organisation, permettant la formation spontanée de langues nouvelles dans une population d'individus. 
Il présente en particulier des expériences dans lesquelles une population de robots numériques invente, forme, et négotie son propre système de parole
et explique comment de telles expériences robotiques peuvent nous aider à mieux comprendre l'homme.
Il présente aussi des expérimentations robotiques récentes, et à partir de perspectives nouvelles en intelligence artificielle, dans lesquelles des mécanismes de curiosité permettent à un robot de découvrir par lui-même son corps, les objets qui l'entourent, et finalement les interactions vocales avec ses pairs. C'est ainsi que s'auto-organise son propre développement cognitif, et qu'apparaissent des hypothèses nouvelles pour comprendre le développement chez l'enfant.
Site web du livre:
Pierre-Yves Oudeyer,
Directeur de recherche, Inria
Responsable de l'équipe Flowers
Inria Bordeaux Sud-Ouest et Ensta-ParisTech, France

5-1-4Björn Schuller, Anton Batliner , Computational Paralinguistics: Emotion, Affect and Personality in Speech and Language Processing, Wiley, ISBN: 978-1-119-97136-8, 344 pages, November 2013
Björn Schuller, Anton Batliner Computational Paralinguistics: Emotion, Affect and Personality in Speech and Language Processing Wiley, ISBN: 978-1-119-97136-8, 344 pages, November 2013 Description - This book presents the methods, tools and techniques that are currently being used to recognise (automatically) the affect, emotion, personality and everything else beyond linguistics (‘paralinguistics’) expressed by or embedded in human speech and language. - It is the first book to provide such a systematic survey of paralinguistics in speech and language processing. The technology described has evolved mainly from automatic speech and speaker recognition and processing, but also takes into account recent developments within speech signal processing, machine intelligence and data mining. - Moreover, the book offers a hands-on approach by integrating actual data sets, software, and open-source utilities which will make the book invaluable as a teaching tool and similarly useful for those professionals already in the field. Key features: - Provides an integrated presentation of basic research (in phonetics/linguistics and humanities) with state-of-the-art engineering approaches for speech signal processing and machine intelligence. - Explains the history and state of the art of all of the sub-fields which contribute to the topic of computational paralinguistics. - Covers the signal processing and machine learning aspects of the actual computational modelling of emotion and personality and explains the detection process from corpus collection to feature extraction and from model testing to system integration. - Details aspects of real-world system integration including distribution, weakly supervised learning and confidence measures. - Outlines machine learning approaches including static, dynamic and context-sensitive algorithms for classification and regression. - Includes a tutorial on freely available toolkits, such as the open-source ‘openEAR’ toolkit for emotion and affect recognition co-developed by one of the authors, and a listing of standard databases and feature sets used in the field to allow for immediate experimentation enabling the reader to build an emotion detection model on an existing corpus. Links: - The book: - Table of Contents (pdf): - Chapter01 (pdf): 



5-1-5Li Deng and Dong Yu, Deep Learning: Methods and Applications, Foundations and Trends in Signal Processing
Foundations and Trends in Signal Processing ( has published the following issue:   

Volume 7, Issue 3-4                                                                                                                                                                   
Deep Learning: Methods and Applications                                                               
By Li Deng and Dong Yu (Microsoft Research, USA)                                       

5-1-6O.Niebuhr, R.Skarnitzl, 'Tackling the Complexity in Speech', Prague University Press

Tackling the Complexity in Speech

Author Oliver Niebuhr, Radek Skarnitzl (eds)
Publisher Univerzita Karlova v Praze, Filozofická fakulta
Release year 2015
ISBN 978-80-7308-558-2
Series Opera Facultatis philosophicae
Pages 230

The present volume is meant to give the reader an impression of the range of questions and topics that are currently subject of international research in the discovery of complexity, the organization of complexity, and the modelling of complexity. These are the main sections of our volume. Each section includes four carefully selected chapters. They deal with facets of speech production, speech acoustics, and/or speech perception or recognition, place them in an integrated phonetic-phonological perspective, and relate them in more or less explicit ways to aspects of speech technology. Therefore, we hope that this volume can help speech scientists with traditional training in phonetics and phonology to keep up with the latest developments in speech technology. In the opposite direction, speech researchers starting from a technological perspective will hopefully get inspired by reading about the questions, phenomena, and communicative functions that are currently addressed in phonetics and phonology. Either way, the future of speech research lies in international, interdisciplinary collaborations, and our volume is meant to reflect and facilitate such collaborations.


5-1-7J.Li, L.Deng, R.Haeb-Umbach and Y.Gong, 'Robust Automatic Speech Recognition', Academic Press

 'Robust Automatic Speech Recognition'

  • The  first book that provides a comprehensive review on noise and reverberation robust speech recognition methods in the era of deep neural networks
  • Connects robust speech recognition techniques to machine learning paradigms with rigorous mathematical treatment
  • Provides elegant and structural ways to categorize and analyze noise-robust speech recognition techniques
  • Written by leading researchers who have been actively working on the subject matter in both industrial and academic organizations for many years


5-1-8Barbosa, P. A. and Madureira, S. Manual de Fonética Acústica Experimental. Aplicações a dados do português. 591 p. São Paulo: Cortez, 2015. [In Portuguese]

Barbosa, P. A. and Madureira, S. Manual de Fonética Acústica Experimental. Aplicações a dados do português. 591 p. São Paulo: Cortez, 2015. [In Portuguese]

This manual of Experimental Acoustic Phonetics is conceived for Undergraduate and Graduate classes on areas such as Acoustic Phonetics, Phonology, Communications Engineering, Music, Acoustic Physics, Speech Theraphy, among others.  Starting with a theoretical and methodological presentation of Acoustic Phonetics Theory and Techniques in five chapters,  including a chapter on experimental methods, the book follows with detailed acoustic analysis of all classes of sounds using audio files from both European and Brazilian Portuguese as data.
All analyses are explained step by step using Praat. The audiofiles are available on the book web site and can be downloaded.  All techniques can be applied to any language, of course. Proposed exercices at the end of each chapter allow the teacher o evaluate the student progress.


Delete | Reply | Reply to All | Forward | Redirect | View Thread | Blacklist | Whitelist | Message Source | Save as | Print
Move | Copy

5-1-9Damien Nouvel, Inalco, Maud Ehrmann, EPFL,Sophie Rosset, CNRS. Les entités nommées pour le traitement automatique des langues

Les entités nommées pour le traitement automatique des langues

Damien Nouvel, Inalco, Maud Ehrmann, EPFL
Sophie Rosset, CNRS  

Le livre est disponible en ebook au prix de 9,90 euros.
(prix réservé aux particuliers - PDF lisible sur tout support - uniquement disponible sur
Le livre est disponible en version papier au prix de 40,00 euros.

Le monde numérisé et connecté produit de grandes quantités de données. Analyser automatiquement le langage naturel est un enjeu majeur pour les applications de recherches sur le Web, de suivi d'actualités, de fouille, de veille, d'opinion, etc.

Les recherches menées en extraction d'information ont montré l'importance de certaines unités, telles que les noms de personnes, de lieux et d’organisations, les dates ou les montants. Le traitement de ces éléments, les « entités nommées », a donné lieu au développement d'algorithmes et de ressources utilisées par les systèmes informatiques.

Théorique et pratique, cet ouvrage propose des outils pour définir ces entités, les identifier, les lier à des bases de connaissance ou pour procéder à l’évaluation des systèmes.

1. Les entités nommées pour l’accès à l’information
2. Les entités nommées, des unités référentielles
3. Ressources autour des entités nommées
4. Reconnaître les entités nommées
5. Lier les entités nommées aux référentiels
6. Évaluation de la reconnaissance des entités nommées

168 pages - Octobre 2015
Ouvrage papier - broché 
ISBN 978-1-78405-104-4

 Organisation  Events   Membership   Help 
 > Board  > Interspeech  > Join - renew  > Sitemap
 > Legal documents  > Workshops  > Membership directory  > Contact
 > Logos      > FAQ
       > Privacy policy

© Copyright 2024 - ISCA International Speech Communication Association - All right reserved.

Powered by ISCA