ISCA - International Speech Communication Association



ISCApad #237

Sunday, March 11, 2018 by Chris Wellekens

5-1 Books
5-1-1 J. Li, L. Deng, R. Haeb-Umbach and Y. Gong, 'Robust Automatic Speech Recognition', Academic Press

 'Robust Automatic Speech Recognition'

  • The first book to provide a comprehensive review of noise- and reverberation-robust speech recognition methods in the era of deep neural networks
  • Connects robust speech recognition techniques to machine learning paradigms with rigorous mathematical treatment
  • Provides elegant and structural ways to categorize and analyze noise-robust speech recognition techniques
  • Written by leading researchers who have been actively working on the subject matter in both industrial and academic organizations for many years

http://store.elsevier.com/Robust-Automatic-Speech-Recognition/Jinyu-Li/isbn-9780128023983/


5-1-2 Barbosa, P. A. and Madureira, S. Manual de Fonética Acústica Experimental. Aplicações a dados do português. 591 p. São Paulo: Cortez, 2015. [In Portuguese]


http://www.cortezeditora.com.br/manual-de-fonetica-acustica-experimental-1599.aspx/p

This manual of Experimental Acoustic Phonetics is designed for undergraduate and graduate courses in areas such as Acoustic Phonetics, Phonology, Communications Engineering, Music, Acoustic Physics, and Speech Therapy, among others. Starting with a theoretical and methodological presentation of Acoustic Phonetics theory and techniques in five chapters, including a chapter on experimental methods, the book continues with detailed acoustic analyses of all classes of sounds, using audio files from both European and Brazilian Portuguese as data.
All analyses are explained step by step using Praat. The audio files are available on the book's website and can be downloaded. All techniques can be applied to any language, of course. Exercises at the end of each chapter allow the teacher to evaluate the students' progress.

 


5-1-3 Damien Nouvel, Inalco, Maud Ehrmann, EPFL, Sophie Rosset, CNRS. Les entités nommées pour le traitement automatique des langues

Les entités nommées pour le traitement automatique des langues

Damien Nouvel, Inalco, Maud Ehrmann, EPFL
Sophie Rosset, CNRS  

The book is available as an ebook for 9.90 euros.
(price for individuals only - PDF readable on any device - available only from iste-editions.fr)
The book is available in print for 40.00 euros.

The digitized, connected world produces large quantities of data. Automatically analyzing natural language is a major challenge for applications such as web search, news tracking, data mining, monitoring, opinion analysis, etc.

Research in information extraction has shown the importance of certain units, such as the names of people, places and organizations, dates, or amounts. The processing of these elements, the "named entities", has led to the development of algorithms and resources used by computer systems.

Both theoretical and practical, this book offers tools for defining these entities, identifying them, linking them to knowledge bases, and evaluating systems.

Contents

1. Named entities for access to information
2. Named entities as referential units
3. Resources for named entities
4. Recognizing named entities
5. Linking named entities to reference bases
6. Evaluating named entity recognition

168 pages - October 2015
Print edition - paperback
ISBN 978-1-78405-104-4

5-1-4 R. Fuchs, 'Speech Rhythm in Varieties of English', Springer

R. Fuchs, 'Speech Rhythm in Varieties of English', has been published by Springer in the 'Prosody, Phonology and Phonetics' series: https://www.springer.com/gp/book/9783662478172


5-1-5 Pejman Mowlaee et al., 'Phase-Aware Signal Processing in Speech Communication: Theory and Practice', Wiley 2016

Phase-Aware Signal Processing in Speech Communication: Theory and Practice

Pejman Mowlaee, Johannes Stahl, Josef Kulmer, Florian Mayer

http://eu.wiley.com/WileyCDA/WileyTitle/productCd-1119238811.html

An overview of the challenging new topic of phase-aware signal processing

Speech communication technology is a key factor in human-machine interaction, digital hearing aids, mobile telephony, and automatic speech/speaker recognition. With the proliferation of these applications, there is a growing requirement for advanced methodologies that can push the limits of the conventional solutions relying on processing the signal magnitude spectrum.

Single-Channel Phase-Aware Signal Processing in Speech Communication provides a comprehensive guide to phase signal processing. It reviews the history of phase importance in the literature, basic problems in phase processing, and the fundamentals of phase estimation, together with several applications that demonstrate the usefulness of phase processing.

Key features:

  • Analysis of recent advances demonstrating the positive impact of phase-based processing in pushing the limits of conventional methods.
  • Offers unique coverage of the historical context and fundamentals of phase processing, with several examples in speech communication.
  • Provides a detailed review of many references and discusses the existing signal processing techniques required to deal with phase information in different applications involved with speech.
  • Supplies various examples and MATLAB® implementations delivered within the PhaseLab toolbox.

Single-Channel Phase-Aware Signal Processing in Speech Communication is a valuable single source for students, non-expert DSP engineers, and academics.


5-1-6 Jean Caelen, Anne Xuereb, 'Dialogue : altérité, interaction, énaction'

 

Jean Caelen, Anne Xuereb

Dialogue : altérité, interaction, énaction

Editions universitaires européennes

 


5-1-7 Bäckström, Tom (with Guillaume Fuchs, Sascha Disch, Christian Uhle and Jeremie Lecomte), 'Speech Coding with Code-Excited Linear Prediction', Springer


 Speech Coding with Code-Excited Linear Prediction

Author: Bäckström, Tom

Invited chapters from: Guillaume Fuchs, Sascha Disch, Christian Uhle and Jeremie Lecomte

Publisher: Springer

http://www.springer.com/gp/book/9783319502021


5-1-8 Shinji Watanabe, Marc Delcroix, Florian Metze, John R. Hershey (Eds), 'New Era for Robust Speech Recognition', Springer.

https://link.springer.com/book/10.1007%2F978-3-319-64680-0



