ISCApad #146 |
Tuesday, August 10, 2010 by Chris Wellekens |
5-1-1 | Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods
Joseph Keshet and Samy Bengio, Editors
John Wiley & Sons
March, 2009
Website: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods
About the book:
This is the first book dedicated to uniting research on speech and speaker recognition based on recent advances in large margin and kernel methods. The first part of the book presents the theoretical and practical foundations of large margin and kernel methods, from support vector machines to large margin methods for structured learning. The second part is dedicated to acoustic modeling for continuous speech recognizers, laying the groundwork for practical large margin sequence learning. The third part introduces large margin methods for discriminative language modeling. The last part is dedicated to applications: keyword spotting, speaker verification and spectral clustering.
Contributors: Yasemin Altun, Francis Bach, Samy Bengio, Dan Chazan, Koby Crammer, Mark Gales, Yves Grandvalet, David Grangier, Michael I. Jordan, Joseph Keshet, Johnny Mariéthoz, Lawrence Saul, Brian Roark, Fei Sha, Shai Shalev-Shwartz, Yoram Singer, and Nathan Srebro.
| ||
5-1-2 | Some aspects of Speech and the Brain.
Susanne Fuchs, Hélène Loevenbruck, Daniel Pape, Pascal Perrier
Editions Peter Lang, January 2009
What happens in the brain when humans produce speech or listen to it? This is the main focus of the book, a collection of 13 articles written by researchers at some of the foremost European laboratories in the fields of linguistics, phonetics, psychology, cognitive sciences and neurosciences.
| ||
5-1-3 | Spoken Language Processing
Edited by Joseph Mariani (IMMI and
Publisher: ISTE-Wiley
Speech processing addresses various scientific and technological areas. It includes speech analysis and variable rate coding, in order to store or transmit speech. It also covers speech synthesis, especially from text, speech recognition, including speaker and language identification, and spoken language understanding. This book covers the following topics: how to realize speech production and perception systems, how to synthesize and understand speech using state-of-the-art methods in signal processing, pattern recognition, stochastic modeling, computational linguistics and human factor studies.
| ||
5-1-4 | L'imagerie médicale pour l'étude de la parole
Alain Marchal, Christian Cave (Eds.)
Hermes Lavoisier
99 euros • 304 pages • 16 x 24 • 2009 • ISBN: 978-2-7462-2235-9
From the laryngeal mirror to today's video fibroscopy, from static impression-taking to dynamic palatography, from the early days of radiography to magnetic resonance imaging and magnetoencephalography, this book reviews the imaging techniques used to study speech, from the standpoint of both production and perception. It examines the advantages, drawbacks and limits of each technique, and presents the main results obtained with each one along with its prospects for development. Written by specialists keen to be accessible to a broad audience, this book is aimed at everyone who studies or deals with speech in their professional work, such as phoniatricians, ENT specialists, speech therapists and, of course, phoneticians and linguists.
| ||
5-1-5 | Korpusbasierte Sprachverarbeitung Author: Christoph Draxler
| ||
5-1-6 | Linear Predictive Coding and the Internet Protocol, by Robert M. Gray
A special edition hardback book from Foundations and Trends in Signal Processing (FnT SP). The book brings together two forthcoming issues of FnT SP, the first being a survey of LPC, the second a unique history of realtime digital speech on packet networks.
Volume 3, Issue 3
A Survey of Linear Predictive Coding: Part 1 of LPC and the IP By Robert M. Gray (Stanford University) http://www.nowpublishers.com/product.aspx?product=SIG&doi=2000000029
Volume 3, Issue 4
A History of Realtime Digital Speech on Packet Networks: Part 2 of LPC and the IP By Robert M. Gray (Stanford University) http://www.nowpublishers.com/product.aspx?product=SIG&doi=2000000036
The links above will take you to the article abstracts.
|