ISCA - International Speech
Communication Association


ISCApad Archive  »  2011  »  ISCApad #157  »  Resources

ISCApad #157

Tuesday, July 12, 2011 by Chris Wellekens

5 Resources
5-1 Books
5-1-1Alain Marchal, Christian Cave, L'imagerie medicale pour l'etude de la parole

Alain Marchal, Christian Cave

Eds Hermes Lavoisier

99 euros • 304 pages • 16 x 24 • 2009 • ISBN : 978-2-7462-2235-9

Du miroir laryngé à la vidéofibroscopie actuelle, de la prise d'empreintes statiques à la palatographie dynamique, des débuts de la radiographie jusqu'à l'imagerie par résonance magnétique ou la magnétoencéphalographie, cet ouvrage passe en revue les différentes techniques d'imagerie utilisées pour étudier la parole tant du point de vue de la production que de celui de la perception. Les avantages et inconvénients ainsi que les limites de chaque technique sont passés en revue, tout en présentant les principaux résultats acquis avec chacune d'entre elles ainsi que leurs perspectives d'évolution. Écrit par des spécialistes soucieux d'être accessibles à un large public, cet ouvrage s'adresse à tous ceux qui étudient ou abordent la parole dans leurs activités professionnelles comme les phoniatres, ORL, orthophonistes et bien sûr les phonéticiens et les linguistes.

Top

5-1-2Christoph Draxler, Korpusbasierte Sprachverarbeitung

Author: Christoph Draxler
Title: Korpusbasierte Sprachverarbeitung
Publisher: Narr Francke Attempto Verlag Tübingen
Year: 2008
Link: http://www.narr.de/details.php?catp=&p_id=16394

Summary: Spoken language is a major area of linguistic research and speech technology development. This handbook presents an introduction to the technical foundations and shows how speech data is collected, annotated, analysed, and made accessible in the form of speech databases. The book focuses on web-based procedures for the recording and processing of high quality speech data, and it is intended as a desktop reference for practical recording and annotation work. A chapter is devoted to the Ph@ttSessionz database, the first large-scale speech data collection (860+ speakers, 40 locations in Germany) performed via the Internet. The companion web site (http://www.narr-studienbuecher.de/Draxler/index.html) contains audio examples, software tools, solutions to the exercises, important links, and checklists. 

Top

5-1-3Robert M. Gray, Linear Predictive Coding and the Internet Protocol

Linear Predictive Coding and the Internet Protocol, by Robert M. Gray, a special edition hardback book from Foundations and Trends in Signal Processing (FnT SP). The book brings together two forthcoming issues of FnT SP, the first being a survey of LPC, the second a unique history of realtime digital speech on packet networks.

 

Volume 3, Issue 3                                                                                                                                                                                                 

A Survey of Linear Predictive Coding: Part 1 of LPC and the IP                                                                                                                                  

By Robert M. Gray (Stanford University)                                                                                                                                                                  

http://www.nowpublishers.com/product.aspx?product=SIG&doi=2000000029                                                                                                             

 

Volume 3, Issue  4

 

A History of Realtime Digital Speech on Packet Networks: Part 2 of LPC and the IP                                                                                                     

By Robert M. Gray (Stanford University)                                                                                                                                                                  

http://www.nowpublishers.com/product.aspx?product=SIG&doi=2000000036                                                                                                            

 

The links above will take you to the article abstracts.

Top

5-1-4M. Embarki and M. Ennaji, Modern Trends in Arabic Dialectology

Modern Trends in Arabic Dialectology,
M. Embarki & M. Ennaji (eds.), Trenton (USA): The Red Sea Press.

Contents
Introduction
Mohamed Embarki and Moha Ennaji
vii
Part I: Theoretical and Hi storical Perspectives
and Methods in Arabic Di alectology
Chapter 1 : Arabic Dialects: A Discussion
Janet C. E. Watson p. 3
Chapter 2 : The Emergence of Western Arabic: A Likely Consequence of Creolization
Federrico Corriente p. 39
Chapter 3 : Acoustic Cues for the Classification of Arabic Dialects
Mohamed Embarki p. 47
Chapter 4 : Variation and Attitudes:
A Sociolinguistic Analysis of the Qaaf
Maher Bahloul p. 69

Part II : Eastern Arabic Di alects
Chapter 5 : Arabic Bedouin Dialects and their Classification
Judith Rosenhouse p. 97
Chapter 6 : Evolution of Expressive Structures in Egyptian Arabic
Amr Helmy Ibrahim p. 121
Chapter 7 : ?adram? Arabic Lexicon
Abdullah Hassan Al-Saqqaf p. 139

Part III: Western Arabic Di alects
Chapter 8 : Dialectal Variation in Moroccan Arabic
Moha Ennaji p. 171
Chapter 9 : Formation and Evolution of Andalusi Arabic and its
Imprint on Modern Northern Morocco
Ángeles Vicente p. 185
Chapter 10 : The Phonetic Implementation of Falling Pitch Accents
in Dialectal Maltese: A Preliminary Study
of the Intonation of Gozitan ?ebbu?i
Alexandra Vella p. 211
Index p. 239



Top

5-1-5Gokhan Tur , R De Mori, Spoken Language Understanding: Systems for Extracting Semantic Information from Speech

Title: Spoken Language Understanding: Systems for Extracting Semantic Information from Speech

Editors: Gokhan Tur and Renato De Mori

Web: http://www.wiley.com/WileyCDA/WileyTitle/productCd-0470688246.html

Brief Description (please use as you see fit):

Spoken language understanding (SLU) is an emerging field in between speech and language processing, investigating human/ machine and human/ human communication by leveraging technologies from signal processing, pattern recognition, machine learning and artificial intelligence. SLU systems are designed to extract the meaning from speech utterances and its applications are vast, from voice search in mobile devices to meeting summarization, attracting interest from both commercial and academic sectors.

Both human/machine and human/human communications can benefit from the application of SLU, using differing tasks and approaches to better understand and utilize such communications. This book covers the state-of-the-art approaches for the most popular SLU tasks with chapters written by well-known researchers in the respective fields. Key features include:

Presents a fully integrated view of the two distinct disciplines of speech processing and language processing for SLU tasks.

Defines what is possible today for SLU as an enabling technology for enterprise (e.g., customer care centers or company meetings), and consumer (e.g., entertainment, mobile, car, robot, or smart environments) applications and outlines the key research areas.

Provides a unique source of distilled information on methods for computer modeling of semantic information in human/machine and human/human conversations.

This book can be successfully used for graduate courses in electronics engineering, computer science or computational linguistics. Moreover, technologists interested in processing spoken communications will find it a useful source of collated information of the topic drawn from the two distinct disciplines of speech processing and language processing under the new area of SLU.

Top

5-1-6Jody Kreiman, Diana Van Lancker Sidtis ,Foundations of Voice Studies: An Interdisciplinary Approach to Voice Production and Perception

Foundations of Voice Studies: An Interdisciplinary Approach to Voice Production and Perception
Jody Kreiman, Diana Van Lancker Sidtis
ISBN: 978-0-631-22297-2
Hardcover
512 pages
May 2011, Wiley-Blackwell

Foundations of Voice Studies provides a comprehensive description and analysis of the multifaceted role that voice quality plays in human existence.

•Offers a unique interdisciplinary perspective on all facets of voice perception, illustrating why listeners hear what they do and how they reach conclusions based on voice quality
•Integrates voice literature from a multitude of sources and disciplines
•Supplemented with practical and approachable examples, including a companion website with sound files, available on publication at www.wiley.com/go/voicestudies
•Explores the choice of various voices in advertising and broadcasting, and voice perception in singing voices and forensic applications
•Provides a straightforward and thorough overview of vocal physiology and control


Top

5-2 Database
5-2-1SpeechOcean China

SpeechOcean China also has about 200+ large language resources and some of databases can be freely used to our members for academic research purpose.  As a ISCA member, we will be also glad to share these databases to other ISCA members,

www.speechocean.com

Top

5-2-2ELRA - Language Resources Catalogue - Update (2011-05)


*****************************************************************
ELRA - Language Resources Catalogue - Update
*****************************************************************

ELRA is happy to announce that 2 new Multimodal and 3 new Speech Resources are now available in its catalogue.
Moreover, two Speech Resources previously announced are now available at better pricing conditions.

1) New Language Resources:

ELRA-S0314 LILA Marathi database
The LILA Marathi database comprises 2,002 Marathi speakers (992 males and 1010 females) recorded over the Korean mobile telephone network. Each speaker uttered around 46 read and spontaneous items.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1136

ELRA-S0315 A-SpeechDB
A-SpeechDB© is an Arabic speech database which contains about 20 hours of continuous speech recorded through one desktop omni microphone by 205 native speakers (about 30% of females and 70% of males), aged between 20 and 45. Automatically generated transcriptions are provided with a manually revised version for each sentence.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1140

ELRA-S0316 SmartKom Home (SKH)
Release SKH 1.0 contains 130 recordings in the technical setup ('scenario') SmartKom Home which should be an intelligent communication assistant for the private environment. Naive users were asked to test a 'prototype' for a market study not knowing that the system was in fact controlled by two human operators. They were asked to solve two tasks in a period of 4.5 minutes while they were left alone with the system.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1137

ELRA-S0317 SmartKom Mobil (SKM)
Release SKM 1.0 contains 146 recordings in the technical setup ('scenario') SmartKom Mobil which is a portable PDA equipped with a net link and additional intelligent communication devices. Naive users were asked to test a 'prototype' for a market study not knowing that the system was in fact controlled by two human operators. They were asked to solve two tasks in a period of 4,5 min while they were left alone with the system.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1138

ELRA-S0318 SmartKom Audio (SKAUDIO)
Release SKAUDIO 1.0 contains all audio channel recordings of the SmartKom corpora SmartKom Public (cf. ELRA-S0136), SmartKom Home (cf. ELRA-S0316) and SmartKom Mobil (cf. ELRA-S0317).
For more information, see: http://catalog.elra.info/product_info.php?products_id=1139


2) Revised Language Resources (new pricing conditions):

ELRA-S0136 SmartKom Public (SKP)
Release SKP 2.0 contains 172 recordings in the technical setup ('scenario') SmartKom Public which is comparable to a traditional public phone booth but equipped with additional intelligent communication devices. Naive users were asked to test a 'prototype' for a market study not knowing that the system was in fact controlled by two human operators. They were asked to solve two tasks in a period of 4.5 minutes while they were left alone with the system.
For more information, see: http://catalog.elra.info/product_info.php?cPath=37_39&products_id=1130

ELRA-S0281 LILA Hindi-L1 database
The LILA Hindi-L1 database comprises 2,030 Hindi speakers (1,012 males and 1,018 females, all speakers with Hindi as first language) recorded over the Indian mobile telephone network. Each speaker uttered around 60 read and spontaneous items.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1071


For more information on the catalogue, please contact Valérie Mapelli mailto:mapelli@elda.org

Visit our On-line Catalogue: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info
Archives of ELRA Language Resources Catalogue Updates: http://www.elra.info/LRs-Announcements.html 

Top

5-3 Software


 Organisation  Events   Membership   Help 
 > Board  > Interspeech  > Join - renew  > Sitemap
 > Legal documents  > Workshops  > Membership directory  > Contact
 > Logos      > FAQ
       > Privacy policy

© Copyright 2024 - ISCA International Speech Communication Association - All right reserved.

Powered by ISCA