ISCA - International Speech
Communication Association


ISCApad Archive  »  2012  »  ISCApad #168  »  Journals  »  IEEE Signal Processing Magazine: Special Issue on Fundamental Technologies in Modern Speech Recognition

ISCApad #168

Sunday, June 10, 2012 by Chris Wellekens

7-2 IEEE Signal Processing Magazine: Special Issue on Fundamental Technologies in Modern Speech Recognition
  

IEEE Signal Processing Magazine

Special Issue on Fundamental Technologies in Modern Speech Recognition

                            

Guest Editors:                      

Sadaoki Furui   Tokyo Institute of Technology, Tokyo, Japan  (furui@cs.titech.ac.jp)

Li Deng         Microsoft Research, Redmond, USA (deng@microsoft.com)

Mark Gales      University of Cambridge, Cambridge, UK (mjfg@eng.cam.ac.uk)

Hermann Ney     RWTH Aachen University, Aachen, Germany  (ney@cs.rwth-aachen.de)

Keiichi Tokuda  Nagoya Institute of Technology, Nagoya, Japan  (tokuda@nitech.ac.jp)

 Recently, various statistical techniques that form the basis of fundamental technologies underlying today’s automatic speech recognition (ASR) research and applications have attracted new attentions. These techniques have significantly contributed to progress in ASR, including speaker recognition, and their various applications. 

The purpose of this special issue is to bring together leading experts from various disciplines to explore the impact of statistical approaches on ASR.

The special issue will provide a comprehensive overview of recent developments and open problems.

This Call for Papers invites researchers to contribute articles that have a broad appeal to the signal processing community.  Such an article could be for example a tutorial of the fundamentals or a presentation of a state-of-the-art method.  Examples of the topics that could be addressed in the article include, but are not limited to:

 * Supervised, unsupervised, and lightly supervised training/adaptation

 * Speaker-adaptive and noise-adaptive training

 * Discriminative training

 * Large-margin based methods

 * Model complexity optimization

 * Dynamic Bayesian networks for various levels of speech modeling and decoding

 * Deep belief networks and related deep learning techniques

 * Sparse coding for speech feature extraction and modeling

 * Feature parameter compensation/normalization

 * Acoustic factorization

 * Conditional random fields (CRF) for modeling and decoding

 * Acoustic source separation by PCA and ICA

 * De-reverberation

 * Rapid language adaptation for multilingual speech recognition

 * Weighted-finite-state-transducer (WFST) based decoding

 * Uncertainty decoding

 * Speaker recognition, especially text-independent speaker verification

 * Statistical framework for human-computer dialogue modeling

 * Automatic speech summarization and information extraction

 

Submission Procedure:

Prospective authors should submit their white papers to the web submission system at

http://mc.manuscriptcentral.com/spmag-ieee.

 

Schedule:

 * White paper due:         October 1, 2011

 * Invitation notification: November 1, 2011

 * Manuscript due:          February 1, 2012

 * Acceptance notification: April 1, 2012

 * Final manuscript due:    May 15, 2012

 * Publication date:        September 15, 2012 


Back  Top


 Organisation  Events   Membership   Help 
 > Board  > Interspeech  > Join - renew  > Sitemap
 > Legal documents  > Workshops  > Membership directory  > Contact
 > Logos      > FAQ
       > Privacy policy

© Copyright 2024 - ISCA International Speech Communication Association - All right reserved.

Powered by ISCA