ISCA Services

ISCA - International Speech
Communication Association

ISCApad Archive » 2017 » ISCApad #224 » Journals » CfP IEEE Journal of Selected Topics in Signal Processing: Special Issue on End-to-End Speech and Language Processing

ISCApad #224

Saturday, February 11, 2017 by Chris Wellekens

7-14 CfP IEEE Journal of Selected Topics in Signal Processing: Special Issue on End-to-End Speech and Language Processing

CALL FOR PAPERS

IEEE Journal of Selected Topics in Signal Processing

Special Issue on End-to-End Speech and Language Processing

End-to-end (E2E) systems have achieved competitive results compared to conventional hybrid Hidden Markov-deep neural

network model-based automatic speech recognition (ASR) systems. Such E2E systems are attractive because they do not require

initial alignments between input acoustic features and output graphemes or words. Very deep convolutional networks and recurrent

neural networks have also been very successful in ASR systems due to their added expressive power and better generalization.

ASR is often not the end goal of real-world speech information processing systems. Instead, an important end goal is information

retrieval, in particular keyword search (KWS), that involves retrieving speech documents containing a user-specified query from a

large database. Conventional keyword search uses an ASR system as a front-end that converts the speech database into a finitestate

transducer (FST) index containing a large number of likely word or sub-word sequences for each speech segment, along with

associated confidence scores and time stamps. A user-specified text query is then composed with this FST index to find the

putative locations of the keyword along with confidence scores. More recently, inspired by E2E approaches, ASR-free keyword

search systems have been proposed with limited success. Machine learning methods have also been very successful in Question-

Answering, parsing, language translation, analytics and deriving representations of morphological units, words or sentences.

Challenges such as the Zero Resource Speech Challenge aim to construct systems that learn an end-to-end Spoken Dialog (SD)

system, in an unknown language, from scratch, using only information available to a language learning infant (zero linguistic

resources). The principal objective of the recently concluded IARPA Babel program was to develop a keyword search system that

delivers high accuracy for any new language given very limited transcribed speech, noisy acoustic and channel conditions, and

limited system build time of one to four weeks. This special issue will showcase the power of novel machine learning methods not

only for ASR, but for keyword search and for the general processing of speech and language.

Topics of interest in the special issue include (but are not limited to):

• Novel end-to-end speech and language processing

• Query-by-example search

• Deep learning based acoustic and word representations

• Query-by-example search

• Question answering systems

• Multilingual dialogue systems

• Multilingual representation learning

• Low and zero resource speech processing

• Deep learning based ASR-free keyword search

• Deep learning based media retrieval

• Kernel methods applied to speech and language processing

• Acoustic unit discovery

• Computational challenges for deep end-to-end systems

• Adaptation strategies for end to end systems

• Noise robustness for low resource speech recognition systems

• Spoken language processing: speech to speech translation,

speech retrieval, extraction, and summarization

• Machine learning methods applied to morphological,

syntactic, and pragmatic analysis

• Computational semantics: document analysis, topic

segmentation, categorization, and modeling

• Named entity recognition, tagging, chunking, and parsing

• Sentiment analysis, opinion mining, and social media analytics

• Deep learning in human computer interaction

Dates:

Manuscript submission: April 1, 2017

First review completed: June 1, 2017

Revised Manuscript Due: July 15, 2017

Second Review Completed: August 15, 2017

Final Manuscript Due: September 15, 2017

Publication: December. 2017

Guest Editors:

Nancy F. Chen, Institute for Infocomm Research (I2R), A*STAR, Singapore

Mary Harper, Army Research Laboratory, USA

Brian Kingsbury, IBM Watson, IBM T.J. Watson Research Center, USA

Kate Knill, Cambridge University, U.K.

Bhuvana Ramabhadran, IBM Watson, IBM T.J. Watson Research Center, USA

Back

Top

Organisation	Events	Membership	Help
> Board	> Interspeech	> Join - renew	> Sitemap
> Legal documents	> Workshops	> Membership directory	> Contact
> Logos			> FAQ
			> Privacy policy