| Call for papers
FIRST WORKSHOP ON SPEECH, LANGUAGE AND AUDIO IN MULTIMEDIA
Organized by ISCA SIG on Speech and Language in Multimedia IEEE SIG on Audio and Speech Processing for Multimedia
as a satellite event of Interspeech 2013
Aug. 22—23, 2013, Marseille, France
http://slam2013.lif.univ-mrs.fr
********************************************************************* ********************************************************************* *********************************************************************
The first Workshop on Speech, Language and Audio in Multimedia (SLAM) aims at bringing together researchers working in speech, language and audio processing to analyze, index and access multimedia data. Multimedia data are now available in very large amounts with a wide variety of formats and qualities, from professional content to user-generated ones: Lectures, meetings, interviews, debates, conversational broadcast, podcasts, social videos on the Web, etc. Such data, along with the associated use scenarios, raise specific challenges: Robustness facing the high variability in quality; Efficiency to handle very large amount of data; Semantics shared across modalities; Potentially high error rates in transcription; etc. Worldwide, several national and international research projects are focusing on audio analysis of multimedia data. Similarly, various benchmark initiatives have been initiated such as TRECVID MED, MediaEval, or ETAPE and REPERE in France.
The SLAM workshop intends to bring together players from the field to share recent research results, discuss ongoing and future projects, benchmarking initiatives and applications. We expect communications on research work, project description, evaluation initiative, demonstrations and applications dealing with speech and/or language and/or audio on any type of multimedia material. The list of topics of interest includes (but is not limited to):
Audio event detection and audio classification Speech and speaker recognition on multimedia material Audio-aware genre analysis and classification Multimodal speaker identification and clustering Multimedia spoken term detection and content retrieval Speech and audio aware content segmentation and structuring Audio indexing and fingerprinting Robust feature extraction and processing Natural language processing for multimedia Metadata extraction : Entity extraction, keyword extraction, etc. Summarization and hyperlink generation Multimodal fusion and integration involving audio Generation of descriptive text for multimedia Speech and audio multimedia applications and services Databases and benchmarks Large scale speech and audio analysis Navigation in multimedia audio content
applied to any media such as:
Professional broadcasts – TV, radio, podcasts, newsfeeds, synopsis, etc. Social media – YouTube, Flickr, Facebook, Twitter, Voice Social Network, etc. Audiovisual archives – Lectures and conferences, meetings, etc. Music – Music catalogs, music collections, etc.
The workshop is organized in conjunction with Interspeech 2013 over 1.5 days, starting Thu. 22, 2013 at mid-day and ending Fri. 23, 2013 afternoon, right before the main conference. Marseille is conveniently connected by high-speed train to Lyon where the Interspeech conference will take place. The format of the workshop will include an invited talk, oral presentations of scientific work and a poster session for project and benchmark presentations.
Prospective authors should submit a manuscript following instructions on the web site. All submissions will be peer-reviewed by at least two reviewers. Proceedings will be published online in open access and a special issue in a dedicated journal is targeted.
*Important dates* full paper submission deadline: April 15, 2013 notification of acceptance: May 17, 2013 camera ready paper: June 28, 2013 workshop: August 22-23, 2013 Interspeech conference: August 25-30, 2013
The first SLAM workshop is jointly organized by the newly created ISCA SIG on Speech and Language in Multimedia and by the IEEE SIG on Audio and Speech Processing in Multimedia. This first edition is intended as the first of a series of workshop.
General Chairs: Frédéric Bechet (LIF-CNRS, Aix Marseille Université) Guillaume Gravier (IRISA, CNRS)
Scientific committee: Xavier Anguera (Telefónica) Frédéric Bechet (Aix Marseille Université) Delphine Charlet (Orange Labs) Gerald Friedland (ICSI) Sadaoki Furui (Tokyo Institute of Technology) Guillaume Gravier (CNRS) Gareth Jones (Dublin City University) Martha Larson (TU Delft) Lin-Shan Lee (National Taiwan University) Georges Linarès (Université d'Avignon) Florian Metze (CMU)
Organizing committee: Hervé Bredin (LIMSI, CNRS) Benoît Favre (LIF-CNRS, Aix Marseille Université) Sylvain Meigner (LIUM, Université du Maine) Christian Raymond (IRISA, INSA Rennes) |