ISCA - International Speech
Communication Association



ISCApad #157

Tuesday, July 12, 2011 by Chris Wellekens

3-3-16 (2011-09-01) MediaEval 2011 Benchmark Evaluation and Workshop
  
--------------------------------------------------
Call for Participation
MediaEval 2011 Benchmark Evaluation and Workshop
Official Satellite Event of Interspeech 2011
http://www.multimediaeval.org
--------------------------------------------------

The MediaEval benchmarking initiative and workshop provides a unique opportunity to work on new and interesting speech data within the context of forward-looking multimedia applications. MediaEval sets its focus on aspects of multimedia that go beyond visual content and, in particular, concentrates on the speech and language aspects of multimedia access and retrieval.

MediaEval tasks provide the research community with challenging opportunities to make use of speech recognition and audio analysis technology in interesting application scenarios. Participants carry out one or more tasks and submit runs to be evaluated. The MediaEval 2011 workshop provides a forum for presentation of results and is an official satellite event of Interspeech 2011.

For each task, participants receive a task definition, task data and accompanying resources (dependent on task) such as video shot boundaries, single-image keyframes, visual features, speech transcripts and social metadata. Participation is open to all interested research groups. To participate, please sign up by 31 May via http://www.multimediaeval.org.

MediaEval 2011 offers a wide selection of tasks. The following four are of particular interest to the speech and audio research communities:

Genre Tagging
Given a set of genre tags (how-to, interview, review, etc.) and a video collection, participants are required to automatically assign genre tags to each video based on a combination of modalities, i.e., speech, metadata, audio and visual. (Data: Creative Commons internet video, multiple languages, mostly English.)

Rich Speech Retrieval
Given a set of queries and a video collection, participants are required to automatically identify relevant jump-in points into the video based on a combination of modalities, i.e., speech, metadata, audio and visual. The task can be approached as a multimodal task, but also strictly as a speech search task. (Data: Creative Commons internet video, multiple languages, mostly English.)

Spoken Web Search
This task involves searching FOR audio content WITHIN audio content USING an audio content query. It is particularly interesting for speech researchers in the area of spoken term detection and low-resource speech recognition. About 400 audio recordings (4-30 sec in length each) from four different Indian languages -- English, Hindi, Gujarati and Telugu -- will be used.

Affect Task: Violent Scenes Detection
This task requires participants to deploy multimodal features to automatically detect portions of movies containing violent material. Any features automatically extracted from the video, including the subtitles, can be used by participants. (Data: A set of ca. 15 Hollywood movies that must be purchased by the participants.)

MediaEval 2011 Timeline
March-May: register and return usage agreements
1 June: release of development/training data
1 July: release of test data
8 August: run submission
22 August: working notes paper submission
1-2 September: MediaEval 2011 Workshop in Pisa

The MediaEval 2011 Workshop is an official satellite event of Interspeech 2011 (http://www.interspeech2011.org)

MediaEval 2011 Coordination
Martha Larson, Delft University of Technology
Gareth Jones, Dublin City University

MediaEval 2011 Organization Committee
Claire-Helene Demarty, Technicolor
Maria Eskevich, Dublin City University
Guillaume Gravier, IRISA/CNRS
Pascal Kelm, Technical University of Berlin
Florian Metze, CMU
Vasileios Mezaris, ITI CERTH
Vanessa Murdock, Yahoo! Research
Roeland Ordelman, University of Twente and Netherlands Institute for Sound & Vision
Adam Rae, Yahoo! Research
Nitendra Rajput, IBM Research India
Sebastian Schmiedeke, Technical University of Berlin
Pavel Serdyukov, Yandex
Mohammad Soleymani, University of Geneva
Raphael Troncy, Eurecom

Contact
For questions or additional information, please contact Martha Larson (m.a.larson@tudelft.nl).

MediaEval 2011 is coordinated by PetaMedia, an FP7 EU Network of Excellence (http://www.petamedia.eu), and by the OpenSem project of EIT ICT Labs (http://eit.ictlabs.eu). Many other projects make individual contributions to the organization, including AXES (http://www.axes-project.eu), Chorus+ (http://www.ist-chorus.org), Glocal (http://www.glocal-project.eu), Quaero (http://www.quaero.org) and weknowit (http://www.weknowit.eu).

