ISCA - International Speech Communication Association
It is our pleasure to introduce A||GO (https://allgo.inria.fr/ or http://allgo.irisa.fr/), a platform providing a collection of web-services for the automatic analysis of various data, including multimedia content across modalities. The platform builds on the back-end web service deployment infrastructure developed and maintained by Inria?s Service for Experimentation and Development (SED). Originally dedicated to multimedia content, A||GO progressively broadened to other fields such as computational biology, networks and telecommunications, computational graphics or computational physics. As part of the CNRS PlaSciDo initiative [1], the Linkmedia team at IRISA / Inria Rennes is making available via A||GO a number of web services devoted to multimedia content analysis across modalities (language, audio, image, video). The web services provided currently include research results from the Linkmedia team as well as contribution from a number of partners. A list of the services available by the date is given below and the current state is available at https://www-linkmedia.irisa.fr/software along with demo videos. Most web services are interoperable, facilitating the implementation of a multimedia content analysis processing chain, and are free to use for trial, prototyping or lab work. A brief and free account creation step will allow you to execute the web-services using either the graphical interface or a command line via a dedicated API. We expect the number of web services to grow over time and invite interested parties to contact us should they wish to contribute the multimedia web service offer of A||GO. List of multimedia content analysis tools currently available on A||GO: - Audio Processing SaMuSa: music/speech segmentation SilAD: silence detection Radi.sh: repeated audio motif discovery LORIA STS v2: speech transcription for the French language from LORIA Multi channel BSS locate: audio source localization toolbox from IRISA-PANAMA A-spade: audio declipper from IRISA-PANAMA Transvox: voice faker from LORIA - Natural Language Processing NERO: name entity recognition TermEx: keywords/indexing terms detection Otis!: topic segmentation Hi-tost: hierarchical topic structuring - Video Processing Vidseg: video shot segmentation HUFA: face detection and tracking Shortcuts to Linkmedia services are also available here: https://www-linkmedia.irisa.fr/software/ For more information don't hesitate to contact us (contact-multimedia-allgo@irisa.fr). Gabriel Sargent and Guillaume Gravier -- Linkmedia IRISA - CNRS Rennes, France
© Copyright 2024 - ISCA International Speech Communication Association - All right reserved.