ISCA - International Speech
Communication Association



ISCApad #203

Saturday, May 16, 2015 by Chris Wellekens

3-1 ISCA Events
3-1-1(2015-09-06) Call for Satellite Workshops of INTERSPEECH 2015, Dresden, Germany
**** Call for Satellite Workshops **** 
INTERSPEECH 2015 will be held in the beautiful city of Dresden, Germany, on September 6-10, 2015.
The theme is 'Speech beyond Speech - Towards a Better Understanding of the Most Important 
Biosignal'. The Organizing Committee of INTERSPEECH 2015 is now inviting proposals for 
satellite workshops, which will be held in proximity to the main conference. 
The Organizing Committee will work to facilitate the organization of such satellite workshops, 
to stimulate discussion in research areas related to speech and language, at locations in Central 
Europe, and around the same time as INTERSPEECH. We are particularly looking forward to 
proposals from neighboring countries. If you are interested in organizing a satellite workshop, 
or would like a planned event to be listed as an official satellite event, please contact the organizers
 or the Satellite Workshop Chair at fmetze@cs.cmu.edu. The Satellite Workshop coordinator along 
with the INTERSPEECH team will help to connect (potential) workshop organizers with local 
contacts in Germany, if needed, and will try to be helpful with logistics such as payment, publicity,
 and coordination with ISCA or other events. Proposals should include:
* workshop name and acronym 
* organizers' names and contact info 
* website (if already known) 
* date and proposed location of the workshop 
* estimated number of participants 
* a short description of the motivation for the workshop 
* an outline of the program and invited speakers 
* a description of the submission process (e.g. deadlines, target acceptance rate) 
* a list of the scientific committee members 
 
Proposals for satellite workshops should be submitted by email to workshops@interspeech2015.org
 by August 31st, 2014. We strongly recommend that organizers also apply for
 ISCA approval/sponsorship, which will greatly facilitate acceptance as an INTERSPEECH satellite 
event. We plan to notify proposers no later than October 30, 2014. If you have any questions about 
whether a potential event would be a good candidate for an INTERSPEECH 2015 satellite workshop, 
feel free to contact the INTERSPEECH 2015 Satellite Workshops Chair. 
 
Sincerely, 
Florian Metze 
Satellite Workshops Chair fmetze@cs.cmu.edu

 

 

3-1-2(2015-09-06) 3rd and Final Call for INTERSPEECH 2015, Sep 6-10, Dresden, Germany.


 

INTERSPEECH is the world’s largest and most comprehensive conference on the science and technology of spoken language processing. INTERSPEECH conferences emphasize interdisciplinary approaches addressing all aspects of speech science and technology, ranging from basic theories to applications.

 

INTERSPEECH 2015 in Dresden (Germany) will be organized around the theme Speech beyond Speech: Towards a Better Understanding of the Most Important Biosignal, which acknowledges the fact that speech is the most important biosignal humans can produce and perceive. It is evident that not all characteristics of speech are already fully understood. We therefore encourage contributions that analyze and model speech as a biosignal in a broad understanding, e.g. for extracting information about the speaker, for identifying processes leading to speech production, or for generating speech signals with specific bio-characteristics. Contributions to all other areas of speech science and technology are also welcome.

 

 

Important Dates

---------------------

20 March 2015      Paper: submission deadline
20 March 2015      Tutorial: submission deadline
17 April 2015      Show and Tell: submission deadline
10 June 2015       Paper: camera-ready
10 June 2015       Show and Tell: camera-ready
20 June 2015       Early registration deadline
6-10 Sep 2015      Conference in Dresden, Germany

 

 

 

 

INTERSPEECH 2015 hosts a wide range of Events, e.g. Special Sessions and Workshops

-------------------------------------------------------------------------------------------------------------------

 

10 Special Sessions

- Active Perception in Human and Machine Speech Communication

- Biosignal-based Spoken Communication

- Interspeech 2015 Computational Paralinguistics Challenge (ComParE): Degree of Nativeness, Parkinson’s & Eating Condition

- Automatic Speaker Verification Spoofing and Countermeasures

- Zero Resource Speech Technologies: Unsupervised Discovery of Linguistic Units

- Robust Speech Processing using Observation Uncertainty and Uncertainty Propagation

- Speech Science in End User Applications

- Synergies of Speech and Multimedia Technologies

- Speech and Language Processing of Children’s Speech

- Advanced Crowdsourcing for Speech and Beyond

 

 

10 Satellite Workshops

- Errors by Humans and Machines in multimedia, multimodal and multilingual data processing (ERRARE)

- Speech and Language Processing for Assistive Technologies (SLPAT)

- Workshop on Speech and Language Technology for Education (SLaTE)

- International Workshop on the History of Speech Communication Research (HSCR)

- Workshop on Speech and Audio Technologies for the Digital Humanities (SAT4DH)

- Blizzard Challenge Workshop

- Special Interest Group on Discourse and Dialogue (SIGDIAL)

- International Workshop on Speech Robotics (IWSR)

- The 1st Joint Conference on Facial Analysis, Animation and Audio-Visual Speech Processing (FAAVSP)

- MediaEval Benchmarking Initiative for Multimedia Evaluation (MediaEval)

 

 

3 Related Events

- Speech Technology and Human-Computer Dialogue (SpeD)

- The 1st Joint Conference on Facial Analysis, Animation and Audio-Visual Speech Processing (FAAVSP)

- International Conference on Text, Speech and Dialogue (TSD)

 

 

Visit www.interspeech2015.org

 

 

 

 

********************************************************

Dr. Tim Polzehl

Quality and Usability Lab, Telekom Innovation Laboratories, SoftwareCampus

Technische Universität Berlin

E-mail: tim.polzehl@telekom.de

 


 

 


3-1-3(2015-09-06) Calls for INTERSPEECH 2015, Dresden, Germany

Calls for INTERSPEECH 2015

 

INTERSPEECH is the world’s largest and most comprehensive conference on the science and technology of spoken language processing. INTERSPEECH conferences emphasize interdisciplinary approaches addressing all aspects of speech science and technology, ranging from basic theories to applications.

 

INTERSPEECH 2015 will be organized around the theme Speech beyond Speech: Towards a Better Understanding of the Most Important Biosignal, which acknowledges the fact that speech is the most important biosignal humans can produce and perceive. It is evident that not all characteristics of speech are already fully understood. We therefore encourage contributions that analyze and model speech as a biosignal in a broad understanding, e.g. for extracting information about the speaker, for identifying processes leading to speech production, or for generating speech signals with specific bio-characteristics. Contributions to all other areas of speech science and technology are also welcome.

 

Important Dates

30 Nov 2014     special sessions: paper submission deadline
15 Dec 2014     special sessions: notification of acceptance/rejection for pre-selection
20 Mar 2015     tutorial: submission deadline
20 Mar 2015     paper: submission deadline
01 May 2015     tutorial: notification of acceptance/rejection
17 Apr 2015     show and tell: submission deadline
01 Jun 2015     paper: notification of acceptance/rejection
01 Jun 2015     show and tell: notification of acceptance/rejection
10 Jun 2015     paper: camera-ready
10 Jun 2015     show and tell: camera-ready
20 Jun 2015     early registration deadline
6-10 Sep 2015   conference

 

Call for Papers

The Organizing Committee of INTERSPEECH 2015 is now inviting paper submissions. Please refer to our CfP page for more details.

15.09.2014 – Details on paper submission and topics can be found on our CfP page.

 

Call for Special Sessions

Submissions of Special Sessions & Challenges proposals are encouraged for INTERSPEECH 2015, covering interdisciplinary topics and/or important new emerging areas of interest related to the main conference topics.

16.10.2014 – For more information please refer to our SS-CfP page.

 

Call for Show and Tell

The Organizing Committee of INTERSPEECH 2015 is now inviting contributions to the Show and Tell special event. Please refer to our Show and Tell page for more details.

17.04.2015 – Full submission deadline. Proposals should be submitted by email to is2015showntell@gmx.de.

 

Call for Tutorials

The Organizing Committee of INTERSPEECH 2015 is now inviting proposals for tutorials, which will be held on September 6 (first day of the conference) at the main venue, the Maritim International Congress Center (ICD). Please refer to the Call for Tutorials for more details.

15.10.2014 – Proposals for tutorials should be submitted by email to Alexander Raake.

 

Call for Satellite Workshops

The Organizing Committee of INTERSPEECH 2015 is now inviting proposals for satellite workshops, which will be held in proximity to the main conference. For more details, see http://www.signalprocessingsociety.org/technical-committees/list/sl-tc/spl-nl/2014-05/2014-05-is2015/

Proposals for satellite workshops should be submitted by email to Florian Metze.

 


3-1-4(2015-09-06) Interspeech 2015 Call for tutorials

General conference information – INTERSPEECH 2015

INTERSPEECH is the world's largest and most comprehensive conference on the science and technology of spoken language processing. INTERSPEECH conferences emphasize interdisciplinary approaches addressing all aspects of speech science and technology, ranging from basic theories to applications. INTERSPEECH 2015 will be organized around the theme Speech beyond Speech: Towards a Better Understanding of the Most Important Biosignal, which acknowledges the fact that speech is the most important biosignal humans can produce and perceive. It is evident that not all characteristics of speech are already fully understood. We therefore encourage contributions that analyze and model speech as a biosignal in a broad understanding, e.g. for extracting information about the speaker, for identifying processes leading to speech production, or for generating speech signals with specific bio-characteristics. Contributions to all other areas of speech science and technology are also welcome.

 

Call for tutorials

INTERSPEECH conferences are attended by researchers with a long-term track-record in speech sciences and technology, as well as by early-stage researchers or researchers interested in a new domain within the INTERSPEECH areas. With the current edition of INTERSPEECH, we seek to specifically reflect different audiences in the choice of tutorials, where it will clearly be indicated what prior knowledge the audience should have.

As a consequence, we encourage proposals for tutorials addressing introductory topics or advanced topics in an introductory style, and tutorials targeting experienced researchers who want to dig deeper into a given new topic.

Tutorials may introduce an emerging area of speech-related research, or present an overview of an important, more established area.

The tutorials can be of either three hours or six hours duration. The tutorials are expected to cover the state of the art of the proposed topics rather than to primarily present the presenters' own research.

INTERSPEECH 2015 will be organized around the theme Speech beyond Speech: Towards a Better Understanding of the Most Important Biosignal, which acknowledges the fact that speech is the most important biosignal humans can produce and perceive. As a consequence, we especially encourage tutorials from fields that are directly related to this theme. Of course, the call is not restricted to the conference theme and is open to any kind of tutorial related to speech science and technology.

Tutorial Proposal Submission

Proposals for the INTERSPEECH 2015 tutorials should be up to 4 pages long. Proposal submissions must conform to the format defined in the tutorial proposal preparation guidelines and provided in the authors’ kit on the INTERSPEECH 2015 website, along with the Call for Tutorials.

Tutorial proposal submission deadline is on Friday 20 March 2015 midnight (11:59pm) GMT. There will be no extension of deadlines. The working language of the conference is English, and proposals must be written in English, too.

 

All proposals should be submitted to tutorials@interspeech2015.org

Preliminary Tutorial Proposal Template

  1. Tutorial Title
  2. Duration [3 hours – 1 session, 6 hours – 2 sessions]
  3. Presenter(s) [Name and Affiliation]
  4. Description of the proposal [1-2 pages description plus a few relevant references and any webpages/material useful for reviewing the proposal]
  5. Explanation of relevance of the proposed tutorial [0.5-1 page]
  6. Description of Presentation format [e.g. one or more presenters, video conferencing or other media, etc.]
  7. Special equipment required for the tutorial
  8. Contact Information [email, Telephone]
  9. Biography of Presenter(s)
  10. Key Publications of Presenter(s) on the tutorial topic
  11. Target Audience [e.g. new researchers to the field, research students, specialists of adjacent fields, etc.]
  12. Estimated number of participants, possibly first names of interested researchers
  13. Provided accompanying material [handouts, storage devices with media, etc.]
  14. Other considerations/comments

Important Dates

Tutorial submission deadline: 20 March 2015

Notification of acceptance/rejection: 1 May 2015

Conference dates: 6-10 Sept 2015

 

We look forward to welcoming you to INTERSPEECH 2015 in Dresden!

Alexander Raake
Tutorial Chair
Email: tutorials@interspeech2015.org

 

 

Assessment of IP-based Applications

TU Berlin & Deutsche Telekom/T-Labs (Research & Innovation)
Prof. Dr.-Ing. Alexander Raake
Ernst-Reuter-Platz 7, 10587 Berlin
+4930835358462 (Tel.)

+49391580239574 (Fax)

E-Mail: alexander.raake@telekom.de

www.aipa.tu-berlin.de

www.laboratories.telekom.com


3-1-5(2015-09-06) CfP INTERSPEECH 2015 Special Session on Synergies of Speech and Multimedia Technologies

INTERSPEECH 2015
Call for papers: INTERSPEECH 2015 Special Session on
Synergies of Speech and Multimedia Technologies

Paper submission deadline: March 20, 2015
Special Session page:
http://multimediaeval.org/files/Interspeech2015_specialSession_SynergiesOfSpeechAndMultimediaTechnologies.html
Motivation:

Growing amounts of multimedia content are being shared or stored in
online archives. Alternative research directions in the speech
processing and multimedia analysis communities are developing and
improving speech or multimedia processing technologies in parallel,
often using each other's work as “black boxes”. However, genuine
combination would appear to be a better strategy to exploit the
synergies between the modalities of content containing multiple
potential sources of information.

This session seeks to bring together the speech and multimedia research
communities to report on current work and to explore potential synergies
and opportunities for creative research collaborations between speech
and multimedia technologies. From the speech perspective the session
aims to explore how fundamentals of speech technology can benefit
multimedia applications, and from the multimedia perspective to explore
the crucial role that speech can play in multimedia analysis.

The list of topics of interest includes (but is not limited to):

- Navigation in multimedia content using advanced speech analysis features;
- Large-scale speech and video analysis;
- Multimedia content segmentation and structuring using audio and visual
features;
- Multimedia content hyperlinking and summarization;
- Natural language processing for multimedia;
- Multimodality-enhanced metadata extraction, e.g. entity extraction,
keyword extraction, etc;
- Generation of descriptive text for multimedia;
- Multimedia applications and services using speech analysis features;
- Affective and behavioural analytics based on multimodal cues;
- Audio event detection and video classification;
- Multimodal speaker identification and clustering.

Important dates:

20 Mar 2015 paper submission deadline
01 Jun 2015 paper notification of acceptance/rejection
10 Jun 2015 paper camera-ready
20 Jun 2015 early registration deadline
6-10 Sept 2015 Interspeech 2015, Dresden, Germany

Submission takes place via the general Interspeech submission
system. Paper contributions must comply with the INTERSPEECH paper
submission guidelines, cf. http://interspeech2015.org/papers.
There will be no extension to the full paper submission deadline.
We are looking forward to receiving your contribution!

Organizers:

- Maria Eskevich, Communications Multimedia Group, EURECOM, France
(maria.eskevich@eurecom.fr)
- Robin Aly, Database Management Group, University of Twente, The
Netherlands (r.aly@utwente.nl)
- Roeland Ordelman, Human Media Interaction Group, University of Twente,
The Netherlands (roeland.ordelman@utwente.nl)
- Gareth J.F. Jones, CNGL Centre for Global Intelligent Content, Dublin
City University, Ireland (gjones@computing.dcu.ie)


3-1-6(2016) INTERSPEECH 2016, San Francisco, CA, USA

Interspeech 2016 will take place

from September 8-12, 2016 in San Francisco, CA, USA

General Chair is Nelson Morgan.

You may from now on be tempted by the nice pictures of the cover page of its tentative website http://www.interspeech2016.org

 


3-1-7 INTERSPEECH 2015 Update (April 2015)

 

A View from Dresden onto the History of Speech Communication

Part 4: Helmholtz Resonators

Complete article including figures available at: http://interspeech2015.org/conference/historical-review/

Hermann von Helmholtz (1821–1894) was a German physician and physicist who made important contributions in many areas of science. One of these areas was acoustics, where he published the famous book 'On the sensations of tone as a physiological basis for the theory of music' in 1863. There he described his invention of a special type of resonator, which is now known as the Helmholtz resonator. These resonators were devised as highly sensitive devices to identify the harmonic components (partial tones) of sounds and allowed significant advances in the acoustic analysis of vowel sounds and musical instruments.

Before the invention of Helmholtz resonators, strong partial tones in a sound wave were typically identified by very thin, elastic membranes that were stretched over circular rings, similar to drums. Such a membrane has a certain resonance frequency (in fact multiple frequencies) that depends on its material, tension, and radius. If the sound field around the membrane contains energy at this frequency, the membrane is excited and starts to oscillate. The tiny amplitudes of this oscillation can be visually detected when fine-grained sand is distributed over its surface. When the membrane is excited at its lowest resonance frequency, the sand accumulates at the rim of the membrane; when higher-order modes are excited, it accumulates along specific lines on its surface. With a set of membranes tuned to different frequencies, a rough spectral analysis can be conducted.

It was also known that the sensitivity of this method could be improved when the membrane was spanned over the (removed) bottom of a bottle with an open neck end. The key idea of Helmholtz was to replace this bottle by a hollow sphere with an open neck at one 'end' and another small spiky opening at the opposite 'end'. The spiky opening had to be inserted into one ear canal. In this way, the eardrum was excited similarly to the membrane with the sand of the previous technique. However, due to the high sensitivity of the ear, partial tones could be detected much more easily. A further advantage of these resonators was that their resonance frequencies can be expressed analytically in terms of the volume of the sphere and the diameter and the length of the neck. Hence these resonators became important experimental tools for the subjective sound analysis in the late 19th century and the early 20th century.
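The analytic relation mentioned above is commonly written, for an idealized resonator, as f = c / (2π) · sqrt(A / (V · L)), where A is the cross-sectional area of the neck, V the volume of the cavity, L the (effective) neck length and c the speed of sound. The following is a minimal sketch under these textbook assumptions; the function name, the optional end correction parameter and the example dimensions are hypothetical and are not taken from the article or from any resonator in the collection.

```python
import math


def helmholtz_frequency(volume_m3, neck_diameter_m, neck_length_m,
                        speed_of_sound_m_s=343.0, end_correction_m=0.0):
    """Resonance frequency (Hz) of an idealized Helmholtz resonator.

    Uses f = c / (2*pi) * sqrt(A / (V * L_eff)), where A is the
    cross-sectional area of the neck, V the cavity volume and L_eff the
    neck length plus an optional end correction (assumed 0 by default).
    """
    neck_area_m2 = math.pi * (neck_diameter_m / 2.0) ** 2
    effective_length_m = neck_length_m + end_correction_m
    return (speed_of_sound_m_s / (2.0 * math.pi)
            * math.sqrt(neck_area_m2 / (volume_m3 * effective_length_m)))


# Hypothetical dimensions (1 litre cavity, 2 cm wide and 5 cm long neck),
# chosen only for illustration.
example_hz = helmholtz_frequency(volume_m3=1.0e-3,
                                 neck_diameter_m=0.02,
                                 neck_length_m=0.05)
print(f"{example_hz:.1f} Hz")
```

In this idealization, doubling the cavity volume lowers the resonance frequency by a factor of sqrt(2), which is why a set of spheres of different sizes can cover a range of partial tones.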

The HAPS at the TU Dresden contains three sets of Helmholtz resonators. The biggest of these sets contains 11 resonators, which are tuned to frequencies between 128 Hz and 768 Hz. The HAPS also contains a related kind of resonator that was invented by Schaefer (1902). These resonators are tubes with one open end and one closed end. The closed end also has a small spiky opening that has to be inserted into the ear canal. These resonators respond maximally to frequencies whose wavelength is four times the length of the tube.
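The quarter-wavelength condition for a tube with one open and one closed end can be turned into a small calculation: if the wavelength is four times the tube length L, the corresponding frequency is f = c / (4 · L). The sketch below assumes a speed of sound of 343 m/s and ignores end corrections; the function names and the example tube length are hypothetical.

```python
def quarter_wave_frequency(tube_length_m, speed_of_sound_m_s=343.0):
    """Lowest resonance (Hz) of a tube closed at one end and open at the other."""
    return speed_of_sound_m_s / (4.0 * tube_length_m)


def quarter_wave_length(frequency_hz, speed_of_sound_m_s=343.0):
    """Tube length (m) whose lowest resonance lies at the given frequency."""
    return speed_of_sound_m_s / (4.0 * frequency_hz)


# Hypothetical example: a 25 cm tube.
print(quarter_wave_frequency(0.25))
```

The two functions are inverses of each other, so the second can be used to estimate how long a tube would have to be for a desired frequency.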

Helmholtz used his resonators not only for sound analysis, but also for the synthesis of vowels. For this purpose, he first had to analyze the resonances of the vocal tract for different vowels. He did this by means of a set of tuning forks, which he placed and excited directly in front of his open mouth while he silently articulated the different vowels. When the frequency of a tuning fork was close to a resonance of the vocal tract, the resulting sound became much louder than for the other frequencies. For each of the vowels /u/, /o/, and /a/, he was only able to detect a single resonance of the vocal tract, at the frequencies 175 Hz (note f), 494 Hz (note b’) and 988 Hz (note b’’), respectively. For each of the other investigated German vowels, he detected two resonances. The single resonances detected for /u/, /o/ and /a/ probably correspond to the clusters of the nearby first and second resonances of the corresponding vowels. Evidently, his method of analysis was not sensitive enough to separate the two individual resonances of each of these vowels.

To synthesize the vowels /u/, /o/, and /a/ with a single resonance, he simply connected a reed pipe to Helmholtz resonators tuned to the corresponding frequencies. For the vowels with two resonances, he selected a Helmholtz resonator for one of the resonances and attached a 6-10 cm long glass tube to the outer opening of the resonator to create the second resonance. These experiments showed that Helmholtz had surprising insight into the source-filter principle of speech production, which was fully elaborated by Gunnar Fant and others 100 years later.
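The pairing of note names and frequencies given above can be checked with a short calculation. The sketch below assumes twelve-tone equal temperament with a' = 440 Hz and reads the note names in Helmholtz pitch notation (f = F3, b’ = B4, b’’ = B5). Both are assumptions made for illustration; the article does not state which tuning reference the quoted values are based on.

```python
A4_HZ = 440.0  # assumed reference pitch (a' in Helmholtz notation)

# Distance in semitones from a' (A4) for the three notes named in the text.
SEMITONES_FROM_A4 = {
    "f": -16,    # F3: one octave (12) plus a major third (4) below a'
    "b'": 2,     # B4: a whole tone above a'
    "b''": 14,   # B5: one octave above b'
}


def equal_tempered_frequency(semitones_from_a4, a4_hz=A4_HZ):
    """Frequency (Hz) of a note in twelve-tone equal temperament."""
    return a4_hz * 2.0 ** (semitones_from_a4 / 12.0)


rounded_hz = {note: round(equal_tempered_frequency(steps))
              for note, steps in SEMITONES_FROM_A4.items()}
print(rounded_hz)  # {'f': 175, "b'": 494, "b''": 988}
```

Under these assumptions the rounded values agree with the 175 Hz, 494 Hz and 988 Hz quoted for /u/, /o/ and /a/.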

Peter Birkholz


3-1-8 INTERSPEECH 2015 Update (December 2014)

Updates from INTERSPEECH 2015

Dear colleague,

Interspeech 2015 in Dresden is approaching at an increasing pace, and the entire team of organizers is trying to ensure that you will get a conference which meets all, and hopefully surpasses some, of your expectations. Regarding the usual program of oral and poster sessions, special sessions and challenges, keynotes, tutorials and satellite workshops, the responsible team is working hard to ensure that you will get a program which is not only of respectable breadth and depth, but which also tackles a couple of innovative topics, some of them centered around the special topic of the conference “Speech beyond speech: Towards a better understanding of our most important biosignal”, some of them also addressing other emergent topics.

We would particularly like to draw your attention to the approaching deadlines:
- 30 Nov 2014: Special sessions submission deadline (passed)
- 15 Dec 2014: Notification of pre-selected special sessions
- 20 Mar 2015: Tutorial submission deadline
- 20 Mar 2015: Paper submission deadline (not extensible)
- 17 Apr 2015: Show and tell paper submission deadline
Calls

In addition to regular papers, we will also experiment with a virtual attendance format for persons who are – mainly for visa or health reasons – not able to come to Dresden to present their paper. For these persons, a limited number of technology-equipped poster boards will be available where online presentations can be held. The number of virtual attendance slots is strictly limited (thus potentially leading to a lower acceptance rate). The corresponding papers have to pass the normal review process, but the deadline will most probably be around 14 days before the normal paper submission deadline. More details on this format will be announced soon.

In the upcoming months, we will keep you updated via this thread, and we will present some historical instruments and techniques related to speech technology which nicely illustrate that Dresden has a rich history in speech science and technology. Interspeech 2015 will hopefully contribute to this history with the latest scientific and technological advances. The entire organizing team is looking forward to welcoming you in Dresden.

On behalf of the organizing team,

Sebastian Möller (General Chair)


3-1-9 INTERSPEECH 2015 Update (February 2015)

+++ INTERSPEECH 2015 – February Update +++

Dear colleagues,

The preparations for Interspeech 2015 in Dresden are running at an increasing pace, and I got the impression that we have a very active contribution from the community this year. Bernd Möbius and Elmar Nöth, our TCP Chairs, have set up a comprehensive and balanced group of Area Chairs for the new areas we have agreed upon with ISCA, and which will soon be published on our website. The ten preliminarily accepted Special Sessions and Challenges are active in collecting contributions; as an example, the session “Advanced Crowdsourcing for Speech and Beyond” has received 17 requests for research funds, which will now be evaluated according to their fit to the special session topic. And our sponsorship, industry and exhibition chairs, Tim Fingscheidt, Claudia Pohlink, Jimmy Kunzmann and Reinhold Häb-Umbach, are actively soliciting sponsoring money to make the event most affordable for you. 

The 2nd Call for Papers is out (deadline March 20):
http://interspeech2015.org/calls/call-for-papers/

In addition there is a special Call for Papers with Virtual Presentation which solicits contributions to this special format we will experiment with at this year’s Interspeech for the first time, and which will be limited to exceptional cases which otherwise would not be able to participate:
http://interspeech2015.org/calls/call-for-papers-with-virtual-presentation/

In addition, there is still the option to submit proposals for Tutorials (deadline March 20) and Show and Tell contributions (deadline April 17):
http://interspeech2015.org/calls/call-for-tutorials/
http://interspeech2015.org/calls/call-for-show-and-tell/

All further information can be found on our Website which Tim Polzehl is eager to keep updated.

For automatically receiving continuous updates, we recommend that you follow us on Twitter (@interspeech2015), or that you use social channels such as LinkedIn or Facebook. And: Please do not delete your Interspeech 2014 App, it will automatically receive an update for Interspeech 2015.

Finally, Dresden is also polishing her historical charm, and for Interspeech attendees the most important aspect of this might be the second contribution to our historical series, which this time is dedicated to the world’s first successful attempt at a mechanical speech synthesiser.

On behalf of the organizing team,

Sebastian Möller (General Chair),
Oliver Jokisch (Publicity Chair)



A View from Dresden onto the History of Speech Communication

Part 2: Von Kempelen's 'Sprachmaschine' and the beginning of speech synthesis

Complete article including figures available at: http://interspeech2015.org/conference/historical-review/

 

The speaking machine of Wolfgang von Kempelen (1734-1804) can be considered the first successful attempt at a mechanical speech synthesiser. The Austrian-Hungarian engineer is still famous for his 'chess turk', but it was his 'Sprachmaschine' that can count as a milestone in (speech) technology. In his book 'Mechanismus der menschlichen Sprache nebst der Beschreibung einer sprechenden Maschine' (published 1791, no English translation yet) he described the function of the machine, which was intended to give a voice to deaf people. Contemporary personalities like Goethe confirmed the authenticity of a child's voice when the speaking machine was played.

 

How does the machine work?

The machine consists of a bellows that is connected by a tube to a wooden wind chest. On the other side of the wind chest, a round wooden block represents the interface to an open rubber funnel (as the vocal tract). In the wind chest there are two modified recorders to produce the fricatives [s] and [S]. The voice generator is located inside the wooden block. The artificial voice is generated with the help of a reed pipe borrowed from the pipe organ. It has an ivory reed vibrating against a hollow wooden shallot (like in a clarinet). The trained human operator plays the machine like a musical instrument. The right elbow controls the air pressure by pressing on the bellows, two fingers of the right hand close or open the access for stops and nasals, and two other fingers of the right hand operate the fricatives. Vowels are formed by the palm of the left hand in different ways.

 

Replicas

Apart from parts of one of the originals that are hosted at the Deutsches Museum in Munich, there are several reconstructions based on Kempelen's quite detailed descriptions. The replicas built in Budapest, Vienna, York and Saarbrücken allow a lively demonstration of the mechanical generation of speech as well as its acoustic analysis, and also perception tests with today's listeners. Interestingly, the art of constructing artificial voices led to the profession of 'voice makers' in Eastern-German Thuringia (more information in one of the next newsletters). Original products of the Thuringian 'Stimmenmacher' as well as one of the replicas located at TU Dresden are on display at the HAPS (Historische Akustisch-Phonetische Sammlung), available for ears, eyes (and hands) at the re-opening of the HAPS on 4 Sept, which is also the start of the Interspeech satellite Workshop on The History of Speech Communication Research (HSCR 2015).

 

Jürgen Trouvain and Fabian Brackhane


3-1-10 INTERSPEECH 2015 Update (January 2015)

 

+++ INTERSPEECH 2015 Update – and a look back! +++

Dear colleagues,

The regular paper deadline for Interspeech 2015 in Dresden is only 2 months away, so we hope that you are preparing your submissions. We have received an impressive number of Special Session and Challenges proposals. The list of preliminarily accepted proposals, together with more information on each session and its organizers, can be found under http://interspeech2015.org/events/special-sessions/. If your interests fall within the area of one of these Special Sessions or Challenges, consider submitting there.

Please note that March 20 is – apart from the general paper deadline – also the deadline for tutorial proposals. More details on tutorial proposal submissions can be found under http://interspeech2015.org/calls/call-for-tutorials/. The deadline for Show & Tell papers is then April 17.

The current list of Satellite Workshops will be updated continuously and can be found at http://interspeech2015.org/events/workshops/.

From now on, we will take a monthly look back at the history of speech communication and technology in Dresden.

On behalf of the organizing team,
Sebastian Möller (General Chair)


A View from Dresden onto the History of Speech Communication

Part 1: The historic acoustic-phonetic collection

Information Technology at the TU Dresden goes back to Heinrich Barkhausen (1881–1956), the 'father of the electron valve', who taught there from 1911 to 1953. Speech research in a narrower sense started with the development of a vocoder in the 1950s. Walter Tscheschner (1927–2004) performed his extensive investigations on the speech signal using components of the vocoder. In 1969, a scientific unit for Communication and Measurement was founded in Dresden. It is the main root of the present Institute of Acoustics and Speech Communication. W. Tscheschner was appointed Professor of Speech Communication and initiated research in speech synthesis and recognition, which continues today.

Numerous objects from the history of Speech Communication in Dresden, but also from other parts of Germany, are preserved in the historic acoustic-phonetic collection of the TU Dresden. Until the opening of Interspeech 2015, we will present interesting exhibits from the collection in this newsletter each month. Today, we give an introduction.

The historic acoustic-phonetic collection of the TU Dresden consists of three parts:

• Objects that illustrate the development of acoustics and speech technology at the TU Dresden. The most interesting devices are speech synthesizers based on various technologies.

• Objects illustrating the development of experimental phonetics from 1900 until the introduction of the computer. The items of this part were collected by D. Mehnert from different phonetics laboratories and rehabilitation units throughout Germany.

• Objects which were formerly collected at the Phonetics Institute of Hamburg University. This important collection, which was founded by Giulio Panconcelli-Calzia, was transferred to Dresden under a contract in 2005, following the closure of the Hamburg institute.

The collection is presented in the Barkhausenbau on the main campus of the TU Dresden. It is currently moving to new rooms that are better suited to its presentation. The newly installed collection will be re-opened on the occasion of Interspeech 2015.

For this purpose, we cordially invite you to a workshop on the history of speech communication, called HSCR2015, which will be held as a satellite event of Interspeech 2015 on September 4-5, 2015, in the Technical Museum of the City of Dresden. It is organized by the special interest group (SIG) on 'The History of Speech Communication Sciences', which is supported by the International Speech Communication Association (ISCA) and the International Phonetic Association (IPA). More information on the workshop is available at http://www.sig-hist.org/.

Rüdiger Hoffmann (Local Chair)


 






3-1-11 INTERSPEECH 2015 Update (March 2015)

A View from Dresden onto the History of Speech Communication

Part 3: Voices for toys - First commercial spin-offs in speech synthesis

 

Complete article including figures available at:
http://interspeech2015.org/conference/historical-review/

 

When Wolfgang von Kempelen died in 1804, his automata (including the speaking machine) came into the possession of Johann Nepomuk Maelzel (1772 – 1838), who demonstrated them on many tours of Europe and America. He was a clever mechanic and applied Kempelen’s ideas to a mechanical voice for puppets, which could pronounce “Mama” and “Papa”. He received a patent for it in 1824 (Figure 1).

 

The idea of speaking puppets and toys was pursued further mainly in the area of Sonneberg in Thuringia, Germany. This small town was the world capital of puppet and toy manufacturing in the 19th century. The voices consist of a bellows, a metal tongue for voicing, and a resonator. There are three reasons why we regard the mechanical voices as a milestone in the development of speech technology:

 

1. The mechanical voices established the first commercial spin-off in speech research. The toy manufacturers in Sonneberg recognized the importance of Maelzel’s invention and produced speaking puppets from 1852 onward. The “Stimmenmacher” (voice maker) was a specific profession, and in 1911 there were eight manufacturers of human and animal voices in Sonneberg alone. The most important of them was Hugo Hölbe (1844 – 1931), who developed mechanisms which were able to speak not only Mama/Papa (Figure 2), but also words like Emma, Hurrah, etc.

 

2. The mechanical voices were used in the first book with multimodal properties. The bookseller Theodor Brand from Sonneberg received a patent for his “speaking picture book” in 1878. This book shows different animals. Pulling the knob that corresponds to a picture activates the voice of that animal (Figure 3). The picture book was published in several languages and was a huge commercial success all over the world.

 

3. The mechanical voices were the first attempt to support the rehabilitation of hard-of-hearing people by means of speech technology. The German otologist Johannes Kessel (1839 – 1907) demonstrated Hölbe’s voices as a training tool in speech therapy at a conference in 1899. The quality of this kind of synthetic speech proved insufficient for this purpose, however.

 

The samples from Kessel came to the Phonetic Laboratory of Panconcelli-Calzia in Hamburg, who mentioned them in his historic essays. Thanks to the transfer of the phonetic exhibits from Hamburg to Dresden in 2005, the mechanical voices can now be seen in the HAPS at the TU Dresden.

 

 

Rüdiger Hoffmann

Photographs Copyright TU Dresden / HAPS

 


3-1-12 INTERSPEECH 2015 Update (May 2015)

A View from Dresden onto the History of Speech Communication

 

Part 5: Artificial vocal fold models – The investigation of phonation

Complete article including figures available at:
http://interspeech2015.org/conference/historical-review/


The investigation of the larynx was (and is) one of the predominant topics in phonetic research. In the early days of experimental phonetics, mechanical models of the larynx or, at least, of the vocal folds were used according to the paradigm of analysis-by-synthesis.

 

The first models used flat parallel elastic membranes or other simple elements to simulate the function of the vocal folds (Fig. 1). However, the geometry of these models was rather different from that of real human vocal folds. Substantial progress was made by Franz Wethlo (1877 – 1960), who worked at the university in Berlin as an educationalist and special pedagogue. He realized that the vocal folds should not be modelled by flat parallel membranes, but that their three-dimensional shape should be taken into account. Hence, he proposed a three-dimensional model, which was formed by two elastic cushions (Fig. 2). The cushions were filled with pressurized air, the pressure of which could be varied for experimental purposes, in particular to adjust the tension of the artificial vocal folds. The whole model was known as the “Polsterpfeife” (cushion pipe). Wethlo described it in 1913.

 

The historical collection (HAPS) at the TU Dresden owns several of Wethlo’s cushion pipes in different sizes, modelling male, female, and children’s voices. A team from the TU Dresden repeated Wethlo’s experiments with his original equipment in 2004. For this purpose, the cushion pipes were connected to a historical “vocal tract model”. This vocal tract model was actually a stack of wooden plates with holes of different diameters to model the varying cross-sectional area of the vocal tract between the glottis and the lips (Fig. 3). This “configurable” vocal tract model came to the HAPS collection from the Institute of Phonetics in Cologne. The artificial vocal folds were used to excite vocal tract configurations for the vowels /a/, /i/ and /u/, but listening experiments showed that these artificial vowels were rather difficult to discriminate.

 

Today, there is renewed interest in mechanical models of the vocal folds. Such models can be used in physical 3D robotic models of the speech apparatus (e.g., the Waseda talker series of talking robots: http://www.takanishi.mech.waseda.ac.jp/top/research/voice/), to evaluate the accuracy of low-dimensional digital vocal fold models (e.g., http://scitation.aip.org/content/asa/journal/jasa/121/1/10.1121/1.2384846), or to examine pathological voice production.

 

 

Rüdiger Hoffmann & Peter Birkholz

 




© Copyright 2024 - ISCA International Speech Communication Association - All right reserved.

Powered by ISCA