ISCApad #202 |
Monday, April 13, 2015 by Chris Wellekens |
3-1-1 | (2015-09-06) Call for Satellite Workshops of INTERSPEECH 2015, Dresden, Germany
INTERSPEECH 2015 will be held in the beautiful city of Dresden, Germany, on September 6-10, 2015.
The theme is 'Speech beyond Speech - Towards a Better Understanding of the Most Important
Biosignal'. The Organizing Committee of INTERSPEECH 2015 is now inviting proposals for
satellite workshops, which will be held in proximity to the main conference.
The Organizing Committee will work to facilitate the organization of such satellite workshops,
to stimulate discussion in research areas related to speech and language, at locations in Central
Europe, and around the same time as INTERSPEECH. We are particularly looking forward to
proposals from neighboring countries. If you are interested in organizing a satellite workshop,
or would like a planned event to be listed as an official satellite event, please contact the organizers
or the Satellite Workshop Chair at fmetze@cs.cmu.edu. The Satellite Workshop coordinator along
with the INTERSPEECH team will help to connect (potential) workshop organizers with local
contacts in Germany, if needed, and will try to be helpful with logistics such as payment, publicity,
and coordination with ISCA or other events. Proposals should include:
* workshop name and acronym
* organizers' name and contact info
* website (if already known)
* date and proposed location of the workshop
* estimated number of participants
* a short description of the motivation for the workshop
* an outline of the program and invited speakers
* a description of the submission process (e.g. deadlines, target acceptance rate)
* a list of the scientific committee members
Proposals for satellite workshops should be submitted by email to workshops@interspeech2015.org
by August 31st, 2014. We strongly recommend that organizers also apply for
ISCA approval/sponsorship, which will greatly facilitate acceptance as an INTERSPEECH satellite
event. We plan to notify proposers no later than October 30, 2014. If you have any questions about
whether a potential event would be a good candidate for an INTERSPEECH 2015 satellite workshop,
feel free to contact the INTERSPEECH 2015 Satellite Workshops Chair.
Sincerely,
Florian Metze
Satellite Workshops Chair fmetze@cs.cmu.edu
| ||||||||||||||
3-1-2 | (2015-09-06) 3rd and Final Call for INTERSPEECH 2015, Sep 6-10, Dresden, Germany
INTERSPEECH is the world’s largest and most comprehensive conference on the science and technology of spoken language processing. INTERSPEECH conferences emphasize interdisciplinary approaches addressing all aspects of speech science and technology, ranging from basic theories to applications.
INTERSPEECH 2015 in Dresden (Germany) will be organized around the theme Speech beyond Speech: Towards a Better Understanding of the Most Important Biosignal, which acknowledges the fact that speech is the most important biosignal humans can produce and perceive. It is evident that not all characteristics of speech are already fully understood. We therefore encourage contributions that analyze and model speech as a biosignal in a broad understanding, e.g. for extracting information about the speaker, for identifying processes leading to speech production, or for generating speech signals with specific bio-characteristics. Contributions to all other areas of speech science and technology are also welcome.
Important Dates
---------------------
20 March 2015 Paper: submission deadline
20 March 2015 Tutorial: submission deadline
17 April 2015 Show and Tell: submission deadline
10 June 2015 Paper: camera-ready
10 June 2015 Show and Tell: camera-ready
20 June 2015 Early registration deadline
6-10 Sep 2015 Conference in Dresden, Germany
INTERSPEECH 2015 hosts a wide range of Events, e.g. Special Sessions and Workshops
---------------------
10 Special Sessions
- Active Perception in Human and Machine Speech Communication
- Biosignal-based Spoken Communication
- Interspeech 2015 Computational Paralinguistics Challenge (ComParE): Degree of Nativeness, Parkinson’s & Eating Condition
- Automatic Speaker Verification Spoofing and Countermeasures
- Zero Resource Speech Technologies: Unsupervised Discovery of Linguistic Units
- Robust Speech Processing using Observation Uncertainty and Uncertainty Propagation
- Speech Science in End User Applications
- Synergies of Speech and Multimedia Technologies
- Speech and Language Processing of Children’s Speech
- Advanced Crowdsourcing for Speech and Beyond
10 Satellite Workshops
- Errors by Humans and Machines in multimedia, multimodal and multilingual data processing (ERRARE)
- Speech and Language Processing for Assistive Technologies (SLPAT)
- Workshop on Speech and Language Technology for Education (SLaTE)
- International Workshop on the History of Speech Communication Research (HSCR)
- Workshop on Speech and Audio Technologies for the Digital Humanities (SAT4DH)
- Blizzard Challenge Workshop
- Special Interest Group on Discourse and Dialogue (SIGDIAL)
- International Workshop on Speech Robotics (IWSR)
- The 1st Joint Conference on Facial Analysis, Animation and Audio-Visual Speech Processing (FAAVSP)
- MediaEval Benchmarking Initiative for Multimedia Evaluation (MediaEval)
3 Related Events
- Speech Technology and Human-Computer Dialogue (SpeD)
- The 1st Joint Conference on Facial Analysis, Animation and Audio-Visual Speech Processing (FAAVSP)
- International Conference on Text, Speech and Dialogue (TSD)
Visit www.interspeech2015.org
Dr. Tim Polzehl
Quality and Usability Lab, Telekom Innovation Laboratories, SoftwareCampus
Technische Universität Berlin
E-mail: tim.polzehl@telekom.de
| ||||||||||||||
3-1-3 | (2015-09-06) Calls for INTERSPEECH 2015, Dresden, Germany
INTERSPEECH is the world’s largest and most comprehensive conference on the science and technology of spoken language processing. INTERSPEECH conferences emphasize interdisciplinary approaches addressing all aspects of speech science and technology, ranging from basic theories to applications.
INTERSPEECH 2015 will be organized around the theme Speech beyond Speech: Towards a Better Understanding of the Most Important Biosignal, which acknowledges the fact that speech is the most important biosignal humans can produce and perceive. It is evident that not all characteristics of speech are already fully understood. We therefore encourage contributions that analyze and model speech as a biosignal in a broad understanding, e.g. for extracting information about the speaker, for identifying processes leading to speech production, or for generating speech signals with specific bio-characteristics. Contributions to all other areas of speech science and technology are also welcome.
Important Dates
30 Nov 2014 Special sessions: paper submission deadline
Call for Papers
The Organizing Committee of INTERSPEECH 2015 is now inviting paper submissions. 15.09.2014 – Details on paper submission and topics can be found on our CfP page.
Call for Special Sessions
Submissions of Special Sessions & Challenges proposals are encouraged for INTERSPEECH 2015, covering interdisciplinary topics and/or important new emerging areas of interest related to the main conference topics. 16.10.2014 – For more information please refer to our SS-CfP page.
Call for Show and Tell
The Organizing Committee of INTERSPEECH 2015 is now inviting contributions to the Show and Tell special event. Please refer to our Show and Tell page for more details. 17.04.2015 – Full submission deadline; proposals should be submitted by email to is2015showntell@gmx.de.
Call for Tutorials
The Organizing Committee of INTERSPEECH 2015 is now inviting proposals for tutorials, which will be held on September 6 (first day of the conference) at the main venue, the Maritim International Congress Center (ICD). Please refer to the conference website for more details. 15.10.2014 – Proposals for tutorials should be submitted by email to Alexander Raake.
Call for Satellite Workshops
The Organizing Committee of INTERSPEECH 2015 is now inviting proposals for satellite workshops, which will be held in proximity to the main conference. For more details, please refer to http://www.signalprocessingsociety.org/technical-committees/list/sl-tc/spl-nl/2014-05/2014-05-is2015/ Proposals for satellite workshops should be submitted by email to Florian Metze.
| ||||||||||||||
3-1-4 | (2015-09-06) Interspeech 2015 Call for tutorials
General conference information – INTERSPEECH 2015
INTERSPEECH is the world's largest and most comprehensive conference on the science and technology of spoken language processing. INTERSPEECH conferences emphasize interdisciplinary approaches addressing all aspects of speech science and technology, ranging from basic theories to applications.
INTERSPEECH 2015 will be organized around the theme Speech beyond Speech: Towards a Better Understanding of the Most Important Biosignal, which acknowledges the fact that speech is the most important biosignal humans can produce and perceive. It is evident that not all characteristics of speech are already fully understood. We therefore encourage contributions that analyze and model speech as a biosignal in a broad understanding, e.g. for extracting information about the speaker, for identifying processes leading to speech production, or for generating speech signals with specific bio-characteristics. Contributions to all other areas of speech science and technology are also welcome.
Call for tutorials
INTERSPEECH conferences are attended by researchers with a long-term track record in speech sciences and technology, as well as by early-stage researchers or researchers interested in a new domain within the INTERSPEECH areas. With the current edition of INTERSPEECH, we seek to specifically reflect different audiences in the choice of tutorials, and it will be clearly indicated what prior knowledge the audience should have. As a consequence, we encourage proposals for tutorials addressing introductory topics or advanced topics in an introductory style, and tutorials targeting experienced researchers who want to dig deeper into a given new topic. Tutorials may introduce an emerging area of speech-related research, or present an overview of an important, more established area. The tutorials can be of either three hours or six hours duration. The tutorials are expected to cover the state of the art of the proposed topics rather than to primarily present the authors' own research.
INTERSPEECH 2015 will be organized around the theme Speech beyond Speech: Towards a Better Understanding of the Most Important Biosignal, which acknowledges the fact that speech is the most important biosignal humans can produce and perceive. As a consequence, we especially encourage tutorials from fields that are directly related to this theme. Of course, the call is not restricted to the conference theme and is open to any kind of tutorial related to speech science and technology.
Tutorial Proposal Submission
Proposals for the INTERSPEECH 2015 tutorials should be up to 4 pages long. Proposal submissions must conform to the format defined in the tutorial proposal preparation guidelines and provided in the authors’ kit on the INTERSPEECH 2015 website, along with the Call for Tutorials. The tutorial proposal submission deadline is Friday, 20 March 2015, at midnight (11:59 pm) GMT. There will be no extension of deadlines.
The working language of the conference is English, and proposals must be written in English, too.
All proposals should be submitted to tutorials@interspeech2015.org.
We look forward to welcoming you to INTERSPEECH 2015 in Dresden!
Alexander Raake
Assessment of IP-based Applications
TU Berlin & Deutsche Telekom/T-Labs (Research & Innovation)
+49391580239574 (Fax)
E-Mail: alexander.raake@telekom.de
| ||||||||||||||
3-1-5 | (2015-09-06) CfP INTERSPEECH 2015 Special Session on INTERSPEECH 2015
| ||||||||||||||
3-1-6 | (2016) INTERSPEECH 2016, San Francisco, CA, USA
Interspeech 2016 will take place on September 8-12, 2016, in San Francisco, CA, USA. The General Chair is Nelson Morgan. You can already enjoy the pictures on the cover page of its tentative website: http://www.interspeech2016.org
| ||||||||||||||
3-1-7 | INTERSPEECH 2015 Update (April 2015)
A View from Dresden onto the History of Speech Communication
Part 4: Helmholtz Resonators
Complete article including figures available at: http://interspeech2015.org/conference/historical-review/
Hermann von Helmholtz (1821–1894) was a German physician and physicist who made important contributions in many areas of science. One of these areas was acoustics, where he published the famous book 'On the sensations of tone as a physiological basis for the theory of music' in 1863. There he described his invention of a special type of resonator, which is now known as the Helmholtz resonator. These resonators were devised as highly sensitive devices to identify the harmonic components (partial tones) of sounds and allowed significant advances in the acoustic analysis of vowel sounds and musical instruments.
Before the invention of Helmholtz resonators, strong partial tones in a sound wave were typically identified by very thin, elastic membranes that were stretched over circular rings, similar to drums. Such a membrane has a certain resonance frequency (in fact multiple frequencies) that depends on its material, tension, and radius. If the sound field around the membrane contains energy at this frequency, the membrane is excited and starts to oscillate. The tiny amplitudes of this oscillation can be visually detected when fine-grained sand is distributed over its surface. When the membrane is excited at its lowest resonance frequency, the sand accumulates at the rim of the membrane; when higher-order modes are excited, it accumulates along specific lines on its surface. With a set of membranes tuned to different frequencies, a rough spectral analysis can be conducted. It was also known that the sensitivity of this method could be improved when the membrane was stretched over the (removed) bottom of a bottle with an open neck end.
The key idea of Helmholtz was to replace this bottle by a hollow sphere with an open neck at one 'end' and another small pointed opening at the opposite 'end'.
The pointed opening had to be inserted into one ear canal. In this way, the eardrum was excited similarly to the membrane with the sand in the previous technique. However, due to the high sensitivity of the ear, partial tones could be detected much more easily. A further advantage of these resonators was that their resonance frequencies can be expressed analytically in terms of the volume of the sphere and the diameter and the length of the neck. Hence these resonators became important experimental tools for subjective sound analysis in the late 19th century and the early 20th century.
The HAPS (Historische Akustisch-Phonetische Sammlung) at the TU Dresden contains three sets of Helmholtz resonators. The biggest of these sets contains 11 resonators, which are tuned to frequencies between 128 Hz and 768 Hz. The HAPS also contains a related kind of resonator that was invented by Schaefer (1902). These resonators are tubes with one open end and one closed end. The closed end also has a small pointed opening that has to be inserted into the ear canal. These resonators respond maximally to frequencies whose wavelength is four times the length of the tube.
Helmholtz used his resonators not only for sound analysis, but also for the synthesis of vowels. To do so, he first had to analyze the resonances of the vocal tract for different vowels. He did this by means of a set of tuning forks, which he placed and excited directly in front of his open mouth while he silently articulated the different vowels. When the frequency of a tuning fork was close to a resonance of the vocal tract, the resulting sound became much louder than for the other frequencies. For each of the vowels /u/, /o/, and /a/, he was only able to detect a single resonance of the vocal tract, at the frequencies 175 Hz (note f), 494 Hz (note b’) and 988 Hz (note b’’), respectively. For each of the other investigated German vowels, he detected two resonances.
The single resonances detected for /u/, /o/ and /a/ probably correspond to the clusters of the nearby first and second resonances of the corresponding vowels. Evidently, his method of analysis was not sensitive enough to separate the two individual resonances of each of these vowels. To synthesize the vowels /u/, /o/, and /a/ with a single resonance, he simply connected a reed pipe to Helmholtz resonators tuned to the corresponding frequencies. For the vowels with two resonances, he selected a Helmholtz resonator for one of the resonances and attached a 6-10 cm long glass tube to the outer opening of the resonator to create the second resonance. These experiments showed that Helmholtz had surprising insight into the source-filter principle of speech production, which was fully elaborated by Gunnar Fant and others 100 years later.
Peter Birkholz
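The article above states that the resonance frequency of a Helmholtz resonator can be expressed analytically from the sphere volume and the neck diameter and length, and that Schaefer's tubes respond when the wavelength is four times the tube length. As a rough numerical companion, here is a minimal Python sketch of both relations. It uses the common textbook lumped-element approximation f = c/(2π)·sqrt(A/(V·L)), which is an assumption and is not taken from the article. The speed of sound, the function names and the optional end correction are likewise illustrative, and no dimensions of the actual HAPS resonators are implied.

```python
import math

# Assumed speed of sound in air at roughly 20 degrees Celsius (not from the article).
SPEED_OF_SOUND = 343.0  # m/s


def helmholtz_frequency(volume_m3, neck_diameter_m, neck_length_m,
                        end_correction_m=0.0):
    """Estimate the resonance frequency (Hz) of a Helmholtz resonator.

    Textbook lumped-element approximation: f = c / (2*pi) * sqrt(A / (V * L)),
    where A is the neck cross-section, V the cavity volume and L the
    effective neck length. end_correction_m is a hypothetical extra length
    for the air moving just outside the neck; real resonators need one.
    """
    if volume_m3 <= 0 or neck_diameter_m <= 0 or neck_length_m <= 0:
        raise ValueError("volume, neck diameter and neck length must be positive")
    neck_area = math.pi * (neck_diameter_m / 2.0) ** 2
    effective_length = neck_length_m + end_correction_m
    return SPEED_OF_SOUND / (2.0 * math.pi) * math.sqrt(
        neck_area / (volume_m3 * effective_length))


def quarter_wave_tube_length(frequency_hz):
    """Length (m) of a tube, open at one end and closed at the other,
    whose lowest resonance is frequency_hz: wavelength = 4 * length."""
    if frequency_hz <= 0:
        raise ValueError("frequency must be positive")
    return SPEED_OF_SOUND / (4.0 * frequency_hz)


if __name__ == "__main__":
    # Tube lengths for the lowest and highest frequencies quoted for the HAPS set.
    for f in (128.0, 768.0):
        print(f"{f:.0f} Hz -> tube length {quarter_wave_tube_length(f):.3f} m")
    # Purely illustrative geometry: 1 litre cavity, 2 cm neck diameter, 5 cm neck.
    print(f"{helmholtz_frequency(1e-3, 0.02, 0.05):.1f} Hz")
```

Under the assumed speed of sound, a Schaefer-type tube for 128 Hz would be roughly 0.67 m long. The Helmholtz estimate shows the expected trend that a larger cavity or a longer neck lowers the resonance frequency.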
| ||||||||||||||
3-1-8 | INTERSPEECH 2015 Update (December 2014)
Updates from INTERSPEECH 2015
Dear colleague,
Interspeech 2015 in Dresden is approaching at an increasing pace, and the entire team of organizers is trying to ensure that you will get a conference which meets all, and hopefully surpasses some, of your expectations. Regarding the usual program of oral and poster sessions, special sessions and challenges, keynotes, tutorials and satellite workshops, the responsible team is working hard to ensure that you will get a program which is not only of respectable breadth and depth, but which also tackles a couple of innovative topics, some of them centered around the special topic of the conference “Speech beyond speech: Towards a better understanding of our most important biosignal”, some of them also addressing other emergent topics.
We would particularly like to draw your attention to the approaching deadlines. In addition to regular papers, we will also experiment with a virtual attendance format for persons who are, mainly for visa or health reasons, not able to come to Dresden to present their paper. For these persons, a limited number of technology-equipped poster boards will be available where online presentations can be held. The number of virtual attendance slots is strictly limited (thus potentially leading to a lower acceptance rate). The corresponding papers have to pass the normal review process, but the deadline will most probably be around 14 days before the normal paper submission deadline. More details on this format will be announced soon.
In the upcoming months, we will keep you updated via this thread, and we will present some historical instruments and techniques related to speech technology which nicely illustrate that Dresden has a rich history in speech science and technology. Interspeech 2015 will hopefully contribute to this history with the latest scientific and technological advances.
The entire organizing team is looking forward to welcoming you in Dresden. On behalf of the organizing team,
| ||||||||||||||
3-1-9 | INTERSPEECH 2015 Update (February 2015)
+++ INTERSPEECH 2015 – February Update +++
Dear colleagues,
The preparations for Interspeech 2015 in Dresden are running at an increasing pace, and I have the impression that we have a very active contribution from the community this year. Bernd Möbius and Elmar Nöth, our TCP Chairs, have set up a comprehensive and balanced group of Area Chairs for the new areas we have agreed upon with ISCA, and which will soon be published on our website. The ten preliminarily accepted Special Sessions and Challenges are active in collecting contributions; as an example, the session “Advanced Crowdsourcing for Speech and Beyond” has received 17 requests for research funds, which will now be evaluated according to their fit to the special session topic. And our sponsorship, industry and exhibition chairs, Tim Fingscheidt, Claudia Pohlink, Jimmy Kunzmann and Reinhold Häb-Umbach, are actively soliciting sponsorship funds to make the event as affordable as possible for you.
The 2nd Call for Papers is out (deadline March 20). In addition, there is a special Call for Papers with Virtual Presentation, which solicits contributions to this special format that we will experiment with at this year’s Interspeech for the first time, and which will be limited to exceptional cases of authors who would otherwise not be able to participate. There is also still the option to submit proposals for Tutorials (deadline March 20) and Show and Tell contributions (deadline April 17).
All further information can be found on our website, which Tim Polzehl is eager to keep updated. To receive continuous updates automatically, we recommend that you follow us on Twitter (@interspeech2015), or that you use social channels such as LinkedIn or Facebook. And: please do not delete your Interspeech 2014 App; it will automatically receive an update for Interspeech 2015.
Finally, Dresden is also polishing her historical charm, and for Interspeech attendees the most important aspect of this might be the second contribution to our historical series, which this time is dedicated to the world’s first successful attempt at a mechanical speech synthesiser.
On behalf of the organizing team,
A View from Dresden onto the History of Speech Communication
Complete article including figures available at: http://interspeech2015.org/conference/historical-review/
The speaking machine of Wolfgang von Kempelen (1734-1804) can be considered the first successful attempt at a mechanical speech synthesiser. The Austrian-Hungarian engineer is still famous for his 'chess turk', but it is his 'Sprachmaschine' that counts as a milestone in (speech) technology. In his book 'Mechanismus der menschlichen Sprache nebst der Beschreibung einer sprechenden Maschine' (published 1791, no English translation yet) he described the function of the machine, which was intended to give a voice to deaf people. Contemporary personalities like Goethe confirmed the authenticity of a child's voice when the speaking machine was played.
How does the machine work?
The machine consists of a bellows that is connected by a tube to a wooden wind chest. On the other side of the wind chest, a round wooden block forms the interface to an open rubber funnel (serving as the vocal tract). In the wind chest there are two modified recorders to produce the fricatives [s] and [S]. The voice generator is located inside the wooden block. The artificial voice is generated with the help of a reed pipe borrowed from the pipe organ. It has an ivory reed vibrating against a hollow wooden shallot (as in a clarinet). The trained human operator plays the machine like a musical instrument. The right elbow controls the air pressure by pressing on the bellows, two fingers of the right hand close or open the access for stops and nasals, and two other fingers of the right hand control the fricatives. Vowels are formed by the palm of the left hand in different ways.
Replicas
Apart from parts of one of the originals, which are held at the Deutsches Museum in Munich, there are several reconstructions based on Kempelen's quite detailed descriptions. The replicas built in Budapest, Vienna, York and Saarbrücken allow a lively demonstration of the mechanical generation of speech as well as its acoustic analysis and perception tests with today's listeners. Interestingly, the art of constructing artificial voices led to the profession of 'voice makers' in Thuringia in eastern Germany (more information in one of the next newsletters). Original products of the Thuringian 'Stimmenmacher' as well as one of the replicas located at TU Dresden are on display at the HAPS (Historische Akustisch-Phonetische Sammlung), available for ears, eyes (and hands) at the re-opening of the HAPS on 4 September, which is also the start of the Interspeech satellite Workshop on The History of Speech Communication Research (HSCR 2015).
Jürgen Trouvain and Fabian Brackhane
| ||||||||||||||
3-1-10 | INTERSPEECH 2015 Update (January 2015)
+++ INTERSPEECH 2015 Update – and a look back! +++
Dear colleagues,
The regular paper deadline for Interspeech 2015 in Dresden is only 2 months away, so we hope that you are preparing your submissions. We have received an impressive number of Special Session and Challenge proposals. The list of preliminarily accepted proposals, together with more information on each session and its organizers, can be found at http://interspeech2015.org/events/special-sessions/. If your interests fall within the area of one of these Special Sessions or Challenges, consider submitting there.
Please note that March 20 is – apart from the general paper deadline – also the deadline for tutorial proposals. More details on tutorial proposal submissions can be found at http://interspeech2015.org/calls/call-for-tutorials/. The deadline for Show & Tell papers is then April 17. The current list of Satellite Workshops will be updated successively and can be found at http://interspeech2015.org/events/workshops/.
From now on, we will take a monthly look back at the history of speech communication and technology in Dresden.
On behalf of the organizing team,
Sebastian Möller (General Chair)
A View from Dresden onto the History of Speech Communication
Part 1: The historic acoustic-phonetic collection
Information Technology at the TU Dresden goes back to Heinrich Barkhausen (1881–1956), the 'father of the electron valve', who taught from 1911 to 1953. Speech research in a narrower sense started with the development of a vocoder in the 1950s. Walter Tscheschner (1927–2004) performed his extensive investigations on the speech signal using components of the vocoder. In 1969, a scientific unit for Communication and Measurement was founded in Dresden. It is the main root of the present Institute of Acoustics and Speech Communication. W. Tscheschner was appointed Professor of Speech Communication and started research in speech synthesis and recognition, which continues today.
Numerous objects from the history of Speech Communication in Dresden, but also from other parts of Germany, are preserved at the historic acoustic-phonetic collection of the TU Dresden. Until the opening of Interspeech 2015, we will present interesting exhibits from the collection in this newsletter every month. Today, we give an introduction. The historic acoustic-phonetic collection of the TU Dresden consists of three parts:
• Objects that illustrate the development of acoustics and speech technology at the TU Dresden. The most interesting devices are speech synthesizers of various technologies.
• Objects illustrating the development of experimental phonetics from 1900 until the introduction of the computer. The items of this part were collected by D. Mehnert from different phonetics laboratories and rehabilitation units throughout Germany.
• Objects which were formerly collected at the Phonetics Institute of Hamburg University. This important collection, which was founded by Giulio Panconcelli-Calzia, was transferred to Dresden in 2005 under a contract following the closure of the Hamburg institute.
The collection is presented in the Barkhausenbau at the main campus of the TU Dresden. It is currently moving to new rooms which are more convenient for the presentation. The newly installed collection will be re-opened on the occasion of Interspeech 2015. For this purpose, we cordially invite you to a workshop on the history of speech communication, called HSCR2015, which will be held as a satellite event of Interspeech 2015 on September 4-5, 2015, in the Technical Museum of the City of Dresden. It is organized by the special interest group (SIG) on 'The History of Speech Communication Sciences', which is supported by the International Speech Communication Association (ISCA) and the International Phonetic Association (IPA). More information on the workshop is available at http://www.sig-hist.org/.
Rüdiger Hoffmann (Local Chair)
| ||||||||||||||
3-1-11 | INTERSPEECH 2015 Update (March 2015)
A View from Dresden onto the History of Speech Communication
Part 3: Voices for toys - First commercial spin-offs in speech synthesis
Complete article including figures available at:
When Wolfgang von Kempelen died in 1804, his automata (including the speaking machine) came into the possession of Johann Nepomuk Mälzel (1772–1838), who demonstrated them on many tours in Europe and America. He was a clever mechanic and applied Kempelen’s ideas in a mechanical voice for puppets, which could pronounce “Mama” and “Papa”. He received a patent on it in 1824 (Figure 1).
The idea of speaking puppets and toys was continued mainly in the area of Sonneberg in Thuringia, Germany. This small town was the world capital of manufacturing puppets and toys in the 19th century. The voices consist of a bellows, a metal tongue for voicing, and a resonator. There are three reasons why we appreciate the mechanical voices as a milestone in the development of speech technology:
1. The mechanical voices established the first commercial spin-off in speech research. The toy manufacturers in Sonneberg recognized the importance of Mälzel’s invention and produced speaking puppets from 1852 onwards. The “Stimmenmacher” (voice maker) was a specific profession, and we find eight manufacturers for human and animal voices in Sonneberg alone in 1911. The most important of them was Hugo Hölbe (1844–1931), who developed mechanisms which were able to speak not only Mama/Papa (Figure 2), but also words like Emma, Hurrah, etc.
2. The mechanical voices were applied in the first book with multimodal properties. The bookseller Theodor Brand from Sonneberg received a patent for his “speaking picture book” in 1878. This book shows different animals. Pulling a knob, which corresponds to a picture, activates the voice of the animal (Figure 3). The picture book was published in several languages and was a huge commercial success all over the world.
3. The mechanical voices are the first attempt to support the rehabilitation of hard-of-hearing people by means of speech technology. The German otologist Johannes Kessel (1839–1907) demonstrated Hölbe’s voices as a training tool in speech therapy at a conference in 1899. The quality of this kind of synthetic speech proved insufficient for this purpose, however.
The samples from Kessel came to the Phonetic Laboratory of Panconcelli-Calzia in Hamburg, who mentioned them in his historic essays. Due to the transfer of the phonetic exhibits from Hamburg to Dresden in 2005, you can visit the mechanical voices in the HAPS of the TU Dresden now.
Rüdiger Hoffmann
Photographs Copyright TU Dresden / HAPS
|