ISCApad #223 |
Saturday, January 14, 2017 by Chris Wellekens |
4-1 | Carnegie Speech
Carnegie Speech produces systems to teach people how to speak another language understandably. Some of its products include NativeAccent, SpeakIraqi, SpeakRussian, and ClimbLevel4. You can find out more at www.carnegiespeech.com. You can also read about Forbes.com awarding it a Best Breakout Idea of 2009 at: http://www.forbes.com/2009/12/21/best-breakout-ideas-2009-entrepreneurs-technology-breakout_slide_11.html
| ||
4-2 | Research in Interactive Virtual Experiences at USC CA USA REU Site: Research in Interactive Virtual Experiences --------------------------------------------------------------------
The Institute for Creative Technologies (ICT) offers a 10-week summer research program for undergraduates in interactive virtual experiences. A multidisciplinary research institute affiliated with the University of Southern California, the ICT was established in 1999 to combine leading academic researchers in computing with the creative talents of Hollywood and the video game industry. Having grown to encompass a total of 170 faculty, staff, and students in a diverse array of fields, the ICT represents a unique interdisciplinary community brought together with a core unifying mission: advancing the state-of-the-art for creating virtual reality experiences so compelling that people will react as if they were real.
Reflecting the interdisciplinary nature of ICT research, we welcome applications from students in computer science, as well as many other fields, such as psychology, art/animation, interactive media, linguistics, and communications. Undergraduates will join a team of students, research staff, and faculty in one of several labs focusing on different aspects of interactive virtual experiences. In addition to participating in seminars and social events, students will also prepare a final written report and present their projects to the rest of the institute at the end of summer research fair.
Students will receive $5000 over ten weeks, plus an additional $2800 stipend for housing and living expenses. Non-local students can also be reimbursed for travel up to $600. The ICT is located in West Los Angeles, just north of LAX and only 10 minutes from the beach.
This Research Experiences for Undergraduates (REU) site is supported by a grant from the National Science Foundation. The site is expected to begin summer 2013, pending final award issuance.
Students can apply online at: http://ict.usc.edu/reu/ Application deadline: March 31, 2013
For more information, please contact Evan Suma at reu@ict.usc.edu.
| ||
4-3 | Announcing the Master of Science in Intelligent Information Systems Carnegie Mellon University
degree designed for students who want to rapidly master advanced content-analysis, mining, and intelligent information technologies prior to beginning or resuming leadership careers in industry and government. Just over half of the curriculum consists of graduate courses. The remainder provides direct, hands-on, project-oriented experience working closely with CMU faculty to build systems and solve problems using state-of-the-art algorithms, techniques, tools, and datasets. A typical MIIS student completes the program in one year (12 months) of full-time study at the Pittsburgh campus. Part-time and distance education options are available to students employed at affiliated companies. The application deadline for the Fall 2013 term is December 14, 2012. For more information about the program, please visit http://www.lti.cs.cmu.edu/education/msiis/overview.shtml
| ||
4-4 | Master in linguistics (Aix-Marseille) France Master's in Linguistics (Aix-Marseille Université): Linguistic Theories, Field Linguistics and Experimentation TheLiTEx offers advanced training in Linguistics. This specialty focuses Linguistics is aimed at presenting in an original way the links between corpus linguistics and scientific experimentation on the one hand and laboratory and field methodologies on the other. On the basis of a common set of courses (offered within the first year), TheLiTEx offers two paths: Experimental Linguistics (LEx) and Language Contact & Typology (LCT) The goal of LEx is the study of language, speech and discourse on the basis of scientific experimentation, quantitative modeling of linguistic phenomena and behavior. It focuses on a multidisciplinary approach which borrows its methodologies to human physical and biological sciences and its tools to computer science, clinical approaches, engineering etc.. Among the courses offered: semantics, phonetics / phonology, morphology, syntax or pragmatics, prosody and intonation, and the interfaces between these linguistic levels, in their interactions with the real world and the individual, in a biological, cognitive and social perspective. Within the second year, a set of more specialized courses is offered such as Language and the Brain and Laboratory Phonology. LCT aims at understanding the world's linguistic diversity, focusing on language contact, language change and variation (European, Asian and African languages, Creoles, sign language, etc.).. This specialty focuses, from a a linguistic and sociolinguistic perspective, on issues of field linguistics and taking into account both the human and socio-cultural dimension of language (speakers, communities). It also focuses on documenting rare and endangered languages and to engage a reflection on linguistic minorities. This path also provides expertise and intervention models (language policy and planning) in order to train students in the management of contact phenomena and their impact on the speakers, languages and societies More info at: http://thelitex.hypotheses.org/678
| ||
4-5 | NEW MASTER IN BRAIN AND COGNITION AT UNIVERSITAT POMPEU FABRA, BARCELONA NEW MASTER IN BRAIN AND COGNITION AT UNIVERSITAT POMPEU FABRA, BARCELONA
| ||
4-6 | Masters à la Sorbonne (Paris) Les masters d'Ingénierie de la langue de Paris-Sorbonne, ILGII (R) et IILGI (P), sont maintenant regroupés dans une seule spécialité de la mention Littérature, Philosophie, Linguistique.
| ||
4-7 | New Masters in Machine Learning, Speech and Language Processing at Cambridge University, UK New Masters in Machine Learning, Speech and Language Processing
This is a new twelve-month full-time MPhil programme offered by the Computational and Biological Learning Group (CBL) and the Speech Group in the Cambridge University Department of Engineering, with a unique, joint emphasis on both machine learning and on speech and language technology. The course aims: to teach the state of the art in machine learning, speech and language processing; to give students the skills and expertise necessary to take leading roles in industry; to equip students with the research skills necessary for doctoral study.
UK and EU students applications should be completed by 9 January 2015 for admission in October 2015. A limited number of studentships may be available for exceptional UK and eligible EU applicants.
Self-funding students who do not wish to be considered for support from the Cambridge Trusts have until 30 June 2015 to submit their complete applications.
More information about the course can be found here: http://www.mlsalt.eng.cam.ac.uk/
| ||
4-8 | The International Standard Language Resource Number (ISLRN) The Resource Management Agency (RMA), an important language resource player in South Africa, adopts the International Standard Language Resource Number (ISLRN) initiative
The RMA is now a certified provider to the ISLRN system. This means that the RMA can apply for ISLRNs on behalf of the developers of the data that is managed and distributed via the RMA website. The RMA has already submitted 117 language resources to the ISLRN, including language resources for the 11 official languages of South Africa. These include text and speech resources such as text corpora (annotated, genre classification, parallel), translation memories, custom dictionaries for government domain, compound semantic and splitting datasets, frequency word lists, speech corpora, and pronunciation dictionaries. The meta-information for these language resources is also available on the ISLRN website with a broad international audience. Background As part of an international effort to document and archive the various language resource development efforts around the world, a system of assigning ISLRNs was established in November 2013. The ISLRN is a unique ?persistent identifier? to be assigned to each language resource. The establishment of ISLRNs was a major step in the networked and shared world of human language technologies. Unique resources must be identified as they are, and meta-catalogues require a common identification format to manage data correctly. Therefore, language resources should carry identical identification schemes independent of their representations, whatever their types and wherever their physical locations (on hard drives, internet or intranet) (http://islrn.org/).
About RMA: The Department of Arts and Culture of South Africa established the RMA to manage and distribute reusable text and speech resources developed by the National Centre for Human Language Technology from a centralised location. As many of the South African languages are deemed resource-scarce, the RMA aspires to make data resources for these languages more readily available. To find out more about RMA, please visit the RMA website: http://rma.nwu.ac.za.
About ELRA: The European Language Resources Association (ELRA) is a non-profit-making organisation founded by the European Commission in 1995, with the mission of providing a clearing house for language resources and promoting human language technologies. To find out more about ELRA, please visit the website: http://www.elra.info
| ||
4-9 | SProSIG Bids for Speech prosody 2018 SProSIG
The purpose of the Speech Prosody Special Interest Group (SProSIG) is to promote interest in Speech Prosody; to provide a means of exchanging news of recent research developments and other matters of interest in Speech Prosody; to sponsor meetings and workshops in Speech Prosody that appear to be timely and worthwhile; and to provide and make available resources relevant to Speech Prosody. SProSIG is a special interest group of ISCA, and of IPA. Our web page is http://sprosig.org.
Membership in SProSIG is obtained by signing up for the mailing list. The mailing list is currently housed at https://lists.illinois.edu/lists/info/sprosig.
All members of SProSIG are allowed to vote on the location of the Speech Prosody conference. Bids for Speech Prosody 2018 will be presented orally at Speech Prosody 2016, and in written form during June 2016.
SProSIG is administered by officers under the direction of a Permanent Advisory Committee (PAC). Officers are nominated biennially in August, and elected in September. Current officers are Keikichi Hirose, Mark Hasegawa-Johnson, Hansjörg Mixdorff and Yi Xu.
The founding officers of SProSIG specified services to members including dedicated web pages, an e-mail newsletter, a bibliographic database, workshops and special sessions, and the organization of the international conference Speech Prosody. The web page has been little updated since 2012, and the newsletter has been dormant far longer; it is our intention to revise both. Suggestions about content and frequency are welcome, especially if delivered in a friendly tone of voice to any current officer or PAC member at Speech Prosody 2016. Call for Bids for the hosting of SP9: Speech Prosody 2018
Members of SProSIG with a history of attendance at Speech Prosody conferences are encouraged to submit bids to host SP9: Speech Prosody 2018. Written bids must be submitted by July 15, 2016 to the SProSIG Secretary, Mark Hasegawa-Johnson, at jhasegaw@illinois.edu. All written bids received by that date will be posted at http://sprosig.org. The full membership of SProSIG will then be invited to read the written bids, and an on-line vote will be held to determine the location of SP9. A written bid may contain any information that you believe is likely to sway the members of SProSIG, but must contain at least the following information:
City and Country in which the conference will be held:
General Chair (Name, Affiliation, and a list of Speech Prosody conferences that he or she has attended):
Organizing Committee Members (Same information as above):
Proposed conference period: DD/MM/YYYY – DD/MM/YYYY
Expected early registration fee for ISCA members:
Contractor (University, Company, and/or Contractor organizing the conference; this can be changed later if necessary):
Venue (name of hotel, conference center, university etc. Can be changed later if it necessary):
Access to the venue from the closest major airport (Is it easy for participants to reach the venue?):
Accommodation (a rough idea on number of near-by hotels and their prices. If organizers plan to offer university dormitories for participants, please mention with some information.):
Scientific Theme of Speech Prosody 2018 (if any):
Other points to be emphasized (if any):
| ||
4-10 | To ISCA Members interested in Code-Switching To ISCA Members interested in Code-Switching
We are asking for input on researchers’ interest and engagement in computational approaches to linguistic Code-Switching in any language pair, modality, or genre. A brief survey can be found at the link below. Thanks much for your help!
https://docs.google.com/forms/u/0/d/1ARm04N_si_7VaMPjtbWOFUUjxJZm7TQmjgNSHcUNPcw
Mona Diab Julia Hirschberg Thamar Solorio
| ||
4-11 | Participating to the Singing Synthesis Challenge Your participation to the Singing Synthesis Challenge listening test would be much appreciated. Feel free to disseminate !
Many thanks for participation,
| ||
4-12 | New funding opportunity at iARPA Dear Speech Scientist:
IARPA would like to announce a new funding opportunity involving speech recognition, information retrieval, summarization, domain adaptation and machine translation of low resource languages -- the forthcoming MATERIAL Program.
A Proposers' Day for MATERIAL will occur in the DC area on Sept. 27, 2016. A formal solicitation for proposals is expected to follow the Proposers' Day. Please note that registration for this event closes on Sept. 20. To register for this event, please visit: BRIEF PROGRAM DESCRIPTION AND GOALS
Current methods to produce similar technologies require a substantial investment in training data and/or language specific development and expertise, entailing many months or years of development. A goal of this program is to drastically decrease the time and data needed to field systems capable of fulfilling an English-in, English out task. Limited machine translation and automatic speech recognition training data will be provided from multiple low resource languages to enable performers to learn how to quickly adapt their methods to a wide variety of materials in various genres and domains. As the program progresses, performers will apply and adapt these methods in increasingly shortened time frames to new languages. Program data will include formal and informal genres of text and speech which will not be fully captured by the training data. Image and video are out of scope for this program. Performers will be evaluated, relative to a baseline system, on their ability to accurately retrieve text and speech materials relevant to an English domain-specific query from a database of multi-domain, multi-genre documents in a low resource language, and their ability to convey the relevance of those documents through summaries presented to English speaking domain experts. To develop such an end-to-end system, large multi-disciplinary teams will be required with expertise in a number of relevant technical areas including, but not limited to, natural language processing, low resource languages, machine translation, corpora analysis, domain adaptation, computational linguistics, speech recognition, language identification, semantics, summarization, information retrieval, and machine learning. Since language-independent approaches with quick ramp up time are sought, foreign language expertise in the languages of the program is not expected. IARPA anticipates that universities and companies from around the world will participate in this research program. Researchers will be encouraged to publish their findings in publicly-available, academic journals.
For updated information on the program, please visit: https://www.iarpa.gov/index.php/research-programs/material
| ||
4-13 | msg from ELRA: CEF Automated Translation call for proposals (CEF-TC-2016-3) CEF Automated Translation call for proposals (CEF-TC-2016-3) will be launched on 20 September 2016 with a closing date on 15 December 2016.
| ||
4-14 | Theory of Musical Equalibration We would like to inform you about the 'Theory of Musical Equalibration' that describes the relationship between chords and their emotional impact. If interested, we could introduce the subject at the International Speech Communications Association .
Last year we presented the topic at the 'Croatian Days of Music Theory' in Zagreb:
at the 'Israel Musicological Society' in Tel Aviv: http://media.wix.com/ugd/510480_4e3aa5de255a4868b0a86de92b7fd15e.pdf
and at the Centre for Systematic Musicology, University of Graz: https://www.homepage.uni-graz.at/de/richard.parncutt/research/weekly-seminar/
You can download the English translation of our book 'Music and Emotions - Research on the Theory of Musical Equilibration (die Strebetendenz-Theorie)' for free: http://www.willimekmusic.de/music-and-emotions.pdf
or our article 'Why do minor chords sound sad?' in the Journal for Psychology & Psychotherapy: Journal-of-psychology-and-psychotherapy
The 'Theory of Musical Equilibration' is also discussed in the discussion page of the Society for Music Theory: smt discuss
Thank you for your interest. We're looking forward to hearing from you.
Best regards
Daniela Willimek, University of Music Karlsruhe, and Bernd Willimek, music theorist.
| ||
4-15 | Bids for ACM International Conference on Multimodal Interaction (ICMI) 2018 ACM International Conference on Multimodal Interaction (ICMI) is the premier international forum for multidisciplinary research on multimodal human-human and human-computer interaction, interfaces, and system development. The conference focuses on theoretical and empirical foundations, component technologies, and combined multimodal processing techniques that define the field of multimodal interaction analysis, interface design, and system development.
ICMI Steering Board invites proposals to host the 20th Annual Conference on Multimodal Interaction (ICMI 2018), to be held in a between the end of September and mid-November 2018. We seek preliminary draft proposals from bidders in the North or South American continents (ICMI 2016 is in Tokyo, Japan and ICMI 2017 will be in Glasgow, UK), although strong proposals from other regions are also welcome. Promising bidders will be asked to provide additional information for the final selection.
Please see the attached document for more details about ICMI 2018 bid process.
Important Dates
ICMI 2016 and 2017 websites
All communications, including request for information and bid submission, should be sent to the ICMI Steering Board Chair (Louis-Philippe Morency, morency@cs.cmu.edu).
Best wishes, Louis-Philippe Morency Assistant Professor, Carnegie Mellon University Director, Multimodal Communication and Machine Learning Laboratory Chair, ACM ICMI Steering Board https://www.cs.cmu.edu/~morency/
| ||
4-16 | ACM ICMI CALL FOR BIDS 2018 ACM ICMI CALL FOR BIDS 2018 The Steering Board of the ACM International Conference on Multimodal Interaction (ICMI) invites proposals to host the 20th Annual Conference, to be held in a between the end of September and mid-November 2018. We seek preliminary draft proposals from bidders in the North or South American continents (ICMI 2016 is in Tokyo, Japan and ICMI 2017 will be in Glasgow, UK), although strong proposals from other regions are also welcome. Promising bidders will be asked to provide additional information for the final selection. Evaluation Proposals will be evaluated according to the following criteria (unordered): Experience and reputation of General Chairs and Program Chairs Local multimodal interaction community support (Local) government and industry support Support and opportunities for students Accessibility and attractiveness of proposed site Suitability of proposed dates (with list of specific conflicts to avoid) Adequacy of conference facilities for the anticipated number of attendees Adequacy of accommodations and food services in a range of price categories and close to the conference facilities Overall balance of budget projections Geographical balance with regard to previous ICMI meetings
All communications, including request for information and bid submission, should be sent to the ICMI Steering Board Chair (Louis-Philippe Morency, morency@cs.cmu.edu). Important Dates December 21, 2016 - Notify intention to submit proposal January 27, 2017 - Draft proposals due February 10, 2017 - Feedback to bidders February 24, 2017 - Final bids due March 17, 2015 - Bid selected
Bid Content The following questions have to be answered for the official bid (both draft and final proposals). Text in square brackets [] contains considerations to be taken into account. 1. Describe briefly the conference, including side events 2. Describe briefly the conference site. 3. What date do you consider? 4. What is the nearest (international) airport? 5. Please give price quotes for the cheapest roundtrip to the conference location from Frankfurt, London, New York, San Francisco, Beijing and Tokyo (assume one week of travel with a Saturday overnight stay) 6. What transportation should participants use from the airport to the conference site? 7. Does the conference site both have a large room for a maximum of 300 people and about 5 smaller rooms for a maximum of 30-70 people? Is there wireless connection available for attendees? What about audio-visual facilities? 8. What is the approximate room rate (single and double occupancy)? Is breakfast included? [Often all the attendees of ICMI stay at the same hotel. If this is your case, the conference chair should find a hotel that allows accommodation for the expected number of people. Booking rooms and meals in the same hotel as the conference rooms often helps reducing the overall costs.] 9. Catering, including breaks, receptions, banquet and entertainment. We encourage organizers to provide coffee breaks and lunches in order to promote community building and discussion 10. Which support can your department give for the organization of the conference (e.g., free secretarial assistance, facilities for on-line payment? 11. Which support can your department give during the conference (e.g., free secretarial assistance, PCs / Macs at the conference site)? 12. What are your plans for sponsorship? To which associations / companies / institutions do you plan to apply for financial assistance? What do you realistically expect to receive from them? What are your plans concerning student travel stipend program [A minimum of $5,000 should be reserved from each year's conference budget to support student travel from each of the three major geographic regions (Americas, Europe-Africa, Asia-Pacific), or $15,000 total. For example, if a grant for $15,000 is obtained from NSF to support U.S. student travel but there is no external funding for students from other continents, then an additional $10,000 of you budget should be set aside for students from the other two continents] 13. What actions will you take to make the conference cheaper for students? (e.g., seek financial support from other organizations, provide cheaper rooms)? What reduction do you realistically expect? 14. Provide the names of people who are foreseen or confirmed for the major Conference Committees: General Chairs, Program Chairs, Sponsorship Chair; volunteer labor, registration handling. Describe any experience the team has had in organizing previous conferences and the number of participants at those conferences 15. Local Multimodal Interaction community 16. How do you propose to run the paper reviewing process? Do you see any possible improvements? 17. How will you organize the content of the conference to ensure a high-quality and energetic exchange of information that includes timely topics and stimulating external speakers? Please be specific in your suggestions for how you would organize the main program and workshops/tutorials 18. Any other aspects that you may find relevant for the evaluation of your proposal Preparing a budget proposal Based on estimates from previous attendance, one might expect 200 participants to ICMI. Please, provide two budgets, one for 150 participants and the other for 200. Costs that will have to be covered include: Rental of conference space and meeting rooms AV equipment Coffee breaks and possibly lunch Registration desk/technical helpers (e.g., student volunteers) Tutorials Producing and printing the proceedings ACM 18% contribution and contingency fund Conference poster and advertising Social dinner Welcome reception Dinner/lunch for ICMI board meeting
| ||
4-17 | Prix de thèse AFCP 2016 - 2017
| ||
4-18 | ASVspoof 2017 CHALLENGE: Audio replay detection for automatic speaker verification anti-spoofing =*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*
ASVspoof 2017 CHALLENGE: Audio replay detection for automatic speaker verification anti-spoofing http://www.spoofingchallenge.org/ =*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=*=* Are you good at machine learning for audio signals? Are you good at discriminating 'fake' signals from authentic ones? Are you looking for new audio processing challenges? Do you work in the domain of speaker recognition? ASVspoof 2017 challenge might be for you! CHALLENGE TASK: Given a short clip of speech audio, determine whether it contains a GENUINE human voice (live recording), or a REPLAY recording (fake). You will be provided a development set containing genuine/replay labeled audio examples, along with further metadata such as speech content and the devices used in the replay recordings. Your task is to develop a system that assigns a single 'liveness' or 'genuineness' score value to new audio samples, and to execute that system on a set of test files for which the ground truth is not provided. We provide a Matlab-based reference baseline method to kick-off quickly towards developing your new ideas! For more details, refer to the evaluation plan in the website: http://www.spoofingchallenge.org/ BACKGROUND: The goal of the challenge series is to enhance security of automatic speaker verification (ASV) systems from being intentionally circumvented using fake recordings, also known as 'spoofing attacks' or 'representation attacks' in the context of biometrics. ASVspoof 2017 is a second edition of a challenge kicked off in 2015, and the new perspective in ASVspoof 2017 are the replay attacks, especially 'unseen' attacks - for instance, containing replay environments, devices and speakers that might be very different from those in the development data. Despite 'ASV' being in the challenge title, you do NOT require knowledge of automatic speaker verification: the task is a 'standalone' replay audio detection task that can be addressed as a generic acoustic pattern classification problem. We welcome as many new ideas to the problem as possible! SCHEDULE: Development data published: December 23rd, 2016 Evaluation data published: February 10, 2017 Evaluation set scores due: February 24, 2017 Results available: March 3, 2017 Interspeech paper deadline: March 14, 2017 Metadata/keys published: May 2017 Interspeech special session: August 2017 REGISTRATION: Send a free-worded e-mail to asvspoof2017@cs.uef.fi to register and obtain the dev data. ORGANIZERS: Tomi Kinnunen, University of Eastern Finland, FINLAND Nicholas Evans, Eurecom, FRANCE Junichi Yamagishi, University of Edinburgh, UK Kong Aik Lee, Institute for Infocomm Research, SINGAPORE Md Sahidullah, University of Eastern Finland, FINLAND Massimiliano Todisco, Eurecom, FRANCE Hector Delgado, Eurecom, FRANCE
|