![]() |
ISCApad #325 |
Wednesday, July 09, 2025 by Chris Wellekens |
3-1-1 | (2025-08-17) Interspeech 2025, Rotterdam, The Netherlands INTERSPEECH 2025
| ||
3-1-2 | (2026) Interspeech 2026, Sydney, Australia The Australasian Speech Science and Technology Association is honoured to have been selected to host INTERSPEECH 2026. Our theme of Diversity & Equity ? Speaking Together strongly reflects Sydney and our broader region. Sydney is Oceania?s largest city and is also its most linguistically diverse: more than 300 different languages are spoken and 40% of Sydneysiders speak a language other than English at home. Consistent with the goals of ISCA ?to promote, in an international world-wide context, activities and exchanges in all fields related to speech communication science and technology?, INTERSPEECH Sydney will highlight the diversity of research in our field with a firm focus on equity and inclusivity. Recognizing the importance of multi-dimensional approaches to speech, INTERSPEECH 2026 will foster greater interdisciplinarity to better inform current and future work on speech science and technology. We look forward to welcoming all to Sydney!
| ||
3-1-3 | (2026) Speech Prosody 2026 in Philadelphia, PA, USA Dear Speech Prosody SIG Members,
I'm pleased to announce that Speech Prosody 2026 will be in Philadelphia, organized by Jianjing Kuang and Mark Liberman. (There were 80 votes for Shanghai, 128 for Philadelphia, and 15 had no preference.)
I’m also pleased to announce the resumption of our lecture series with a talk by Simon Roessig:
Syntagmatic prominence relations in prosodic focus marking
Lecturer: Simon Roessig (University of York, UK) Host: Plinio A. Barbosa (Unicamp, Brazil)
Sept 24th at 1 pm (Brasilia time = UTC - 3)
Abstract: This talk is about the role of prenuclear prominences and their relation to nuclear accents in German and English. The production results (German) that I will present show that the realization of the prenuclear domain depends on whether it is focal or prefocal. The prenuclear noun is characterized by larger F0 excursions, higher F0 maxima, and longer durations when it is in broad focus than when it precedes a narrow focus. Furthermore, the realization of the prenuclear domain depends on the following focus type: The prenuclear noun is produced with smaller F0 excursions, lower F0 maxima and shorter durations before a corrective focus than before a non-corrective narrow focus. The findings suggest that the phonetic manifestation of information structure is distributed over larger prosodic domains with an inverse relationship in the syntagmatic dimension. In addition, the study contributes further evidence that continuous phonetic detail is used to encode information structural categories. An important question that arises from the production data is whether this phonetic detail can be used by listeners in perception. I will present first results from a series of perception experiments (German and English) to investigate this question.
Plan: 1. I will begin by outlining what we know about focus prosody in the nuclear and prenuclear domains. 2. I will then present findings from a production study that examines the prosody of the prenuclear domain in different types of focus. 3. These results show that there are interesting strength relations between prenuclear and nuclear prosody in the encoding of focus types. 4. I will present preliminary findings from perception experiments investigating the question whether listeners use prenuclear prominence modulations in identifying focus types. 5. Finally, I will conclude with a discussion of the results and future directions.
Nigel Ward, SProSIG Chair nigel@utep.edu https://www.cs.utep.edu/nigel/
| ||
3-1-4 | (2027) Interspeech 2027, São Paulo, Brazil The ISCA Board has decided to award the organisation of Interspeech 2027 to São Paulo, Brazil. We are very excited to introduce researchers from all over the world to the South American continent for the first time.
| ||
3-1-5 | (2028) Interspeech 2028 at San Antonio, Texas, USA Announcement of Interspeech 2028 location
The ISCA Board has decided to award the organisation of Interspeech 2028 to San Antonio, TX, USA. We are very excited to welcome researchers from all over the world back to the US for the first time since 2016. The conference will be held from September 3 - 8, 2028.
| ||
3-1-6 | ISCA INTERNATIONAL VIRTUAL SEMINARS Now's the time of year that seminar programmes get fixed up.. please direct the attention of whoever organises your seminars to the ISCA INTERNATIONAL VIRTUAL SEMINARS scheme (introduction below). There is now a good choice of speakers: see https://www.isca-speech.org/iscaweb/index.php/distinguished-lecturers/online-seminars ISCA INTERNATIONAL VIRTUAL SEMINARSA seminar programme is an important part of the life of a research lab, especially for its research students, but it's difficult for scientists to travel to give talks at the moment. However, presentations may be given on line and, paradoxically, it is thus possible for labs to engage international speakers who they wouldn't normally be able to afford.
Speakers may pre-record their talks if they wish, but they don't have to. It is up to the host lab to contact speakers and make the arrangements. Talks can be state-of-the-art, or tutorials. If you make use of this scheme and arrange a seminar, please send brief details (lab, speaker, date) to education@isca-speech.org If you wish to join the scheme as a speaker, we need is a title, a short abstract, a 1 paragraph biopic and contact details. Please send them to education@isca-speech.org PS. The online seminar scheme is now up and running, with 7 speakers so far:
Jean-Luc Schwartz, Roger Moore, Martin Cooke, Sakriani Sakti, Thomas Hueber, John Hansen and Karen Livescu.
|
3-2-1 | (2025-07-05) 7th Edition of Summer School on Automatic Speech Recognition (ASR), DAU, Gandhinagar, India. DAU is organizing the 7th Edition of Summer School on Automatic Speech Recognition (ASR) during July 05-09, 2025.The event is sponsored by DAU (formerly, DA-IICT). ASR is a highly multidisciplinary field and it is a key component of commercially successful Voice Assistants, such as Apple's Siri, Microsoft's Cortana, Google Assistant, Amazon's Echo/Alexa, Samsung's Bixby, IBM's Watson, etc. The experts chosen for this event are Hervé Bourlard (EPFL and former Director, IDIAP Research Institute, Switzerland), Akihiko K. Sugiyama (Founder, Damas.cus Corporation, Tokyo and Kansai University, Japan), Thomas Hain (ISCA Fellow, The University of Sheffield, UK), Yu Tsao (Academia Sinica, Taiwan), B. Yegnanarayana (Fellow of IEEE, ISCA, INAE, Retd. IIT Madras), Hema A. Murthy (ISCA Fellow, IIT Madras), S. Umesh (IIT Madras), Nancy F. Chen (I2R Singapore), Tatsuya Kawahara (IEEE Fellow, Kyoto University, Japan), Sriram Ganapthy (IISc Bengaluru), K. S. R. Murty (IIT Hyderabad), Anil Kumar Vuppala (IIIT Hyderabad), and Vinayak Abrol (IIIT Delhi). This event is sponsored by ISCA, IndSCA, DAU Gandhinagar, and technically co-sponsored by IEEE Signal Processing Society (SPS) Gujarat Section.
URL: https://sites.google.com/view/s4p2025/speakers Note: The current version of the poster of this event is attached. The last date of early bird registration is March 31, 2025.
In addition, the Summer School will also have a special session on Industry Perspective Talks, where speakers are: Sunayana Sitaram (Microsoft Research, Bengaluru), Sri Garimella (Amazon, Bengaluru), Premjeet Singh (Samsung Research Institute, Bengaluru), K. Sunilkumar (TCS Innovation Labs, Mumbai), Debmalya Chakrabarty (Amazon, Bengaluru), Bidisha Sharma (Uniphore, Bengaluru), Nirmesh J. Shah (Sony Research, India), Nagaraj Adiga (Krutrim, Bengaluru), and Dipesh K. Singh (Augnito, Mumbai).
The program committee of S4P 2025 includes internationally well-known experts from 10 countries across the world. The S4P 2025 also includes 6th edition of 5 minute Ph.D. thesis (5MPT) contest, which provides doctoral scholars an opportunity to showcase their research work before eminent researchers both from academia and industry, and 2nd edition of Poster Presentation Contest (PPC). Four best presentations by the scholars during 5MPT and PPC will be awarded cash prizes. Further, we are also providing the Student Travel Grants (including DAU Student Grants to 50 students) to student participants. We are enclosing a poster which describes the outline of the event and call for participation.
Please do help in giving wide publicity of this event and you may please encourage students working in relevant areas to register for this event and apply for IndSCA Student Travel Grants, please.
Thanks and best regards, Prof. (Dr.) Hemant A. Patil, Professor and Placement Convenor, DAU (formerly DA-IICT) Gandhinagar, India. On behalf of the Organizing Committee, S4P, July 05-09, 2025. Associate Editor, IEEE Signal Processing Magazine 2021-2023. ISCA Distinguished Lecturer 2020-2022, and APSIPA Distinguished Lecturer 2018-2019 Speech Research Lab @ DAU Gandhinagar, https://sites.google.com/site/speechlabdaiict/ PS: The closure report of the 6th edition of Summer School is available at https://sites.google.com/view/s4p2025/past_events
|
3-3-1 | Call for ICMI Workshop Papers ============================= Cross-Cultural Multimodal Interaction (CCMI) Summary: This workshop seeks to establish an international research platform to investigate the impact of linguistic and cultural differences on nonverbal behavior and their effects on communication dynamics. Moving beyond merely identifying nonverbal behavior patterns in specific cultural contexts, the workshop aims to uncover the mechanisms behind adaptation, change, and misunderstanding in intercultural interactions. The first year will focus on data-related challenges, such as collecting and annotating high-quality data across different regions. While advances in sensor technology, machine learning, and Large Language Models (LLMs) have been applied to linguistic diversity, their use in nonverbal communication remains underexplored. Given the known cultural variations in gestures, facial expressions, and turn-taking, integrating insights from humanities research with multimodal analysis is crucial. As LLMs continue to shape human-machine interactions globally, understanding and incorporating cultural differences in nonverbal behavior is an urgent and significant research challenge. Webpage: https://sites.google.com/view/ccmi2025/home Deadline: 7 July 2025 Organizers: Koji Inoue, Kyoto University, Japan Shogo Okada, Japan Advanced Institute of Science and Technology (JAIST), Japan Divesh Lala, Kyoto University, Japan Sahba Zojaji, The Chinese University of Hong Kong, Shenzhen, China Nancy F. Chen, Agency for Science, Technology, and Research (A*STAR), Singapore Tatsuya Kawahara, Kyoto University, Japan Holistic and Responsible Affective Intelligence (HRAI) Summary: Affective computing techniques are typically developed for specific tasks in controlled settings, lacking the flexibility to handle multiple affective states simultaneously. Recently, foundation models have emerged as a promising solution, demonstrating strong performance across various affective tasks and offering a more comprehensive approach to affective intelligence. However, their adoption also raises critical ethical concerns, including privacy risks, fairness, sustainability, and bias. Therefore, ensuring their responsible and ethical use is more urgent than ever. This workshop aims to advance both the holistic development of affective computing and the understanding of its associated ethical challenges. Webpage: https://sites.google.com/view/hariworkshop Deadline: 8 July 2025 Organizers: Yuanchao Li, University of Edinburgh, UK Dimitrios Kollias, Queen Mary University London, UK Guillaume Chanel, University of Geneva , Switzerland Marios Fanourakis, University of Geneva , Switzerland Leimin Tian, CSIRO, Australia Michal Muszynski, IBM Research, Switzerland Brandon Booth, University of Memphis, USA Huili Chen, Princeton University, USA Catherine Lai, University of Edinburgh, UK IMAGINE-RS: 1st Workshop on Interactive Multimodal Analysis and Geospatial Intelligence for Remote Sensing Summary: The fusion of AI-driven geospatial intelligence with interactive and explainable AI redefines how we interpret and interact with remote sensing data. IMAGINE-RS brings together researchers from remote sensing, HCI, AI, cognitive science, and geospatial analytics to explore advancements in AR/VR, conversational AI, BCI, and haptic feedback for geospatial applications. This workshop brings interdisciplinary discussions on enhancing interactive AI methods and evaluating their impact on precision agriculture, disaster response, and environmental monitoring. Promoting human-centered geospatial intelligence aims to identify key challenges and drive collaboration toward trustworthy, accessible, and transparent AI-driven Earth observation systems. Webpage: https://sites.google.com/view/imagine-rs-icmi2025/ Deadline: 15 July 2025 Organizers: Ankit Jha, LNM Institute of Information Technology Jaipur, India Biplab Banerjee, Indian Institute of Technology Bombay, India Danfeng Hong, Chinese Academy of Sciences, China Moloud Abdar, Deakin University, Australia Anmol Srivastav, Indraprastha Institute of Information Technology Delhi, India Poulami Dalapati, LNM Institute of Information Technology Jaipur, India Keshav Kaushik, LNM Institute of Information Technology Jaipur, India Lokendra Chauhan, QEN Labs, USA The Fifth International Workshop on Automated Assessment of Pain (AAP) Summary: Pain typically is measured by patient self-report, but self-reported pain is difficult to interpret and may be impaired or in some circumstances not possible to obtain. For instance, in patients with restricted verbal abilities such as neonates, young children, and in patients with certain neurological or psychiatric impairments (e.g., dementia). Additionally, the subjectively experienced pain may be partly or even completely unrelated to the somatic pathology of tissue damage and other disorders. Therefore, the standard self-assessment of pain does not always allow for an objective and reliable assessment of the quality and intensity of pain. Given individual differences among patients, their families, and healthcare providers, pain often is poorly assessed, underestimated, and inadequately treated. To improve assessment of pain, objective, valid, and efficient assessment of the onset, intensity, and pattern of occurrence of pain is necessary. To address these needs, several efforts have been made in the machine learning and computer vision communities for automatic and objective assessment of pain from video as a powerful alternative to self-reported pain. The workshop aims to bring together interdisciplinary researchers working in the field of automatic multimodal assessment of pain (using video and physiological signals). A key focus of the workshop is the translation of laboratory work into clinical practice. Webpage: http://aap-workshop.net Challenge: https://sites.google.com/view/ai4pain2025/home Deadline: 8 July 2025 Organizers: Zakia Hammal, The Robotics Institute, Carnegie Mellon University, USA. Raul Fernandez-Rojas, University of Canberra, Australia. Steffen Walter, University Hospital Ulm, Germany. Nadia Berthouze, University College London, UK. Roland Goecke, University of Canberra, Australia. Ben Seymour, University of Oxford, UK.
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-2 | (2025-07-13 )International Forensics Summer School (IFOSS) - Punta Sampieri - Scicli (Ragusa), Sicily, IT 13-19 July 2025 International Forensics Summer School (IFOSS) - Sicily, IT 13-19 July 2025 Forensic Horizons: Investigating Truth in the Digital and AI Era Challenges and Opportunities in Digital Forensics, Cybersecurity, and AI-Driven Investigations
The fourth edition of IFOSS will aim to provide both an objective and clear overview and an in-depth analysis of the state-of-the-art research as well as the professional best practices in Forensics and related fields. The courses will be delivered by world renowned experts, from either academia, law enforcement and industry, and will cover both theoretical and practical aspects of real Forensics problems. Leading scientists from the different fields involved will introduce the topics. A typical course is broad enough to provide a general introduction to the chosen topic, whilst one can learn the most relevant contributions in depthThe school will aim to provide a stimulating opportunity for professionals and young researchers, as well as Ph.D. students.
After the big success of the first two editions with about 75 attendees from all over the world, also this year we will focus on Digital Forensics and Law with specific emphasis on the AI revolution.
In the following the related recap videos: IFOSS 2022: https://www.youtube.com/watch?v=7a35Wjkygx0&ab_channel=IFOSS IFOSS 2023: https://youtu.be/lfRZ8LyfYYs?si=MJ_7g3BORdZH-sNo IFOSS 2024: https://youtu.be/525Krq0ZGF4?si=1a6VlSdO1PWr9VJt
The underlying theme of the current edition is: Forensic Horizons: Investigating Truth in the Digital and AI Era Challenges and Opportunities in Digital Forensics, Cybersecurity, and AI-Driven Investigations
Why Attend?
LIST OF SPEAKERS (fully confirmed)
Fernando-Perez Gonzalez, University of Vigo, ES Andrea Cavallaro, EPFL, CH Didier Meuwly - University of Twente, NL Jacopo Della Torre - University of Genova, IT
Fabio Bruno, Interpol, Singapore Gianluca Foresti, University of Udine, IT Stefano Mele - ICT Authority, San Marino ..others coming soon.
DIRECTORS Sebastiano Battiato - University of Catania, Italy Donatella Curtotti - University of Foggia, Italy Giovanni Ziccardi, University of Milan, Italy
PhD FORUM A special session is organized for participants who intend to take advantage of the audience for presenting their current research/tool in the covered areas. A Special Issue to follow up on the themes covered in the School will be published in some indexed high-impact journals (to be announced early). Applicants are particularly encouraged to submit their original research to the SI (the usual refereeing procedure applies to guarantee the highest scientific standards).
APPLICATION The school will be open to about 75 qualified, motivated and pre-selected candidates. Ph. D. students, post-docs, young researchers (both academic and industrial), senior researchers (both academic and industrial) or academic/industrial professionals are encouraged to apply at: www.ifoss.it
The expected school fee will be in the order of 550 euros for Master and Phd students granted by academia, € 600 for other academic positions and € 700 for industrial. Reduced Fee will be reserved to LEAs, private lawyers and practitioners 400 Euros. The fee will include all course materials, coffee breaks, bus service from Catania Airport to School Location and return, WiFi Internet Connection, a guided tour, a social dinner and all the events scheduled in the programme.
A certain number of scholarships will be available soon depending on sponsorship income.
Applications to attend IFOSS 2025 should be received before 04/05/2025.
Applicants will receive notification of acceptance by mid of May. Late registration can be done with an extra payment of € 100.
ACCOMODATIONS IFOSS participants will be hosted at Hotel Village Baia Samuele (school location) at very special rates. There are no other accommodation options. IFOSS 2025 participants must make reservations for accommodation, using the accommodation reservation form (available soon) to be sent directly to Baia Samuele reception.
After a certain date there is no guarantee for reservations in Hotel Village Baia Samuele. More information will be announced as soon as possible on the web site. Depending on chosens settings (Single, Double or Triple Room) the overall cost enclosing Breakfast, Lunch and Dinner should span in the range (650-1000 euros) in the period 13(in) to 19 (out) July 2025.
LOCATION IFOSS 2025 will be hosted by Hotel Village Baia Samuele in Punta Sampieri - Scicli (Ragusa), Sicily from 13-19 July 2025. Sicily is one of the most beautiful islands of the Mediterranean. The island is very rich in archeological sites from various Ancient Civilizations. The sea, weather, food and the wine are excellent. In particular Punta Sampieri - Scicli (RG) is located in the south east of Sicily in a late Baroque area called Val di Noto. The Val di Noto area is included in the Unesco World Heritage List and includes eight nearby towns: Caltagirone, Militello Val di Catania, Catania, Modica, Noto, Palazzolo, Ragusa and Scicli. The location of the school rises in the middle of an ample bay delimited on the west from Sampieri and on the east from a cliff, on which is found an ancient furnace, rare example of industrial archaeology. The Hotel Village Baia Samuele stretches in a gentle slant to the beach: 120 thousand square meters delimited from rows of secular cypresses. An ultramodern village with an original architecture, pleasant design and all the comforts you can imagine. The frame of plants and flowers, typical of this angle of Sicily, in front of the island of Malta, completes this gilded dream of the Mediterranean.
MORE INFORMATION Website: www.ifoss.it
FOLLOW US ON OUR SOCIAL MEDIA Facebook: https://www.facebook.com/InternationalForensicsSummerSchool/ Twitter: https://twitter.com/ifoss_official Instagram: https://www.instagram.com/ifoss_official/ LinkedIn: https://www.linkedin.com/company/ifoss
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-3 | (2025-07-20) CfP 61st IEEE Professional Communication Conference (ProComm), University of Southern Denmark, Sonderborg, Denmark Call for Papers for the 61st IEEE Professional Communication Conference (ProComm)
20-23 July 2025 at the University of Southern Denmark, Sonderborg, Denmark.
ProComm is the flagship conference of the IEEE Professional Communication Society.
The conference takes place under the theme: “Digital Solutions and Multimodal Challenges”
Modern communication calls for more than traditional methods today. Digital solutions play a crucial role as they expand the efficiency and reach of communication. Multimodal communication, an evolution of this trend, is about the challenge of seamlessly integrating various media forms, from messaging apps to video calls – and it is also about the challenge of understanding how the different type and layers of communication, text and images, speech and body language, interact and interfere in the creation of attractive and effective messages. In an increasingly interconnected world, the integration of digital solutions and multimodal communication is crucial for effective exchange and collaboration. While, of course, inviting papers from all areas of professional communication, the conference will put a focus on building such inter-disciplinary bridges, using our Acoustics Lab’s international network to invite (foreign) language teachers, public-speaking coaches, researchers from the speech sciences, and speech-communication engineers to the event.
The 2025 issue of the conference is hosted by the CIE Acoustics Lab at the University of Southern Denmark in Sonderborg, Denmark.
Conference website: https://event.sdu.dk/procomm2025
All papers are subject to two rounds of double-bling peer review, and accepted papers will be published as proceedings in IEEE Xplore®, see here for further information and indexing: https://ieeexplore.ieee.org/Xplorehelp/overview-of-ieee-xplore/about-content
Please note that the website for the 2025 conference is constantly updated. Information about keynotes, important dates etc. follow in autumn 2024.
*****************************************
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-4 | (2025-07-21) 12th INTERNATIONAL SCHOOL ON DEEP LEARNING, Porto, Portugal 12th INTERNATIONAL SCHOOL ON DEEP LEARNING (with a special focus on Large Language Models, Foundation Models and Generative AI) DeepLearn 2025 Porto – Maia, Portugal July 21-25, 2025 https://deeplearn.irdta.eu/2025/ ****************************************************** Co-organized by: University of Maia Institute for Research Development, Training and Advice – IRDTA Brussels/London ****************************************************** Early registration: January 23, 2025 ****************************************************** SCOPE: DeepLearn 2025 will be a research training event with a global scope aiming at updating participants on the most recent advances in the critical and fast developing area of deep learning. Previous events were held in Bilbao, Genova, Warsaw, Las Palmas de Gran Canaria, Guimarães, Las Palmas de Gran Canaria, Luleå, Bournemouth, Bari, Las Palmas de Gran Canaria and Porto. Deep learning is a branch of artificial intelligence covering a spectrum of current frontier research and industrial innovation that provides more efficient algorithms to deal with large-scale data in a huge variety of environments: computer vision, neurosciences, speech recognition, language processing, human-computer interaction, drug discovery, biomedicine and health informatics, medical image analysis, recommender systems, advertising, fraud detection, robotics, games, business and finance, biotechnology, physics experiments, biometrics, communications, climate sciences, geographic information systems, signal processing, genomics, materials design, video technology, social systems, earth and sustainability, etc. etc. The field is also raising a number of relevant questions about robustness of the algorithms, explainability, transparency, interpretability, as well as important ethical concerns at the frontier of current knowledge that deserve careful multidisciplinary discussion. Most deep learning subareas will be displayed, and main challenges identified through 18 four-hour and a half courses, 2 keynote lectures, 1 round table and a hackathon competition among participants. Renowned academics and industry pioneers will lecture and share their views with the audience. The organizers are convinced that outstanding speakers will attract the brightest and most motivated students. Face to face interaction and networking will be main ingredients of the event. It will be also possible to fully participate in vivo remotely. DeepLearn 2025 will place special emphasis on large language models, foundation models and generative artificial intelligence. ADDRESSED TO: Graduate students, postgraduate students and industry practitioners will be typical profiles of participants. However, there are no formal pre-requisites for attendance in terms of academic degrees, so people less or more advanced in their career will be welcome as well. Since there will be a variety of levels, specific knowledge background may be assumed for some of the courses. Overall, DeepLearn 2025 is addressed to students, researchers and practitioners who want to keep themselves updated about recent developments and future trends. All will surely find it fruitful to listen to and discuss with major researchers, industry leaders and innovators. VENUE: DeepLearn 2025 will take place in Porto, the second largest city in Portugal, recognized by UNESCO in 1996 as a World Heritage Site. The venue will be: University of Maia Avenida Carlos de Oliveira Campos - Castêlo da Maia 4475-690 Maia Porto, Portugal https://www.umaia.pt/en STRUCTURE: 3 courses will run in parallel during the whole event. Participants will be able to freely choose the courses they wish to attend as well as to move from one to another. All lectures will be videorecorded. Participants will be able to watch them again for 45 days after the event. An open session will give participants the opportunity to present their own work in progress in 5 minutes. Also companies will be able to present their technical developments for 10 minutes. The school will include a hackathon, where participants will be able to work in teams to tackle several machine learning challenges. Full live online participation will be possible. The organizers highlight, however, the importance of face to face interaction and networking in this kind of research training event. KEYNOTE SPEAKERS: Yonina Eldar (Weizmann institute of Science), Model Based Deep Learning: Applications to Imaging and Communications Manuela Veloso (JPMorganChase), The Journey of Humans and AI: Insights from AI in Robotics and AI in Finance PROFESSORS AND COURSES: Pierre Baldi (University of California Irvine), [intermediate/advanced] From Deep Learning and Transformers to AI Risks and Safety Sean Benson (Amsterdam University Medical Center), [intermediate] Digital Twins and Generative AI for Personalised Medicine Xavier Bresson (National University of Singapore), [intermediate/advanced] Graph Transformers, Graph Generative Models and Large Language Models Nello Cristianini (University of Bath), [introductory] Machina Sapiens - Towards More General Forms of AI Mark Derdzinski (Dexcom), [introductory] From Prototype to Production: Evaluation Strategies for Agentic Applications Samira Ebrahimi Kahou (University of Calgary), [intermediate/advanced] Explainability in Machine Learning Elena Giusarma (Michigan Technological University), [introductory/intermediate] Machine Learning at the Frontier of Astrophysics: Simulating the Universe Shih-Chieh Hsu (University of Washington), [intermediate/advanced] Real-Time Artificial Intelligence for Science and Engineering Xia 'Ben' Hu (Rice University), [introductory/advanced] Efficient LLM Serving: Algorithms and Systems Lu Jiang (ByteDance & Carnegie Mellon University), [introductory/intermediate] Transformers for Image and Video Generation: Fundamentals, Design, and Innovations Jayashree Kalpathy-Cramer (University of Colorado), [introductory/intermediate] Multimodal AI for Healthcare Yingbin Liang (Ohio State University), [intermediate/advanced] Theory on Training Dynamics of Transformers Chen Change Loy (Nanyang Technological University), [intermediate/advanced] Harnessing Prior for Content Enhancement and Creation Evan Shelhamer (DeepMind), [intermediate] Test-Time Adaptation for Updating Models on New and Different Data Atlas Wang (University of Texas Austin), [intermediate] Low Rank Strikes Back in the Era of Large Language Models Xiang Wang (University of Science and Technology of China), [advanced] Large Language Models for User Behavior Modeling: Cross-Modal Interpretation, Preference Optimization, and Agentic Simulation Cao (Danica) Xiao (GE HealthCare), [introductory/intermediate] Transforming Healthcare and Drug Development through Multimodal AI with LLMs and Generative AI Technologies Rex Ying (Yale University), [intermediate/advanced] Multimodal Foundation Models for Graph-Structured Data: Framework and Scientific Applications OPEN SESSION: An open session will collect 5-minute voluntary oral presentations of work in progress by participants. They should submit a half-page abstract containing the title, authors, and summary of the research to david@irdta.eu by July 13, 2025. INDUSTRIAL SESSION: A session will be devoted to 10-minute demonstrations of practical applications of deep learning in industry. Companies interested in contributing are welcome to submit a 1-page abstract containing the program of the demonstration and the logistics needed. People in charge of the demonstration must register for the event. Expressions of interest have to be submitted to david@irdta.eu by July 13, 2025. HACKATHON: A hackathon will take place, where participants can work in teams to tackle several machine learning challenges. They will be coordinated by Professor Sergei V. Gleyzer (University of Alabama). The challenges will be released 2 weeks before the beginning of the school. A jury will judge the submissions and the winners of each challenge will be announced by August 25, 2025. The winning teams will receive a modest monetary prize and the runners-up will get a certificate. SPONSORS: Companies/institutions/organizations willing to be sponsors of the event can download the sponsorship leaflet from https://deeplearn.irdta.eu/2025/sponsors/ ORGANIZING COMMITTEE: Sergei V. Gleyzer (Tuscaloosa, hackathon chair) José Paulo Marques dos Santos (Maia, local chair) Carlos Martín-Vide (Tarragona, program chair) Sara Morales (Brussels) José Luís Reis (Maia) Luís Paulo Reis (Porto) David Silva (London, organization chair) REGISTRATION: It has to be done at https://deeplearn.irdta.eu/2025/registration/ The selection of 6 courses requested in the registration template is only tentative and non-binding. For logistical reasons, it will be helpful to have an estimation of the respective demand for each course. Since the capacity of the venue is limited, registration requests will be processed on a first come first served basis. The registration period will be closed and the on-line registration tool disabled when the capacity of the venue will have got exhausted. It is highly recommended to register prior to the event. FEES: Fees comprise access to all program activities and lunches. There are several early registration deadlines. Fees depend on the registration deadline. The fees for on site and for online participation are the same. ACCOMMODATION: Accommodation suggestions will be available in due time at https://deeplearn.irdta.eu/2025/accommodation/ CERTIFICATE: A certificate of successful participation in the event will be delivered indicating the number of hours of academic activities. This should be sufficient for those participants who plan to request ECTS recognition from their home university. QUESTIONS AND FURTHER INFORMATION: david@irdta.eu ACKNOWLEDGMENTS: Universidade da Maia Universidade do Porto Universitat Rovira i Virgili Institute for Research Development, Training and Advice – IRDTA, Brussels/London
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-5 | (2025-08) Registration at the 2025 editions of the Speech Synthesis Workshop (SSW) and The Blizzard ChallengeDear colleagues, while we are finalising the program, the registration for the 13th edition of the Speech Synthesis Workshop is now open with all the information available on the official SSW website at the registration page: <https://blogs.helsinki.fi/ssw13-2025/registration-2/> There are two ways to register: - if you *register for Interspeech*, please use the Interspeech registration portal: <https://www.aanmelder.nl/164976/registration>. The payment will then be made using their platform - if you **do not register for Interspeech**, you can register by using the dedicated form: <https://forms.gle/Hn4NqmiJP8ptWCgh8>. The payment will be made by bank transfer Due to circumstances beyond our control, there is no ISCA discount this year. The registration fees are: - *regular* 500€ - *student* 300€ Registration for the 2025 Blizzard Challenge is also open. The workshop attendance is free, but the space is limited. Priority is given to the participants. Therefore, if you want to secure your place, please register using the following form: <https://forms.gle/vqtRSRuz1wEUYEKv5> We are looking forward to welcoming you to Leeuwarden, The SSW and the Blizzard Challenge Organising committees -- Sébastien Le Maguer - Department of Digital Humanities - University of Helsinki website: <http://seblemaguer.github.io/>
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-6 | (2025-08-17) Cf abstracts “Young Female Researchers in Speech Workshop” (YFRSW) @ Interspeech 2025, Rotterdam, The Netherlands Call for Abstracts
This is a Call for Abstracts for the “Young Female Researchers in Speech Workshop” (YFRSW) to be held right before Interspeech 2025 in the Netherlands. The purpose of this workshop is to encourage female* students to pursue a career in speech research and technology. This is a great opportunity to network with and get feedback from senior researchers in the field. If you are (or someone you know is) a female* Bachelor’s or Master’s student in speech science and/or technology, please consider submitting (or encourage others to submit) an abstract by May 2nd, 2025! Participants of YFRSW will be offered partial financial support towards their travel and Interspeech 2025 registration expenses. All details on the Call for Abstracts can be found here: https://sites.google.com/view/yfrsw-2025/abstract-submission
If you have any questions, please get in touch with the YFRSW organization committee!
Sincerely, YFRSW Organizing Committee youngfemaleresearchersinspeech@gmail.com
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-7 | (2025-08-18) The 2nd International Generative AI and Computational Language Modelling Conference (GACLM 2025), Valencia, Spain The 2nd International Generative AI and Computational Language Modelling Conference (GACLM 2025)
18-21 August, 2025 | Valencia, Spain
Hybrid Event
Technically Co-Sponsored by IEEE Spain Section GACLM 2025 CFP: With the emergence of Generative AI (GenAI), the world is experiencing a new era of generative models producing various types of data, including textual and visual data. This is accompanied by the unprecedented advancement of Computational Language Modeling (CLM) techniques that are supporting a wide range of downstream applications. Both Generative AI and Computational Language Modeling are leading the paradigm shift we are witnessing in Artificial Intelligence. The International Generative AI and Computational Language Modeling Conference (GACLM 2025) addresses the architectures, applications, challenges, approaches, and future directions of this new era of Artificial Intelligence systems based on GenAI and CLM. We invite the submission of original papers on all topics related to GACLM, with special interest in, but not limited to, the following:
Submissions Guidelines and Proceedings Manuscripts should be prepared in 10-point font using the IEEE 8.5' x 11' two-column format. All papers should be in PDF format, and submitted electronically at Paper Submission Link. A full paper can be up to 8 pages (including all figures, tables and references). Submitted papers must present original unpublished research that is not currently under review for any other conference or journal. Papers not following these guidelines may be rejected without review. Also submissions received after the due date, exceeding length limit, or not appropriately structured may also not be considered. Authors may contact the Program Chair for further information or clarification. All submissions are peer-reviewed by at least three reviewers. Accepted papers will appear in the GACLM Proceeding, and be published by the IEEE Computer Society Conference Publishing Services and be submitted to IEEE Xplore for inclusion. Important Dates:
Contact:
Please send any inquiry on GACLM to: info@gaclm.org
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-8 | (2025-08-24) 13th ISCA Speech Synthesis Workshop (SSW13) , Leeuwarden, The Netherlands (updated)This is the second call for papers for the 13th ISCA Speech Synthesis Workshop (SSW13). Submission website: <https://cmt3.research.microsoft.com/SSW2025/Track/1/Submission/Create> Submission deadline: April 1, 2025 - initial submission; April 14, 2025 - paper update = General information = 13th ISCA Speech Synthesis Workshop (SSW13) - 24-26 August 2025 in Leeuwarden, NL - <https://blogs.helsinki.fi/ssw13-2025/> We are delighted to announce the 13th edition of the Speech Synthesis Workshop (SSW) which will take place in Leeuwarden (the Netherlands), from Sunday the 24th till Wednesday the 26th of August 2025. The workshop is a satellite event of the Interspeech 2025 conference (held in Rotterdam, the Netherlands). The SSW is the main meeting place for research and innovation in speech synthesis, i.e. predicting speech signals from text input. Text-to-Speech (TTS) technology is a key component of numerous applications: speech-to-speech translation, digital assistants, conversational agents, social robots. While early research focused on basic intelligibility, contemporary systems now achieve remarkable naturalness. Current research frontiers include emotional expression, speaking style control, and efficient deployment for the world's languages. **Theme: Scaling down: Sustainable synthesis for language diversity** SSW13 focuses on making speech synthesis more accessible for the world's languages. We encourage submissions addressing: - Data-efficient methods for low-resource languages - Computationally sustainable approaches - Cross-lingual transfer learning - Language-specific challenges in TTS = Key Information = - Paper Format: Up to 6 pages including references using the Interspeech 2025 template - Review Process: Double-blind peer review - Presentation format: Oral and poster sessions - Virtual Participation: None, in-person only = Important Dates = - **April 6, 2025: Updated submission deadline (title/authors)** - April 20, 2025: Updated full paper deadline - June 21, 2025: Notification of acceptance - July 4, 2025: Camera-ready deadline - August 17-22, 2025: Interspeech (Rotterdam) - August 24-26, 2025: SSW13 (Leeuwarden) = Topics = SSW welcomes contributions not only in the core TTS technology but also includes researchers from related science -- from phoneticians, phonologists, and neuroscientists to experts of multimodal human-machine interaction. First, we are delighted to already **announce two keynote speakers**: - **Alistair Conkie** from Apple - **Anna-Mari Wallenberg** from the University of Helsinki
Core tech - End-to-end text-to-speech synthesis - Direct waveform generation - Voice conversion and modification - Multilingual/cross-lingual synthesis - Low-resource TTS methods Linguistic aspects - Text normalization and preprocessing - Prosody modeling - Expression and emotion - Natural language generation for TTS - G2P conversion Applications & eval - Speech synthesis for accessibility - Embedded/edge deployment - Quality assessment metrics - Privacy and security - Ethical considerations Special applications - Singing synthesis - Non-human vocalization - Talking faces/avatars - Clinical applications We look forward to welcoming you in Leeuwarden! The SSW13 Organising committee
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-9 | (2025-08-25) CfP The 26th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL), Avignon, France SIGDIAL 2025: Second Call for Papers (**submission deadline is April 21**)
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-10 | (2025-08-25) CfP TSD-2025, Erlangen-Nürnberg, Germany************************************************************************** TSD 2025 - SECOND CALL FOR PAPERS ************************************************************************** The twenty-eighth International Conference on Text, Speech and Dialogue (TSD 2025) Erlangen-Nuernberg, Germany, 25-28 August 2025 http://www.tsdconference.org/ TSD 2025 will be organized by the Faculty of Applied Sciences, University of West Bohemia, and the Faculty of Informatics, Masaryk University, in cooperation with Friedrich-Alexander-Universität Erlangen-Nürnberg. This event continues the tradition of the TSD series, which started in 1998 and has become a prime forum for interaction between researchers in computer processing of both spoken and written language from all over the world. IMPORTANT DATES * >>> 30 May 2025 <<< ..... Deadline for submission of contributions * 20 June 2025 ............ Notification of acceptance or rejection * 27 June 2025 ............ Deadline for submission of camera-ready papers * 25-28 August 2025 ....... Conference dates For the review, a full paper must be submitted by the above deadline. CONFERENCE FEES The Early Bird registration fees are as follows: Including Event Full Registration Student Registration Conference Fee EUR 360 EUR 280 w/o Conference Trip Full Conference Fee EUR 400 EUR 310 Please note that the Early Bird fee is available by June 27, 2025. After this date, regular registration fees will apply, which are higher. CONFIRMED GUEST SPEAKERS * Bernd Möbius (Phonetics, Prosody, Surprisal, Speech Modeling) * Heidi Christesen (Disordered Speech, ASR, Accessibility) * Others in negotiation... TSD SERIES Proceedings of TSD form a book published by Springer in their Lecture Notes in Artificial Intelligence (LNAI) series. The TSD proceedings are regularly indexed by major citation databases such as Thomson Reuters Conference Proceedings Citation Index, DBLP, SCOPUS, EI, INSPEC, or COMPENDEX. TOPICS Topics include (but are not limited to): * Speech Recognition (multilingual, continuous, emotional speech, new acoustic/language models) * Corpora and Language Resources (large language models, text and spoken corpora, lexicons, disambiguation) * Speech and Spoken Language Generation (multilingual speech synthesis, expressive speech) * Tagging, Classification, and Parsing (sentiment analysis, credibility analysis, summarization) * Semantic Processing (information extraction, retrieval, data mining, ontologies) * Applications of Text and Speech Processing (machine translation, question-answering, assistive tech) * Automatic Dialogue Systems (multilingual, self-learning, question-answering) * Multimodal Techniques (visual speech synthesis, emotion and personality modeling) PROGRAMME COMMITTEE Elmar Nöth, Friedrich-Alexander-Universität Erlangen-Nuernberg, Germany (General Chairman) Rodrigo Agerri, University of the Basque Country, Spain Vladimír Benko, Slovak Academy of Sciences, Slovakia Archna Bhatia, Carnegie Mellon University, United States Jan Černocký, Brno University of Technology, Czechia Simon Dobrišek, University of Ljubljana, Slovenia Kamil Ekštein, University of West Bohemia, Czechia Karina Evgrafova, Saint-Petersburg State University, Russia Yevhen Fedorov, Cherkasy State Technological University, Ukraine Volker Fischer, EML Speech Technology GmbH, Germany Darja Fišer, Institute of Contemporary History, Slovenia Lucie Flek, Philipps-Universität Marburg, Germany Björn Gambäck, Norwegian University of Science and Technology, Norway Radovan Garabík, Slovak Academy of Sciences, Slovakia Alexander Gelbukh, Instituto Politécnico Nacional, Mexico Louise Guthrie, University of Texas at El Paso, United States Jan Hajič, Charles University, Czechia Eva Hajičová, Charles University, Czechia Yannis Haralambous, IMT Atlantique, France Hynek Hermansky, Johns Hopkins University, United States Daniel Hládek, Technical University of Košice, Slovakia Aleš Horák, Masaryk University, Czechia Eduard Hovy, Carnegie Mellon University, United States Maria Khokhlova, Saint Petersburg State University, Russia Aidar Khusainov, Tatarstan Academy of Sciences, Russia Daniil Kocharov, Saint Petersburg State University, Russia Miloslav Konopík, University of West Bohemia, Czechia Valia Kordoni, Humboldt University of Berlin, Germany Evgeny Kotelnikov, Vyatka State University, Russia Pavel Král, University of West Bohemia, Czechia Siegfried Kunzmann, Amazon Alexa Machine Learning, United States Nikola Ljubešić, Jožef Stefan Institute, Croatia Oier Lopez de Lacalle, Universtity of the Basque Country, Spain Natalija Loukachevitch, Lomonosov Moscow State University, Russia Bernardo Magnini, Fondazione Bruno Kessler, Italy David Mareček, Charles University, Czechia Václav Matoušek, University of West Bohemia, Czechia Roman Mouček, University of West Bohemia, Czechia Daša Munková, Constantine the Philosopher University in Nitra, Slovakia Agnieszka Mykowiecka, Polish Academy of Sciences, Poland Hermann Ney, RWTH Aachen University, Germany Joakim Nivre, Uppsala University, Sweden Juan Rafael Orozco-Arroyave, University of Antioquia, Colombia Maciej Piasecki, Wroclaw University of Science and Technology, Poland Josef Psutka, University of West Bohemia, Czechia James Pustejovsky, Brandeis University, United States German Rigau, University of the Basque Country, Spain Paolo Rosso, Universitat Politecnica de Valencia, Spain Leon Rothkrantz, Delft University of Technology, Netherlands Anna Rumshisky, University of Massachusetts Lowell, United States Milan Rusko, Slovak Academy of Sciences, Slovakia Pavel Rychlý, Masaryk University, Czechia Mykola Sazhok, International Research and Training Center for Information Technologies and Systems, Ukraine Pavel Skrelin, Saint Petersburg State University, Russia Pavel Smrž, Brno University of Technology, Czechia Petr Sojka, Masaryk University, Czechia Ján Staš, Technical University of Košice, Slovakia Georg Stemmer, Intel Corp., Germany Marko Robnik Šikonja, University of Ljubljana, Slovenia Marko Tadić, University of Zagreb, Croatia Jan Trmal, Johns Hopkins University, Czechia Tamas Varadi, Hungarian Academy of Sciences, Hungary Zygmunt Vetulani, Adam Mickiewicz University, Poland Aleksander Wawer, Polish Academy of Sciences, Poland Pascal Wiggers, Amsterdam University of Applied Sciences, Netherlands Alina Wróblewska, Polish Academy of Sciences, Poland Jerneja Žganec Gros, Alpineon, Slovenia FORMAT OF THE CONFERENCE The conference program will include: * Invited Papers * Oral Presentations * Poster/Demonstration Sessions Papers will be presented in topic-oriented sessions. The official language of TSD 2025 is English. However, papers dealing with text and speech processing in linguistic environments other than English are strongly encouraged (as long as they are written in English). The conference is planned as an on-site event. The conference will offer a rich social programme. SUBMISSION OF PAPERS Authors are invited to submit full papers of up to 12 pages (including references) in the LNCS format. Authors are also encouraged to present practical demonstrations of software, projects, or interesting material relevant to the conference topics. Demonstration abstracts of up to one page will not appear in the proceedings. Papers must not be under review by any other conference or publication during the TSD review cycle, and they must not be previously published or accepted for publication elsewhere. VENUE Erlangen-Nuernberg is home to Friedrich-Alexander-Universitat Erlangen-Nuernberg. The city offers a rich cultural heritage and convenient transport links within Germany and throughout Europe. CONTACT INFORMATION All correspondence related to the conference, including paper submissions and general inquiries, should be directed to: TSD 2025 Organizing Committee E-mail: tsd2025@tsdconference.org We look forward to your submissions and to seeing you in Erlangen-Nürnberg for TSD 2025
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-11 | (2025-08-25) SIGDIAL 2025 Conference, Avignon, France *** Call for Participation ***
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-12 | (2025-08-27) Blizzard Challenge, Leeuwarden, The Netherlands (updated) We have now released updated version of the Bildts dataset and the lexicon. Due to unforeseen circumstances, the release of the lexicon was delayed, so we have updated the schedule of the challenge which is now[1].
The lexicon is now available at <https://www.cstr.ed.ac.uk/projects/blizzard/2025/> and the dataset is available on zenodo: <https://zenodo.org/records/14995211> Finally, as the challenge moves to its next stage, all discussion will now happen on the blizzard discuss group (<https://groups.google.com/g/blizzard-challenge-discuss/> ). Kind regards, Sébastien Le Maguer for the Blizzard Challenge organising committee [1] *Mar 21 2025 – team registration closes* Apr 04 2025 – test sentences released to participants *Apr 11 2025 – deadline for participants to submit synthetic speech (23:59 AoE)* Apr 14 2025 – last date for payment of the entry fee (more details will be sent soon) Apr 30 2025 – evaluation systems go live Jun 20 2025 – end of the evaluation period Jun 27 2025 – release of results ***************************************************************
We are delighted to announce the 2025 edition of the Blizzard Challenge, which will be co-located with the Speech Synthesis Workshop 2025 (SSW13).
The theme of this edition of SSW is 'Scaling down: sustainable synthesis for language diversity'. In line with this theme, the challenge focuses on synthesizing speech for Bildts, a unique language variety from the Netherlands.
The information about the challenge is available (and will be updated) here: https://blogs.helsinki.fi/ssw13-2025/the-blizzard-challenge-2025/
About Bildts
Bildts (Indo-European > West Germanic) is spoken in Het Bildt, a region in the Dutch province of Friesland. With approximately 10,000 first and second language speakers, it represents a vibrant example of European linguistic diversity. The language variety has been systematically through grammatical descriptions, dictionaries with pronunciation information, literary works, media productions (weekly radio broadcasts, theater performances), and regular newspaper columns.
For the Blizzard Challenge 2025, we have curated a dataset of high-quality audio recordings with corresponding linguistic resources. This choice of Bildts aligns with our theme of 'sustainable synthesis for language diversity' - it presents participants with the real-world challenge of developing synthesis capabilities for a well-documented but data-limited language variety, representative of the thousands of smaller languages that could benefit from speech technology. The challenge is composed of two tasks:
Data download
The training data is available here: https://doi.org/10.5281/zenodo.14792457
A pronunciation dictionary for Bildts will be made available to participants in February 2025. This will include phonetic transcriptions that can be used for training text-to-speech systems. Participants will be notified when this resource becomes available.
Timeline
This is a preliminary timeline and is subject to change.
Any questions?
Please contact blizzard-challenge-organisers@googlegroups.com if you have any questions.
Please feel free to distribute this announcement.
The Blizzard Challenge 2025 Organising team
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-13 | (2025-08-28) Third Run of the Automatic Minuting Shared Task - AutoMin @ SIGDIAL 2025 Third Run of the Automatic Minuting Shared Task - AutoMin @ SIGDIAL 2025
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-14 | (2025-09-01) Workshop « Lenition and fortition in world's languages: new interdisciplinary insights », Lille, France
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-15 | (2025-09-01) L’école d’été ETAL 2025, Roscoff, France L’école d’été ETAL 2025 aura lieu du 1 au 5 septembre dans le magnifique cadre de Roscoff à la station biologique de Sorbonne Université. La liste des cours et intervenants est encore en cours, mais vous pourrez trouver un aperçu ci-dessous (cours en français et supports en anglais) :
Nous ouvrons les pré-inscriptions pour participer à l'école ETAL 2025. Le nombre de places est limité, une pré-inscription est mise en place via le lien suivant : https://framaforms.org/pre-inscription-ecole-dete-etal-2025-roscoff-1746001772 Plus d'informations sur le site web : https://etal2025.sciencesconf.org/ --
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-16 | (2025-09-06) Labs @CLEF2025, Madrid, Spain
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-17 | (2025-09-08) EUSIPCO 2025, Palermo, Italy
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-18 | (2025-09-10) 7th Workshop on the history of speech communication research (HSCR), Paris, France Ce message pour vous annoncer que la deadline pour le prochain workshop History of Speech Communication Research (Paris les 10-12 septembre) est prolongée jusqu'au 22 mai. Cf: https://iwhscr2025.sciencesconf.org/?lang=en
This is the last call for paper for the 7th Workshop on the history of speech communication research (HSCR), which will take place in Paris, France from 10th to 12th September 2025. A special focus of this edition will be on geographical and disciplinary exchanges within and beyond the field of speech research. Advances in our field have, from its beginning, been stimulated by intensive exchanges within the community through international exchanges between scholars and students, the development of international societies, congresses and journals, but also through fruitful transfers of knowledge, approaches, technology and instrumentation between disciplines. Contributions must not be longer than 10 pages and follow the guidelines provided. The papers will be published in the book series Studientexte zur Sprachkommunikation at TUDpress (Technical University Dresden) and also stored electronically in the ISCA archive. We are also pleased to announce a joined keynote by Jacques Durand (CLLE, Université Toulouse Jean-Jaurès) & Chantal Lyche (University of Oslo) on “Early transcriptions of English and French in the International Phonetic Association: the weight of spelling reform on the birth of the IPA” and a keynote by Pavel Šturm (Charles University in Prague) on “Transfer of knowledge across spaces and disciplines: The case of Czech phonetics” contact: hscr2025@gmail.com
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-19 | (2025-09-18) Approches phonétiques des langues sous-dotées, Aubervilliers, Paris, France (updated) Nous avons le plaisir de vous informer que la date limite de soumission des résumés pour la conférence Phonetic Approaches to Under-Documented Languages (avec un accent particulier sur les langues tibéto-birmanes), qui se tiendra les 18 et 19 septembre 2025 au Campus Condorcet à Aubervilliers (Paris), a été prolongée.
Approches phonétiques des langues sous-dotées (Avec un accent particulier sur les langues tibéto-birmanes) (Phon-UDL) 18–19 septembre 2025 | Campus Condorcet, Aubervilliers, Paris Organisé par le Centre de Recherches Linguistiques sur l’Asie Orientale (CRLAO, CNRS) et le Laboratoire de Phonétique et Phonologie (LPP, CNRS)
À travers ce colloque, nous souhaitons explorer le potentiel des méthodes phonétiques instrumentales pour renforcer la documentation des langues en danger, ainsi que réfléchir au rôle de la phonétique dans la classification des langues sous-dotées. Les langues sous-dotées — telles que de nombreuses langues tibéto-birmanes — offrent un potentiel considérable pour faire progresser notre compréhension de la typologie phonologique et des processus de changement phonétique. Toutefois, la pleine exploitation de ce potentiel suppose souvent l’intégration de la phonétique instrumentale et expérimentale aux méthodologies plus traditionnelles de la linguistique descriptive et documentaire. Des approches analytiques telles que l’analyse acoustique des phonèmes ou des systèmes tonals, les descriptions articulatoires, ou encore les études aérodynamiques constituent des outils précieux complémentaires. Ces méthodes permettent d’enrichir les descriptions synchroniques des langues sous-dotées, tout en jetant les bases pour l’identification de correspondances phonétiques entre variétés apparentées, et pour affiner les typologies phonologiques. L’association de la phonétique descriptive aux méthodes instrumentales modernes permet d’aboutir à des descriptions plus précises et approfondies des systèmes phonologiques. Ce colloque vise à examiner les apports de la phonétique instrumentale et expérimentale à l’étude et à la classification des langues sous-dotées, en invitant des communications autour des thématiques suivantes : - Le rôle de la phonétique descriptive dans la documentation et l’analyse des langues peu dotées en ressources ; - Les corrélations entre les aspects articulatoires et acoustiques dans les langues sous-dotées ; - L’interaction entre l’articulation et la perception dans la structuration des systèmes phonologiques ; - Applications de techniques instrumentales telles que l’électropalatographie, l’imagerie par ultrasons dans le travail de terrain en phonétique ; - La place des langues à tons et des systèmes prosodiques complexes dans les typologies phonologiques ; - Les études sur la qualité de la voix et la variation intra- et interlocuteurs dans la production des phonèmes ; - Les apports aux bases de données phonologiques visant à enrichir les typologies existantes.
Nous encourageons les contributions abordant une ou plusieurs de ces thématiques. Les propositions portant sur des langues tibéto-birmanes peu documentées sont particulièrement encouragées.
Modalités de soumission Les résumés ne doivent pas dépasser 1 000 mots, références, tableaux et figures compris. - Date limite de soumission des résumés : 1er juin 2025 - Notification des décisions : 10 juillet 2025 - Dates du colloque : 18–19 septembre 2025, à Paris
Les propositions de communication doivent être soumises via le site de la conférence : https://phon-udl.sciencesconf.org
Pour toute question ou difficulté, vous pouvez nous contacter à l’adresse suivante :
La conférence se déroulera en mode hybride, permettant aux participants de présenter leurs communications soit en présentiel, soit à distance. Nous sommes impatients de recevoir des contributions de collègues, qu’ils soient proches ou éloignés.
Nous vous remercions pour votre attention et nous réjouissons de découvrir vos contributions !
Le comité d’organisation
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-20 | (2025-09-24) First Workshop on Semantics for Interdisciplinary Research SIR@IXCS2025 - Düsseldorf - Germany (updated)=== Workshop SIR === First Workshop on Semantics for Interdisciplinary Research SIR@IXCS2025 - Düsseldorf - September 24 2025 ================================= https://team.inria.fr/semagramme/first-workshop-on-semantics-for-interdisciplinary-research/ https://openreview.net/group?id=inria.fr/INRIA/S%C3%A9magramme/2025/SIR01 ================================= In recent years, Natural Language Processing (NLP) has increasingly intersected with the humanities and social sciences, offering new methodologies for analyzing textual data, interpreting meaning, and modelling language-based phenomena. The potential for multi-disciplinary research using NLP methods is particularly great in computational semantics (CS), as its ability to process and represent meaning opens up innovative pathways for researchers in history, philosophy, literary studies, political science, etc. This workshop aims to explore how semantic models and tools can be leveraged to tackle traditional and emerging questions in the Humanities in a broader sense (Social Sciences, Law, Economics, Management, Literature, Languages, Art, …).
A major theme of SIR is the role of semantics in NLP applied to the humanities (both statistical and symbolic approaches).
=== Topics to Explore ===
• CS and the humanities: issues, tools and applications
• Quantitative and qualitative approaches as a breakthrough in the Humanities
• NLP transforming humanities issues
• Contributions and limitations for understanding meaning
• Links between formal semantics and neural models
• Ambiguity, polyphony and interpretation in the Humanities
• Ethics and bias in semantic modelling
• Interdisciplinary dialogue between AI, NLP and Humanities
=== Dates ===
• Deadline : July 21th (anywhere on earth) (previously July 14th)
• Notification : August 25th (anywhere on earth)
• Camera Ready : September 10th (anywhere on earth)
• Workshop : September 24th (anywhere on earth)
=== Submission Information ===
Papers should describe original research and must not exceed 4 pages (with an extra page in the camera ready version for
accepted papers). Papers should be submitted no later than 14 July 2025 (anywhere on earth). Accepted papers will be published in the conference proceedings in the ACL Anthology. For inclusion in the proceedings, at least one author must register to the conference and present the paper in person. Submissions should be fully anonymous to ensure double-blind reviewing. === Submission === https://openreview.net/group?id=inria.fr/INRIA/S%C3%A9magramme/2025/SIR01 === Style Files === The workshop follow the IWCS 2025 template see the workshop web page. === Organizers === Maxime Amblard, Université de Lorraine Ellen Breitholtz, Gothenburg University === Contact === maxime.amblard@univ-lorraine.fr and ellen.breitholtz@ling.gu.se
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-21 | (2025-09-?) 8th International Workshop on Multimedia Content Analysis in Sports, Dublin, Ireland Call for Papers ------------------- 8th International Workshop on Multimedia Content Analysis in Sports (MMSports'25) @ ACM Multimedia, Oct 27 – Oct 31, 2025, Dublin, Ireland
We'd like to invite you to submit your paper for the 8th International Workshop on Multimedia Content Analysis in Sports to be held in Dublin, Ireland together with ACM Multimedia 2025. The ambition of this workshop is to bring together researchers and practitioners from many different disciplines to share ideas and methods on current multimedia/multimodal content analysis research in sports. We welcome multimodal-based research contributions as well as best-practice contributions focusing on the following (and similar, but not limited to) topics: - annotation and indexing in sports - tracking people/ athlete and objects in sports - activity recognition, classification, and evaluation in sports - 3D scene and motion reconstruction in sports - event detection and indexing in sports - performance assessment in sports - injury analysis and prevention in sports - data driven analysis in sports - graphical augmentation and visualization in sports - automated training assistance in sports - camera pose and motion tracking in sports - brave new ideas / extraordinary multimodal solutions in sports - personal virtual (home) trainers/coaches in sports - datasets in sports - graphical effects in sports - alternative sensing in sports (beyond the visible spectrum) - multimodal perception in sports - exploiting physical knowledge in learning systems for sports - sports knowledge discovery - narrative generation and narrative analysis in sports - mobile sports application - multimedia in sports beyond video, including 3D data and sensor data
Submissions can be of varying length from 6 to 8 pages, plus up to two pages for the references. There is no distinction between long and short papers, but the authors may themselves decide on the appropriate length of their paper. All papers will undergo the same review process with the same review period.
Please refer to the workshop website for further information: http://mmsports.multimedia-computing.de/mmsports2025/index.html
IMPORTANT DATES Submission Due: 16 July 2025 Acceptance Notification: 24 July 2025 Camera Ready Submission: 26 August 2025 Workshop Date: TBA; either Oct 27th or Oct 28th, 2025
Challenges -------------- Once again, MMSports is running a competition that challenges participants to solve a cutting-edge problem applied to real-world sport-specific data. This year’s challenge is 'SoccerTrack Challenge 2025: Tracking and Identifying Soccer Players in Fixed Viewpoint Video“. It is a competition designed to advance the tracking and identification of soccer players in fixed viewpoint video footage. Participants will be provided with a dataset of match footage annotated with bounding boxes and player IDs for training. During the test phase and the final challenge phase, participants will be given unseen match footage in which they must perform player identification and tracking and submit their results. The ranking will be based on the performance of the tracking models on this unseen data. Also the 1st, 2nd and 3rd place will also be awarded with 150,000 JPY, 60,000 JPY and 30,000 JPY, respectively. More information on the challenges can be found at http://mmsports.multimedia-computing.de/mmsports2025/challenge.html.
ACM MMSports’25 Chairs: Thomas Moeslund, Rainer Lienhart and Hideo Saito
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-22 | (2025-10-08) CfP SPECOM 2025, Szeged, Hungary (updated) ******************************************************* SPECOM-2025 – SECOND CALL FOR PAPERS *******************************************************
27th International Conference on Speech and Computer (SPECOM-2025) October 13-15, 2025 Szeged, Hungary Web: https://specom.inf.u-szeged.hu
ORGANIZER SPECOM-2025 conference is organized by the University of Szeged.
CONFERENCE TOPICS SPECOM attracts researchers, linguists and engineers working in the following areas of speech science, speech technology, natural language processing, human-computer interaction:
OFFICIAL LANGUAGE The official language of the event is English. However, papers on processing languages other than English are strongly encouraged.
FORMAT OF THE CONFERENCE The conference program will include presentation of invited talks, oral sessions, and poster/demonstration sessions.
SUBMISSION OF FULL PAPERS Authors are invited to submit full papers of 10-15 pages formatted in the Springer LNCS style. Each paper will be reviewed by at least three independent reviewers (single-blind), and accepted papers will be presented either orally or as posters. Papers submitted to SPECOM must not be under review by any other conference or publication during the SPECOM review cycle, and must not be previously published or accepted for publication elsewhere. Authors should submit their papers using the on-line submission system: https://easychair.org/conferences/?conf=specom2025
DEADLINES June 30, 2025 ………. Submission of full papers (extended!) July 31, 2025 ........... Notification of acceptance/rejection August 10, 2025 ...... Camera-ready papers August 15, 2025 ...... Early registration
PROCEEDINGS SPECOM Proceedings will be published by Springer Nature as books in the Lecture Notes in Artificial Intelligence (LNCS / LNAI, Scopus Q2) series listed in all major international citation databases.
GENERAL CHAIRS Gábor Gosztolya – University of Szeged, Hungary Alexey Karpov – SPC RAS, St. Petersburg, Russia
CONTACTS All correspondence regarding the conference should be addressed to SPECOM-2025 Secretariat E-mail: specom@inf.u-szeged.hu Web: https://specom.inf.u-szeged.hu
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-23 | (2025-10-19) The 13th conference on Speech Technology and Human-Computer Dialogue, Cluj-Napoca, RomaniaSpeD 2025 – Welcome messageThe “SpeD 2025” Organizing Committee warmly invites you to attend the 13th Conference on Speech Technology and Human-Computer Dialogue, in Cluj-Napoca, Romania. The conference will be held in-person at the Technical University of Cluj-Napoca. The conference will bring together scientists, developers, and professionals to present their work, meet colleagues, discuss new ideas, and build collaboration between university, research center, and commercial sector research groups. The technical program will include oral sessions, keynotes by renowned speakers, and demonstrations of latest research on a wide range of topics positioned at the forefront of science and engineering in speech technology and human-computer dialogue. The past editions of the “SpeD” conference series were sponsored by IEEE and EURASIP (technical sponsors), the proceedings being indexed by the IEEE Xplore® Digital Library, Scopus, and the Web of Science Conference Proceedings Citation Index (the WoS indexing process has not been finalized for the previous 2023 edition). This year, papers accepted and presented during the conference will also be submitted for inclusion into IEEE Xplore, subject to meeting IEEE Xplore’s scope and quality requirements, and for indexing in Web of Science.
Joint eventThis year, the Language Data Space (LDS) Workshop organised by the Research Institute for Artificial Intelligence “Mihai Drăgănescu”, Romanian Academy, will be co-located with “SpeD”. Main Topics
Additional Topics in NLP and Multimodal Processing
Schedule (provisional)
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-24 | (2025-10-22) 21st International Conference on Content-based Multimedia Indexing, CBMI 2025, Dublin, Ireland ================================================================== 21st International Conference on Content-based Multimedia Indexing, CBMI 2025, Dublin, Ireland, 22-24 october 2025 ==================================================================
Important Dates (2nd submission round)Regular, Special Session Papers and Demonstration Papers
The second round also invites submissions rejected at ACM MM for a fast track, for details see https://www.cbmi2025.org/guidelines/overview/ Call for Regular PapersCBMI aims at bringing together the various communities involved in all aspects of content-based multimedia indexing for retrieval, browsing, management, visualization and analytics. The organisers of CBMI 2025 call for novel and original research papers that are addressing the various topics of interest related to the conference. We encourage contributions both on theoretical aspects and applications of CBMI in the new era of Artificial Intelligence and foundation/language-backed-backed models for multimedia for multimedia. Authors are invited to submit previously unpublished research papers highlighting significant contributions addressing these topics. Authors can submit full papers (6 pages + up to 2 pages for references) or short papers (4 pages + up to 2 pages references). Submissions to CBMI are peer reviewed in a double blind process and the language of the conference is English. For full details on the submission process see the submission guidelines. Authors of high-quality papers accepted to the conference may be invited to submit extended versions of their contributions to a special journal issue. Call for Special Session PapersThe organisers of CBMI 2025 call for novel and original research papers that are relevant for the following special sessions:
Please seehttps://www.cbmi2025.org/cfp/special-sessions/ for more details.
Call for DemonstrationsCBMI aims at bringing together the various communities involved in all aspects of content-based multimedia indexing for retrieval, browsing, management, visualization and analytics. We invite authors to report on novel and compelling demonstrations in all topic areas of CBMI. Demonstration papers are subject to peer review according to criteria such as novelty, interestingness, applications of or enhancements to state-of-the-art, and potential impact. The length of the papers should be up to 4 pages. An additional 1-2 pages should be appended to the paper that illustrate how the demo will be conducted on-site at CBMI 2025. This additional content will not be published in the conference proceedings, should the submission be accepted. Including a link to a video showing the demo in action is highly encouraged. The submissions are peer-reviewed in a single-blind process. For full details on the submission process see the submission guidelines. Presenters are expected to bring the necessary equipment (computers, etc.) themselves. The conference will provide a table, power outlet, screen, wireless (shared) internet and a poster board. If you have special needs (e.g., more space), please include a related note in your demo submission. Topics of InterestTopics of interest to the CBMI community include (but are not limited to) the following:
Multimedia Content Analysis and Indexing:
Multimedia User Experiences:
Applications of Multimedia Indexing and Retrieval:
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-25 | (2025-10-27) 8th International Workshop on Multimedia Content Analysis in Sports (MMSports'25) @ ACM Multimedia,, Dublin, Ireland Call for Papers
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-26 | (2025-10-27) First Workshop on Multimedia Analytics with Multimodal Large Language Models, Dublin, Ireland ====== MA-LLM Call for Papers ====== First Workshop on Multimedia Analytics with Multimodal Large Language Models Dates: 27/28 October 2025 Location: Dublin, Ireland ================================================================== The First Workshop on Multimedia Analytics with Multimodal Large Language Models at ACM Multimedia 2025 aims to explore the potential and pitfalls of bringing Multimodal Large Language Models into multimedia analytics, and the new forms of interaction between system and experts that emerge from this. To guide this exploration, we invite original research and position papers on (but not limited to) the following topics:
Important Dates:
Authors are invited to submit original full (up to 8 pages) or short (up to 4 pages) papers to be presented at the workshop upon acceptance. Papers rejected or withdrawn from ACM Multimedia 2025 can be resubmitted to this workshop via the Fast Track. Accepted papers will be published in the ACM-MM Workshops proceedings. You can submit your work via OpenReview. All listed authors must have an up-to-dateOpenReview profile. Note that creating a profile without an institutional email may require moderation (up to 2 weeks). Complete submission instructions are available on the website https://ma-llm25.github.io/.
Organizers:
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-27 | (2025-11-05) Rencontre des Jeunes Chercheurs en Parole 2025, Paris, France (extended deadline) ============================================================================
============================================================================
La 11ème édition des Rencontres Jeunes Chercheurs en Parole, organisée par le Groupe des Jeunes Chercheurs en Parole (JCeP) et marrainée par l'Association Francophone de la Communication Parlée (AFCP), se tiendra à Paris, du 5 au 7 novembre 2025. Ces journées s’adressent prioritairement aux jeunes chercheur·se·s travaillant dans les domaines de la parole, offrant un cadre convivial pour présenter leurs travaux, échanger avec la communauté scientifique, et élargir leurs perspectives de recherche. L’évènement est également ouvert, dans la limite des places disponibles, à toute personne intéressée par la recherche sur la parole.
Le programme comprendra des sessions posters, des conférences, des ateliers de formation, ainsi que des visites de plateformes expérimentales dédiées à l’étude de la parole. Une table ronde sur la communication scientifique en tant qu’expert, explorant l’apport de l’expertise en parole à la société, sera également au programme.
Des informations complémentaires seront mises en ligne sur notre site prochainement (à découvrir ici). Restez connectés !
POURQUOI PROPOSER UN POSTER ?
Les RJCP s’adressent aux jeunes chercheur·se·s désireux·ses de partager leurs travaux dans un cadre bienveillant et stimulant. Y présenter un poster est l’opportunité de valoriser vos recherches, quel que soit leur degré d’avancement, d’échanger avec vos pairs et des experts du domaine et de construire un réseau scientifique essentiel pour la suite de votre parcours académique ou professionnel.
THÉMATIQUES
Nous invitons les communications sur les thématiques suivantes (liste non exhaustive):
* Acoustique de la parole
* Acquisition de la parole et du langage
* Analyse, codage et compression de la parole
* Applications à composantes orales (dialogue, indexation, etc)
* Apprentissage d’une langue étrangère
* Communication multimodale
* Dialectologie
* Évaluation, corpus et ressources
* Langues peu dotées
* Modèles de langage
* Parole audio-visuelle
* Pathologies de la parole
* Phonétique et phonologie
* Phonétique clinique
* Production / Perception de la parole
* Prosodie
* Psycholinguistique
* Reconnaissance et compréhension de la parole
* Reconnaissance de la langue
* Reconnaissance du locuteur
* Signaux sociaux, sociophonétique
* Synthèse de la parole
MODALITÉS DE SOUMISSION
Masterant·e·s, doctorant·e·s, post-doctorant·e·s†, industriel·le·s† et jeunes chercheur·se·s en recherche d'emploi† sont invité·e·s à soumettre un résumé de 300 mots maximum présentant leurs travaux à venir, en cours ou récemment achevés, pour les sessions posters.
Les soumissions se font en ligne sur le site des RJCP : https://rjcp2025.sciencesconf.org/submission/submit
Le nombre de pages de références n’est pas restreint.
Les modèles des résumés (LaTeX, Word et LibreOffice) sont disponibles dans la section “Appel à communications” de notre site.
Les résumés doivent être soumis au format PDF.
† Jusqu’à 3 ans après la thèse
DATES IMPORTANTES
INFORMATIONS PRATIQUES
Le Comité d’organisation RJCP pourra prendre en charge l’impression des posters pour les personnes non affiliées à un laboratoire de recherche.
Cette prise en charge se fera au cas par cas. Pour en faire la demande, veuillez nous contacter à jcparole@gmail.com.
Nous espérons vous voir nombreux·es,
Nous remercions nos partenaires actuels :
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-28 | (2025-11-06) The First VoicePrivacy Attacker Challenge, ICASSP 2025, Hyderabad, India The First VoicePrivacy Attacker Challenge
**********************
Dear colleagues,
Registration for The First VoicePrivacy Attacker Challenge is now open!
The First VoicePrivacy Attacker Challenge is supported by ICASSP 2025 as the SP Grand Challenge (https://2025.ieeeicassp.org/sp-grand-challenges/#gc7). It focuses on developing attacker systems against voice anonymization, which will be evaluated against a set of anonymization systems submitted to the VoicePrivacy 2024 Challenge. Training, development, and evaluation datasets are provided along with a baseline attacker system. Participants shall develop their attacker systems in the form of automatic speaker verification systems and submit their scores on the development and evaluation data to the organizers. To do so, they can use any additional training data and models, provided that they are openly available and declared before the specified deadline. The metric for evaluation is equal error rate (EER). Results will be presented at the ICASSP 2025 special session to which 5 selected top-ranked participants will be invited to submit and present their challenge systems.
Please find more information in The First VoicePrivacy Attacker Challenge Evaluation Plan: https://www.voiceprivacychallenge.org/attacker/docs/Attacker_Challenge_Eval_Plan.pdf
Registration: https://t.co/pPEXxHEtP6
Contact: attacker.challenge@inria.fr
The VoicePrivacy Attacker Challenge Organizers: Xiaoxiao Miao - Singapore Institute of Technology, Singapore Natalia Tomashenko - Inria, France Emmanuel Vincent - Inria, France Junichi Yamagishi - NII, Japan
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-29 | (2025-11-25) The 3rd International Conference on Foundation and Large Language Models (FLLM2025), Vienna, Austria The 3rd International Conference on Foundation and Large Language Models (FLLM2025)
https://fllm-conference.org/2025/
25-28 November, 2025 | Vienna, Austria
Hybrid Conference and Technically Co-Sponsored by IEEE Austrian SectionFLLM 2025 CFP: With the emergence of foundation models (FMs) and Large Language Models (LLMs) that are trained on large amounts of data at scale and adaptable to a wide range of downstream applications, Artificial intelligence is experiencing a paradigm revolution. BERT, T5, ChatGPT, GPT-4, Falcon 180B, Codex, DALL-E, Whisper, and CLIP are now the foundation for new applications ranging from computer vision to protein sequence study and from speech recognition to coding. Earlier models had a reputation of starting from scratch with each new challenge. The capacity to experiment with, examine, and comprehend the capabilities and potentials of next-generation FMs is critical to undertaking this research and guiding its path. Nevertheless, these models are currently inaccessible as the resources required to train these models are highly concentrated in industry, and even the assets (data, code) required to replicate their training are frequently not released due to their demand in the real-time industry. At the moment, mostly large tech companies such as OpenAI, Google, Facebook, and Baidu can afford to construct FMs and LLMS. Despite the expected widely publicized use of FMs and LLMS, we still lack a comprehensive knowledge of how they operate, why they underperform, and what they are even capable of because of their emerging global qualities. To deal with these problems, we believe that much critical research on FMs and LLMS would necessitate extensive multidisciplinary collaboration, given their essentially social and technical structure. The International Conference on Foundation and Large Language Models (FLLM) addresses the architectures, applications, challenges, approaches, and future directions. We invite the submission of original papers on all topics with special interest in but not limited to:
Call for Workshop Papers:
Journal Special Issue: Selected high quality papers will be invited for special issue submission at the Information Processing & Management (impact factor : 6.9) https://www.sciencedirect.com/journal/information-processing-and-management
Submissions Guidelines and Proceedings Manuscripts should be prepared in 10-point font using the IEEE 8.5' x 11' two-column format. All papers should be in PDF format, and submitted electronically at Paper Submission Link. A full paper can be up to 8 pages (including all figures, tables and references). Extra pages (up to 4 pages) can be purchased for a fee. Submitted papers must present original unpublished research that is not currently under review for any other conference or journal. Papers not following these guidelines may be rejected without review. Also submissions received after the due date, exceeding length limit, or not appropriately structured may also not be considered. Authors may contact the Program Chair for further information or clarification. All submissions are peer-reviewed by at least three reviewers. Accepted papers will appear in the FLLM Proceeding, and be published by the IEEE Computer Society Conference Publishing Services and be submitted to IEEE Xplore for inclusion. Submitted papers must include original work, and must not be under consideration for another conference or journal. Authors of accepted papers are expected to present their work at the conference. Submitted papers that are deemed of good quality but that could not be accepted as regular papers will be accepted as short papers. Important Dates:
Contact: Please send any inquiry on FLLM to: info@fllm-conference.org
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-30 | (2025-12-06) CfP ASRU 2025, Honolulu, HI, USA
Call For PapersA call for papers invites researchers to submit their work to ASRU 2025 for presentation or publication, outlining topics, and deadlines. The ASRU Workshop is a flagship event of IEEE Speech and Language Processing Technical Committee. It has a tradition of bringing together researchers from academia and industry to discuss problems in automatic speech recognition and understanding. If you have any questions, please contact the ASRU 2025 TPC Chairs at tpc@ieeeasru.org. TopicsTopics of interest include, but not limited to:
Mark Your Calendars
Important Dates
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-31 | (2025-12-12) Journée d'étude conjointe (TLH, AFIA,ATALA) sur les technologies linguistiques pour les langues peu dotées, Paris, France Le collège TLH (Traitement du Langage Humain) de l'Association
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-32 | (2026-05-11) LREC 2026, Palma, Mallorca, Spain FIRST CALL FOR PAPERS
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-33 | (2026-05-21) CfP International conference “Prosody at the crossroads of disciplinary pathways”, Université Grenoble Alpes, France (Saint-Martin-d’Hères campus,IMAG building) Call for Papers
International conference “Prosody at the crossroads of disciplinary pathways” Thursday 21 and Friday 22 May, 2026, Université Grenoble Alpes, France (Saint-Martin-d’Hères campus,IMAG building) Over the past decades, the enthusiasm generated by prosodic studies has spread well outside the boundaries of the traditional subfields of linguistics (phonetics, phonology, syntax, semantics and pragmatics) and reached the related disciplines of psycholinguistics, neurolinguistics, clinical and forensic practice as well as language processing (cf. Di Cristo, 2004). Whereas prosody has traditionally been used as an umbrella term covering the interconnected phenomena of stress, rhythm and intonation (whose phonetic expressions mainly involve changes in fundamental frequency, intensity and duration and their perceptual correlates [Lehiste, 1970; Arvaniti, 2020]), we also aim to include voice quality in the program alongside the established categories of stress and intonation, to encourage discussion on the nature of prosodic features. Intonationis a supralexical phenomenon consisting in variations in fundamental frequency and their perceptual correlates observed at sentence or constituent level (Ladd, 1996; Cruttenden, 1997). It canfulfilallthreeessentialfunctionsofspeech:linguistic,paralinguisticandextralinguistic (Abercrombie, 1967). Therhythmof languages or varieties of languages relates to the hierarchical organisation of variably salient speech units in the temporal development of the production of the speech chain (Dellwo, 2003). The nature of these units varies depending on languages or language varieties. Along with Laver (1968, 1980, 1994), Sharpe (1970) and Mackenzie-Beck (2005), we consider thatvoice quality– which also conveys linguistic, paralinguistic and extralinguistic information – does not only result from speakers’ biologically-derived differences in vocal apparatus, but also from articulatory(orsupralaryngeal) as well asphonatory(orlaryngeal) settings. Articulatory settingconsists in the overall positioning of the articulatory organs. Wilson (2006) defines it as the “underlying or default posture of the articulators (i.e., the tongue, jaw, and lips)”, whereas Honikman (1964) considers that it consists in “the gross oral posture and mechanics [requisite as a framework for the integrating of the isolated sounds into that whole] which constitutes the pronunciation of a language”. This also applies to every idiolect. Phonatory settingsconsist in the way the vocal folds are made to vibrate. Stuart-Smith (2004) describes them as “glottal configurations” or “stricture types”; that is, the potential combination of specific types of tensions that can be brought to bear on the vocal folds. Laver (1994) established a typology of the various phonatory settings that can be achieved through these means. Prosody is traditionally defined as a set of elements whose function is superimposed upon that of the intrinsic features of segments (Lehiste, 1970). However, the growing literature establishing that children’s acquisition of prosodic structure far predates that of discrete units like phonemes and words (cf. e.g., Davis et al., 2000; Polzehl et al., 2024) suggests that it is preferable to conceive of it as an underlying matrix into which the segments are embedded. The main aim of this conference is to promote and enhance collaborations between researchers from different subdomains in order to review and discuss the applications of prosodic research to such fields as language acquisition, foreign language teaching, forensic and clinical phonetics, voice recognition, speech synthesis and sociolinguistics. Despite the existence of a growing literature on the subject, voice quality remains by far the most under-investigated of the elements listed above, especially as it has not conventionally been recognised as a component of prosody. Hence our desire to lay particular emphasis on its structural makeup and REFERENCES
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-34 | (2026-05-26) Speech Prosody 2026, the 13th International Conference on Speech Prosody, University of Pennsylvania, Philadelphia, PA, USA We are pleased to invite submissions for Speech Prosody 2026, the 13th International Conference on Speech Prosody, to be held May 26–29, 2026, at the University of Pennsylvania in Philadelphia. https://www.speechprosody2026.org Speech Prosody is the largest international forum dedicated to prosody in all its forms. This year’s theme, Prosodic Encodings in Context: From Structure and Cognition to Technology, and Back, highlights our commitment to advancing interdisciplinary inquiry into the role of prosody in human communication. The conference aims to bring together scholars from linguistics, cognitive science, neuroscience, speech technology, and related fields to explore emerging insights and challenges. It will foster collaboration across disciplines and inspire new research at the intersection of structure, cognition, and technology. To ensure inclusive participation in a time of uncertainty, Speech Prosody 2026 will be held in a hybrid format. We warmly encourage participants to attend in person, but will also provide meaningful virtual participation options for those facing travel restrictions. Our goal is to bring the field together despite these challenges and uphold the spirit of community that defines Speech Prosody. We welcome submissions on any aspect of prosody in spoken or signed languages. Submissions that reflect the conference theme or adopt integrative approaches are especially encouraged. We also welcome work by junior researchers, on understudied languages, or using interdisciplinary methods. Topics of interest include, but are not limited to: Important dates: For full details, please visit the official website: https://www.speechprosody2026.org We look forward to your contributions! Submissions for special sessions, workshops, tutorials and show&tells are open!
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-35 | (2026-07-06) 10e CONGRÈS MONDIAL DE LINGUISTIQUE FRANÇAISE (CMLF), Arras, France
| |||||||||||||||||||||||||||||||||||||||||||||||||||
3-3-36 | (2029-04-22) 2029 IEEE International Conference on Acoustics, Speech, and Signal Processing, Copenhague, Denmark.
|