ISCApad #274

Sunday, April 11, 2021 by Chris Wellekens

3-3-17 (2021-06-23) Workshop 'From speech technology to big data phonetics and phonology: a win-win paradigm' @ PaPE 2021, Barcelona, Spain

The workshop titled 'From speech technology to big data phonetics and phonology: a win-win paradigm' will be held on June 23, 2021, during the international conference Phonetics And Phonology In Europe (PaPE 2021). We are pleased to invite you to submit abstracts on related topics to the workshop. The workshop will take place virtually or in person in Barcelona, Spain (depending on the evolution of the current public health situation).

Important dates:

Abstract deadline: 1 February 2021

Notification of acceptance: 28 February 2021

Workshop: 23 June 2021, 14:00-17:00 (Barcelona time)

Detailed information on the workshop can be found below or at https://pape2021.upf.edu/session/creativity-and-variability-prosody-and-information-2-2-2/.

We look forward to receiving your abstracts!

Best wishes,

Yaru Wu, on behalf of the organisers

===========================================================================

From speech technology to big data phonetics and phonology: a win-win paradigm

Organizers:

Martine Adda-Decker (CNRS LPP, Université Sorbonne Nouvelle, France)

Ioana Chitoran (Université de Paris, France)

Adèle Jatteau (Université de Lille, France)

Mathilde Hutin (CNRS LIMSI, Université Paris-Saclay, France)

Lori Lamel (CNRS LIMSI, Université Paris-Saclay, France)

Mark Liberman (University of Pennsylvania, USA)

Peggy Renwick (University of Georgia, Athens, USA)

Barbara Schuppler (Graz University of Technology, Austria)

Laura Spinu (Kingsborough Community College, CUNY, USA)

Ioana Vasilescu (CNRS LIMSI, Université Paris-Saclay, France)

Yaru Wu (CNRS LIMSI, Université Paris-Saclay, France; CNRS LPP, Sorbonne Nouvelle, France)

Summary description / Motivation

During the last decade, the term "big data" has become a major keyword in numerous areas of social sciences and humanities, which are increasingly concerned with the need for digital processing of an ever-growing influx of data. Among these areas, phonetics and laboratory phonology are at the forefront, as substantial benefit can be expected from the study of larger and richer data collections, supported by faster, partially automated processing.

The current scientific and technological constellation holds promise for a virtuous circle of shared interests in large corpus-based and statistically supported modeling of phonetic variation, opening avenues for both linguists and technology stakeholders. Indeed, a new research field, "big data phonetics", is emerging that relies on corpora and approaches borrowed from speech technologies. In return, speech technologies may take advantage of statistically grounded observations in order to better disentangle the sources and the patterns of speech variation.

We propose a workshop dedicated to this exciting research direction combining methods, approaches and corpora from speech technology domains with phonetics and laboratory phonology studies.

Background and research questions

Traditionally, research in phonetics and phonology is driven by specific hypotheses, which may entail requirements both on the speech data's acoustic quality and on their linguistic content and structure. Raw large-scale corpora typically include all kinds of noise, adding to the highly variable nature of speech, which is conditioned by many linguistic and extra-linguistic factors. When relying on such heterogeneous material, phonetics and laboratory phonology research needs to reconsider both how scientific hypotheses are addressed and the methods used to process such data. One of the purposes of the workshop is to discuss access to such data and the various challenges of processing large-scale corpora for speech analysis by phoneticians and phonologists. A related question concerns the most efficient methods borrowed from speech technologies that can be "diverted" for the needs of phonetic analysis.

The symmetrical, speech technology-driven purpose of this workshop is to outline the state of the art of the speech variation challenges for speech technologies and to provide suggestions on how these technologies could benefit from phonetics- and phonology-driven analyses. For example, Automatic Speech Recognition systems and related applications are known to degrade ungracefully when faced with unseen variation. Research aimed at improving lexical modeling for speech recognition and L2 pronunciation learning may benefit from large corpus-based phonetics and phonology research.

Several special sessions on similar topics have been dedicated to big data in phonetic research as part of phonetics and phonology scientific events (see VLSP, UPenn in 2011, and special sessions at ICPhS 2015, ICPhS 2019 and LSRL 2019).

The workshop will not only promote the use of speech technologies as an aid for linguistic studies and provide insight into how to make use of recent developments, but also make research in phonetics and phonology visible to the speech technology community.

Topics and areas of interest

We encourage submissions on any topic related to the questions listed below:

- How to analyze variation phenomena in continuous speech using large corpora?

- How to take advantage of large corpora for segmental and supra-segmental studies? What are the caveats?

- How to investigate ongoing phonological processes using large corpora?

- How to capture sound change in the pool of large-scale corpora?

- How to clean and structure annotation of raw speech data?

- How could expertise and research in phonetics and phonology take part in the advancement of speech technology (e.g. improving pronunciation dictionaries)?

Submission information

Abstracts for the workshop follow the PaPE 2021 conference abstract guidelines. Please find the abstract template of the conference here. All presentations will be oral and follow the PaPE format.

Abstracts should be submitted through Easychair by 1 February 2021. Authors may submit one abstract as first author and up to three abstracts as a co-author.

Important dates:

Abstract deadline: 1 February 2021

Notification of acceptance: 28 February 2021

Workshop: 23 June 2021, 14:00-17:00 (Barcelona time)

Website: https://pape2021.upf.edu/session/creativity-and-variability-prosody-and-information-2-2-2/
