ISCA - International Speech
Communication Association


ISCApad Archive  »  2022  »  ISCApad #288  »  Journals

ISCApad #288

Friday, June 10, 2022 by Chris Wellekens

7 Journals
7-1IEEE JSTSP Special issue on Self-Supervised Learning for Speech and Audio Processing

Call for Papers

IEEE Journal of Selected Topics in Signal Processing

Special Issue on

Self-Supervised Learning for Speech
and Audio Processing

[Deadline Extended: January 15, 2022]
 
There is a trend in the machine learning community to adopt self-supervised approaches to pre-train deep networks.
Self-supervised learning utilizes proxy supervised learning tasks, for example, distinguishing parts of the input signal
from distractors, or generating masked input segments conditioned on the unmasked ones, to leverage data from
unlabeled corpora. These approaches make it possible to use a tremendous amount of unlabeled data available on
the web to train large networks to extract high-level, informative, and compact features from raw inputs. Then for
downstream applications, a simple task-specific model is added on top of the self-supervised model. The self-supervised
model can serve as a feature extractor or be fine-tuned with the task-specific model.

Recently self-supervised approaches for speech and audio processing are gaining attention. This special issue will
bring the works on self-supervision for the field of speech and audio processing. Alongside research work on new
self-supervised methods, data, applications, and results, this special issue will call for novel work on understanding,
analyzing, and comparing different self-supervision approaches for speech and audio processing. We welcome
the submission of the work on, but not limited to, the following research directions: 

Topics of interest in this special issue include (but are not limited to):
  • New self-supervised proxy tasks for speech and audio processing.
  • New approaches to use self-supervised models in speech and audio processing tasks.
  • Compare the similarities and differences of self-supervised learning approaches.
  • Theoretical or empirical studies on understanding why self-supervision methods work.
  • Exploring the limits of self-supervised learning approaches for speech and audio processing, 
  • for example, universal pre-trained model for multiple downstream tasks, environment conditions, 
  • or languages.
  • Self-supervised learning approaches involving the interaction of speech and other modalities.
  • Comparison or integration of self-supervised learning methods and other semi-supervised and transfer 
  • learning methods.

Submission Guidelines

Prospective authors should visit the IEEE JSTSP webpages for information on paper submission.

Manuscripts should be submitted using the Manuscript Central system according to the schedule below. 

Manuscripts will be peer-reviewed according to the standard IEEE process.

Back  Top

7-2CfP The speaker's mask: a transdisciplinary interrogator of the complexity of speech, Langue(s) et Parole, Revista de filologia francesa i romanica

The speaker's mask: a transdisciplinary interrogator of the complexity of speech

Call for papers

At the time of this call, the current health crisis still requires the world population to wear face masks designed to protect each individual from droplets and aerosols received or projected during breathing, speaking or singing. Several publications attest to the effectiveness of approved masks in this protection, which varies according to the material of the equipment and its duration of use. However, the study of these devices is also a matter for the human sciences. In particular, the language sciences are not left out in the study of the effects of the mask on the wearer's ability to be heard (Giovanelli et al. 2021) and intelligible (Palmiero et al. 2016), regardless of the communication situation or speech style (Cohn et al. 2021). Speech acoustics (Magee et al. 2020), phonetics, discourse analysis (Onipede 2021), modeling, recognition (Kodali et al. 2021), and psycholinguistics can, for example, be convened by such research regarding a current universal everyday concern. Although several publications have recently appeared in this field, many ways remain to be explored for this transdisciplinary subject.

In this volume, we propose to pursue the reflection ay least through the following thematic reflections:

• Speech perception: How does hiding the lower part of the face alter the receiver’s perception of the message produced by the emitter, and how does the latter adapt to this communicational change?

• Spoken articulation: How does the articulatory discomfort experienced by the masked speaker alter the management of speech production? Does it depend on the segmental and prosodic composition of the speech, on the speaking style or on the communication situation, or even on the representation that the speaker makes of the discomfort caused by the mask to his/her interlocutor?

• Voice: What are the effects of wearing a mask on the produced, perceived and felt timbre of the spoken, declaimed or sung voice, in an ecological situation (artistic for example, or in a training context)?

• Modeling and recognition of speech: What modeling and recognition of a 'masked speech signal' are possible? • Transmission of speech: What are the issues in terms of acquisition and education related to wearing a face mask? (Early childhood, schooling, native and foreign languages, teaching, etc.)

• Discourse analysis: What are the discourses produced in terms of behavior and reaction, emotions and affects, feelings and proprioception, nonverbal communication and aesthetics as a result of wearing the mask? Does wearing the mask (and subsequently dropping it) have any impact (positive or negative) on the speaker's self-esteem and confidence in front of a group? • Clinical phonetics and linguistics: What links can exist between pathologies of the areas covered by the mask and its use? Can the use of the mask cause significant alterations? What are the adaptations to be made when wearing the mask with respect to voice, speech or communication pathologies? Is wearing a mask linked to an increase in vocal fatigue in speakers who have to practice a profession that requires oral expression in front of a large audience?

• Engineering: How can knowledge of the effects of the mask on spoken communication be useful to designers of new devices that are better suited to the communication situation and/or the particularities of the target speakers? In this respect, this volume aims to bring together current research on the issues and effects of wearing a mask as they can be studied from the point of view of the various components of the language sciences (linguistics, phonetics, psycholinguistics, clinical phonetics and linguistics, didactics, sociolinguistics) and their implementation in certain communication contexts (speech therapy, psychology, artistic disciplines, etc.). These reflections, in their theoretical aspect, will contribute to the systemic analyses of oral communication, by providing, for example, data to enlighten the mechanisms of compensation, reorganization of voice, speech or discourse to this multi-effect, external disrupter (Vaxelaire et al. 2007) of language and oral communication systems. The examples of themes proposed above aim, by their diversity, to have the reader perceive the width of the epistemological span targeted by this volume of contributions. Other topics may of course be considered, provided that the research questions involved do question the impact of wearing a mask on the functioning of language and thus contribute, in a transdisciplinary approach, to develop knowledge about the latter. From an application point of view, it is hoped that the contributions collected can contribute to optimizing strategies to overcome the disturbances resulting from wearing the mask. Coordinators Claire Pillot-Loiseau, Université Sorbonne Nouvelle, Paris, France Bernard Harmegnies, Université de Mons, Mons, Belgique Languages of publication: French, English

Author guidelines : https://revistes.uab.cat/languesparole/languesparole/languesparole/about/submissions

Timeline: May 30, 2022: deadline for submission of article proposals to be sent to: r.langues.parole@uab.cat, claire.pillot@sorbonne-nouvelle.fr, Bernard.HARMEGNIES@umons.ac.be

End of 2022: print and online publication of the issue Appel à contributions – Call for papers – Langue(s) & Parole, n°7, 2022 : le masque du locuteur 5

References

Cohn, M., Pycha, A., & Zellou, G. (2021). Intelligibility of face-masked speech depends on speaking style: Comparing casual, clear, and emotional speech. Cognition, 210, 1-5, https://doi.org/10.1016/j.cognition.2020.104570

Giovanelli, E., Valzolgher, C., Gessa, E., Todeschini, M., & Pavani, F. (2021). Unmasking the Difficulty of Listening to Talkers With Masks: lessons from the COVID-19 pandemic. i-Perception, 12(2), 1–11. https://doi.org/10.1177/2041669521998393

Kodali, R. K., & Dhanekula, R. (2021). Face Mask Detection Using Deep Learning. In 2021 International Conference on Computer Communication and Informatics (ICCCI) (pp. 1-5). IEEE.

Magee, M., Lewis, C., Noffs, G., Reece, H., Chan, J. C., Zaga, C. J., ... & Vogel, A. P. (2020). Effects of face masks on acoustic analysis and speech perception: Implications for peri-pandemic protocols. The Journal of the Acoustical Society of America, 148(6), 3562-3568.

Onipede, F. M. (2021). Nigerians' Reactions towards COVID-19 Pandemic Health Precautions: A Pragma-Semiotic Analysis, International Review of Social Sciences Research, Volume 1, Issue 1, pp. 1- 24.

Palmiero, A. J., Symons, D., Morgan III, J. W., & Shaffer, R. E. (2016). Speech intelligibility assessment of protective facemasks and air-purifying respirators. Journal of occupational and environmental hygiene, 13(12), 960-968.

Vaxelaire, B., Sock, R., Kleiber, G., Marsac, F. (2007). Perturbations et Réajustements. Langue et langage, Publications de l'Université Marc Bloch - Strasbourg 2.

Back  Top

7-3CfP TAL Journal (Open Access), Special issue on 'Cross/multimodal NLP'

Call for Papers

 

TAL Journal (Open Access), Special issue on 'Cross/multimodal NLP'

 

 

Submission deadline: 31 March 2022

Abstract submission deadline is removed. The only deadline is now the full paper submission deadline.

If possible, the authors are invited to let us know their intent to submit by email to tal-63-2@sciencesconf.org one week before the full paper submission deadline.


Website: https://tal-63-2.sciencesconf.org


Natural language is not limited to the written modality. It includes and interacts with many others. On the one hand, a message can be conveyed through other language modalities, including audio (speech), gestures and facial expressions (sign language or completed speech). It may also be accompanied by social attitudes and non-verbal dimensions, including signs of affect, spontaneity, pathology, co-adaptation with dialogue participants, etc. Natural language processing (NLP) is thus a joint processing of multiple information channels. On the other hand, natural language is often used to describe concepts and denote entities that are essentially multimodal (description of an image, an event, etc.). Many problems then require bridges between different modalities.


The objective of this special issue of the journal TAL is to promote NLP in multimodal contexts (several modalities contribute to the resolution of a problem) or inter-modal contexts (passage from one modality to another). Thus, the contributions expected for this special issue are, among others (but not exclusively), in the following fields of application:

  • multimodal dialogue, multimodal question-answering ;
  • sign language, completed spoken language ;
  • speech processing, automatic speech recognition, speech synthesis in multimodal contexts;
  • synthesis of animated emotional agents;
  • handwriting recognition and analysis of handwritten documents;
  • understanding, translation and summarisation of multimodal documents;
  • indexing, search and mining of multimedia and/or multimodal documents;
  • biological signal processing, computational psychology or sociology, for NLP;
  • inter-/multimodal human-computer interface for NLP;
  • other multimodal or inter-modal applications (automatic image captioning, image-to-text generation, generation/analysis of songs and lyrics, etc.).
In the face of the predominance of work on written language processing and the historical compartmentalisation of communities specific to each modality (image processing, signal processing, neuroscience, etc.), authors are encouraged to highlight the specificities (benefits, difficulties, perspectives, etc.) linked to inter- or multimodality in their work, for instance concerning:
  • understanding the interactions between modalities;
  • harmonisation or compatibility of representations;
  • the development of joint models or transfer from one modality to another;
  • the constitution (or even annotation) of multimodal resources;
  • etc.

IMPORTANT DATES

  • Full paper submission deadline: 31 March 2022
  • Notification to authors, first review: 15 June 2022
  • Notification to authors, second review: 15 September 2022
  • Publication: November 2022

LANGUAGE

Manuscripts may be submitted in English or FrenchIf all authors are French speakers, they are requested to submit their contributions in French.



FORMAT

Papers must be between 20 and 25 pages, references and appendices included (no length exemptions are possible). Authors who intend to submit a paper are encouraged to click the menu item 'Paper submission' (PDF format).

 

To do so, they will need to have an account, or create it, on the sciencesconf platform (go to http://www.sciencesconf.org and click on 'create account' next to the 'Connect' button at the top of the page). To submit, come back to the page https://tal-63-2.sciencesconf.org/ , connect to the account and upload the submission.

 

From now on, TAL will perform double-blind review: it is thus necessary to anonymize the manuscript and the name of the pdf file.

 

Style sheets are available for download on the Web site of the journal (https://www.atala.org/content/instruction-authors-style-files-0)

 

 

ABOUT THE JOURNAL

Traitement Automatique des Langues (TAL, ?Natural Language Processing?) is an international journal published by ATALA (French Association for Natural Language Processing) since 1960 with the support of the CNRS (French National Centre for Scientific Research). It has moved to an electronic mode of publication, with printing on demand. This has no impact however on the reviewing and selection process.

 

EDITORIAL BOARD

Guest editors:

  • Gwénolé Lecorvé (Orange)
  • John D. Kelleher (TU Dublin)

Members (under development):

  • Loïc Barrault (U. Le Mans)
  • Marion Blondel (CNRS)
  • Chloé Clavel (Telecom Paris)
  • Camille Guinaudeau (U. Paris Saclay)
  • Hervé Le Borgne (CEA)
  • Damien Lolive (U. Rennes 1)
  • Slim Ouni (U. Lorraine)

 

 

 
Back  Top

7-4TAL Journal: Special issue: review articles

TAL Journal: Special issue: review articles

http://tal-63-3.sciencesconf.org/ (page soon available)


2022 Volume 63 Number 3

Deadline for submission: 01/07/2022

Editors: Cécile Fabre, Emmanuel Morin, Sophie Rosset and Pascale Sébillot


FOCUS OF THE ISSUE:

This special issue of the journal TAL invites articles that summarize the current state of knowledge in one of the fields of natural language processing.

The article will discuss research made in the chosen field and will present how it has evolved to the most recent advances. The synthesis must be rigorous, clear and accessible to readers of the journal TAL who are not specialists in the subject of the article. It should provide a perspective on the work presented, allowing the reader to understand the relation between the various lines of research.

The topics covered are those usually targeted by the varia issues of the journal, i.e. all aspects of natural processing of written, spoken and signed languages and of computational linguistics. The theme of the article may be identified within the following fields (non-exhaustive list):

- Computational models of language, statistical learning and modeling
- Lexical and terminological resources
- Linguistic tools (tokenization, tagging, parsing, etc.)
- Intermodality and multimodality
- Language multiplicity and diversity, multilingual processing, translation
- Semantics, discourse, pragmatics, comprehension
- Information access and text mining
- Text generation and synthesis
- Speech or sign language recognition/synthesis
- Dialogue
- Evaluation
- Explicability and reproducibility
- NLP in interaction with other disciplines (e.g. digital humanities)
- NLP and ethics

Note the date of intention to submit (calendar below): the authors will have to submit an abstract of the chosen theme. This phase is important for the selection of external reviewers specialized in the subject matter.


LANGUAGE


Articles are written in English or French. Submissions in English are
accepted only if one of the co-authors is not French speaking.


THE JOURNAL


TAL (Traitement Automatique des Langues / Natural Language Processing)
(http://www.atala.org/revuetal) is an international journal published
by ATALA (French Association for Natural Language Processing) since 1960, with the support of the national centre for scientific research (CNRS). It is published in electronic format, with immediate free access to published articles.


IMPORTANT DATES (PROVISIONAL)

First call: March 2022

Intent to submit (abstract presenting the theme): April 29, 2022

Deadline for submission: July 1, 2022

Notification to authors after first review: end of September 2022

Notification to authors after second review: end of Dec 2022

Publication: February 2023


SUBMISSION FORMAT


The length of the papers must be between 20 and 25 pages.


The TAL journal has a double-blind review process. It is necessary to anonymize the article, the name of the file, and to avoid
self-references.


Style sheets are available on the journal's website
(_https://www.atala.org/content/instruction-authors-style-files-0_).


Authors are invited to submit their paper by clicking on the menu 'Paper submission' (PDF
article' menu (PDF format). To do so, you will need to have an account on the sciencesconf platform (http://www.sciencesconf.org). Click on 'create account' next to the
'Connect' button at the top of the page. To submit, come back to the
page (soon available) http://tal-63-3.sciencesconf.org/, connect to you
account and upload your submission.

SCIENTIFIC COMMITTEE

Each article is evaluated by three reviewers, two external reviewers and a member of the editorial board of the journal TAL. The list of the members of the editorial board of the journal is available at http://www.atala.org/content/comit%C3%A9-de-r%C3%A9daction-0

Back  Top

7-5Appel à contributions: le masque du locuteur, revue Langue(s) et Parole, 2022

The speaker's mask: a transdisciplinary interrogator of the complexity of speech Call for papers At the time of this call, the current health crisis still requires the world population to wear face masks designed to protect each individual from droplets and aerosols received or projected during breathing, speaking or singing. Several publications attest to the effectiveness of approved masks in this protection, which varies according to the material of the equipment and its duration of use. However, the study of these devices is also a matter for the human sciences. In particular, the language sciences are not left out in the study of the effects of the mask on the wearer's ability to be heard (Giovanelli et al. 2021) and intelligible (Palmiero et al. 2016), regardless of the communication situation or speech style (Cohn et al. 2021). Speech acoustics (Magee et al. 2020), phonetics, discourse analysis (Onipede 2021), modeling, recognition (Kodali et al. 2021), and psycholinguistics can, for example, be convened by such research regarding a current universal everyday concern. Although several publications have recently appeared in this field, many ways remain to be explored for this transdisciplinary subject. In this volume, we propose to pursue the reflection ay least through the following thematic reflections:

• Speech perception: How does hiding the lower part of the face alter the receiver’s perception of the message produced by the emitter, and how does the latter adapt to this communicational change?

• Spoken articulation: How does the articulatory discomfort experienced by the masked speaker alter the management of speech production? Does it depend on the segmental and prosodic composition of the speech, on the speaking style or on the communication situation, or even on the representation that the speaker makes of the discomfort caused by the mask to his/her interlocutor?

• Voice: What are the effects of wearing a mask on the produced, perceived and felt timbre of the spoken, declaimed or sung voice, in an ecological situation (artistic for example, or in a training context)?

 Call for papers – Langue(s) & Parole, n°7, 2022 : le masque du locuteur 4

• Modeling and recognition of speech: What modeling and recognition of a 'masked speech signal' are possible? • Transmission of speech: What are the issues in terms of acquisition and education related to wearing a face mask? (Early childhood, schooling, native and foreign languages, teaching, etc.)

• Discourse analysis: What are the discourses produced in terms of behavior and reaction, emotions and affects, feelings and proprioception, nonverbal communication and aesthetics as a result of wearing the mask? Does wearing the mask (and subsequently dropping it) have any impact (positive or negative) on the speaker's self-esteem and confidence in front of a group?

• Clinical phonetics and linguistics: What links can exist between pathologies of the areas covered by the mask and its use? Can the use of the mask cause significant alterations? What are the adaptations to be made when wearing the mask with respect to voice, speech or communication pathologies? Is wearing a mask linked to an increase in vocal fatigue in speakers who have to practice a profession that requires oral expression in front of a large audience?

• Engineering: How can knowledge of the effects of the mask on spoken communication be useful to designers of new devices that are better suited to the communication situation and/or the particularities of the target speakers? In this respect, this volume aims to bring together current research on the issues and effects of wearing a mask as they can be studied from the point of view of the various components of the language sciences (linguistics, phonetics, psycholinguistics, clinical phonetics and linguistics, didactics, sociolinguistics) and their implementation in certain communication contexts (speech therapy, psychology, artistic disciplines, etc.). These reflections, in their theoretical aspect, will contribute to the systemic analyses of oral communication, by providing, for example, data to enlighten the mechanisms of compensation, reorganization of voice, speech or discourse to this multi-effect, external disrupter (Vaxelaire et al. 2007) of language and oral communication systems. The examples of themes proposed above aim, by their diversity, to have the reader perceive the width of the epistemological span targeted by this volume of contributions. Other topics may of course be considered, provided that the research questions involved do question the impact of wearing a mask on the functioning of language and thus contribute, in a transdisciplinary approach, to develop knowledge about the latter. From an application point of view, it is hoped that the contributions collected can contribute to optimizing strategies to overcome the disturbances resulting from wearing the mask.

Coordinators Claire Pillot-Loiseau, Université Sorbonne Nouvelle, Paris, France Bernard Harmegnies, Université de Mons, Mons, Belgique

Languages of publication: French, English

Author guidelines : https://revistes.uab.cat/languesparole/languesparole/languesparole/about/submissions

Timeline: May 30, 2022: deadline for submission of article proposals to be sent to: r.langues.parole@uab.cat, claire.pillot@sorbonne-nouvelle.fr, Bernard.HARMEGNIES@umons.ac.be

End of 2022: print and online publication of the issue Appel à contributions – Call for papers – Langue(s) & Parole, n°7, 2022 : le masque du locuteur 5

References Cohn, M., Pycha, A., & Zellou, G. (2021). Intelligibility of face-masked speech depends on speaking style: Comparing casual, clear, and emotional speech. Cognition, 210, 1-5, https://doi.org/10.1016/j.cognition.2020.104570 Giovanelli, E., Valzolgher, C., Gessa, E., Todeschini, M., & Pavani, F. (2021). Unmasking the Difficulty of Listening to Talkers With Masks: lessons from the COVID-19 pandemic. i-Perception, 12(2), 1–11. https://doi.org/10.1177/ 2041669521998393 Kodali, R. K., & Dhanekula, R. (2021). Face Mask Detection Using Deep Learning. In 2021 International Conference on Computer Communication and Informatics (ICCCI) (pp. 1-5). IEEE. Magee, M., Lewis, C., Noffs, G., Reece, H., Chan, J. C., Zaga, C. J., ... & Vogel, A. P. (2020). Effects of face masks on acoustic analysis and speech perception: Implications for peri-pandemic protocols. The Journal of the Acoustical Society of America, 148(6), 3562-3568. Onipede, F. M. (2021). Nigerians' Reactions towards COVID-19 Pandemic Health Precautions: A Pragma-Semiotic Analysis, International Review of Social Sciences Research, Volume 1, Issue 1, pp. 1- 24. Palmiero, A. J., Symons, D., Morgan III, J. W., & Shaffer, R. E. (2016). Speech intelligibility assessment of protective facemasks and air-purifying respirators. Journal of occupational and environmental hygiene, 13(12), 960-968. Vaxelaire, B., Sock, R., Kleiber, G., Marsac, F. (2007). Perturbations et Réajustements. Langue et langage, Publications de l'Université Marc Bloch - Strasbourg 2.

Back  Top



 Organisation  Events   Membership   Help 
 > Board  > Interspeech  > Join - renew  > Sitemap
 > Legal documents  > Workshops  > Membership directory  > Contact
 > Logos      > FAQ
       > Privacy policy

© Copyright 2024 - ISCA International Speech Communication Association - All right reserved.

Powered by ISCA