ISCA - International Speech
Communication Association


ISCApad Archive  »  2012  »  ISCApad #171  »  Resources  »  Database

ISCApad #171

Tuesday, September 04, 2012 by Chris Wellekens

5-2 Database
5-2-1ELRA - Language Resources Catalogue - Update (2012-07)

*****************************************************************
ELRA - Language Resources Catalogue - Update
*****************************************************************
ELRA is happy to announce that 2 new Speech     Telephone Resources are now available in its catalogue.
    Moreover, an updated version of the Bilingual Collocational     Dictionary (Horst Bogatz) has also been released.     
   
    1) New Language Resources:
     
      ELRA-S0343 VERIF1DE
   
The speech corpus VERIF1DE contains 20 recordings (sessions) of     150 German speakers each over the telephone network (10 sessions     over fixed network and 10 sessions over GSM). Each session contains  40 single recordings, mainly speech read from a prompt sheet.
  
For more information, see: http://catalog.elra.info/product_info.php?products_id=1169
   
    ELRA-S0344 LILA Hindi Belt database
   
The LILA Hindi Belt database comprises 2,023 Hindi speakers     (1,011 males and 1,012 females, all speakers with Hindi as first     language) recorded over the Indian mobile telephone network. Each  speaker uttered 83 read and spontaneous items.
   
For more information, see: http://catalog.elra.info/product_info.php?products_id=1170
   
    2) Updated Language Resource:
     
    ELRA-M0013 Bilingual Collocational Dictionary (Horst Bogatz)
   
This new release contains  69,000  English headwords (instead       of 40,000 for the previous release).
    The bilingual English-German collocational dictionary consists of     around 69,000 English headwords, including concepts expressed with     more than one word (e.g. 'the awareness of the environment' or 'lame     duck') and hyphenated compounds. It contains verbs, adjectives,     synonyms and phrases that collocate with the headword. It provides     the German equivalents for the headwords as well as their English     synonyms.
    For more information, see: http://catalog.elra.info/product_info.php?products_id=451
    
    For more information on the catalogue, please contact Valérie  Mapelli mailto:mapelli@elda.org
   
    Visit our On-line Catalogue: http://catalog.elra.info
    Visit the Universal Catalogue: http://universal.elra.info
    Archives of ELRA Language Resources Catalogue Updates: http://www.elra.info/LRs-Announcements.html

Top

5-2-2LDC Newsletter (August 2012)

 

In
            this newsletter:

           
      LDC and Google Collaboration Results             in New Syntactically-Annotated Language Resources  -
       
        The Future of Language Resources: LDC 20th             Anniversary Workshop  -
       
        -  Fall 2012 LDC Data             Scholarship Program  -
       
        -  Spotlight on HAVIC  -
       

      New

            publications:

           
      LDC2012T13
      English Web             Treebank  -
     
       LDC2012T14
      -  GALE Phase 2 Arabic             Broadcast Conversation Parallel Text Part 2   –
     
       LDC2012T12
      -  Spanish TimeBank 1.0  –
     

   
       
   



       

       
   

LDC and Google Collaboration Results in New             Syntactically-Annotated Language Resources
   

   
      Google Inc.  and the Linguistic Data         Consortium (LDC) have collaborated to develop new         syntactically-annotated language resources that enable computers         to better understand human language. The project, funded through a gift from         Google in 2010, has resulted in the development of the English Web
          Treebank LDC2012T13
containing over 250,000 words of         weblogs, newsgroups, email, reviews and question-answers         manually annotated for syntactic structure. This resource will         allow language technology researchers to develop and evaluate         the robustness of parsing methods in various new web domains. It         was used in the 2012 shared task on parsing English web text for         the First           Workshop on Syntactic Analysis of Non-Canonical Language           (SANCL) which took place at NAACL-HLT in Montreal on June         8, 2012. The English Web Treebank is available to the research         community through LDC’s
          Catalog
.
       
      Natural language processing (NLP) is a field of         computational linguistic research concerned with the         interactions between human language and computers. Parsing is a         discipline within NLP in which computers analyze text and         determine its syntactic structure. While syntactic parsing is         already practically useful, Google funded this effort to help         the research community develop better parsers for web text. The         web texts collected and annotated by LDC provide new, diverse         data for training parsing systems.
       
      Google chose LDC for         this work based on the Consortium’s experience in developing and         creating syntactic annotations, also known as treebanks.         Treebanks are critically important to parsing research since         they provide human-analyzed sentence structures that facilitate         training and testing scenarios in NLP research. This work         extends the existing relationship between LDC and Google.  LDC has published four         other Google-developed data sets in the past six years: English,         Chinese, Japanese and European language n-grams used principally         for language modeling.
     
 
       
   

       
       The
            Future of Language Resources: LDC 20th Anniversary Workshop            

             
   

    LDC’s 20th
        Anniversary Workshop is rapidly approaching! The event will take         place on the University of Pennsylvania’s campus on September         6-7, 2012.
             
      Workshop themes
        include: the developments in human language technologies and         associated resources that have brought us to our current state;         the language resources required by the technical approaches         taken and the impact of these resources on HLT progress; the         applications of HLT and resources to other disciplines including         law, medicine, economics, the political sciences and psychology;         the impact of HLTs and related technologies on linguistic         analysis and novel approaches in fields as widespread as         phonetics, semantics, language documentation, sociolinguistics         and dialect geography; and the impact of any of these         developments on the ways in which language resources are         created, shared and exploited and on the specific resources         required.
             
     
Please read more here.
       
     

   
     
   
   

Fall 2012

              LDC Data Scholarship Program

     
   

    Applications are
        now being accepted through September 17, 2012, 11:59PM EST for         the Fall 2012 LDC Data Scholarship program! The LDC Data         Scholarship program provides university students with access to         LDC data at no-cost. During previous program cycles, LDC has         awarded no-cost copies of LDC data to over 20 individual         students and student research groups.
       
        This program is open to students pursuing both undergraduate and         graduate studies in an accredited college or university. LDC         Data Scholarships are not restricted to any particular field of         study; however, students must demonstrate a well-developed         research agenda and a bona fide inability to pay. The selection         process is highly competitive.
       
        The application consists of two parts:
       
              (1) Data Use Proposal. Applicants must submit a proposal         describing their intended use of the data. The proposal should         state which data the student plans to use and how the data will         benefit their research project as well as information on the         proposed methodology or algorithm.
           
              Applicants should consult the LDC Corpus Catalog         for a complete list of data distributed by LDC. Due to certain         restrictions, a handful of LDC corpora are restricted to members         of the Consortium. Applicants are advised to select a maximum of         one to two datasets; students may apply for additional datasets         during the following cycle once they have completed processing         of the initial datasets and publish or present work in some         juried venue.
           
              (2) Letter of Support. Applicants must submit one letter         of support from their thesis adviser or department chair. The         letter must confirm that the department or university lacks the         funding to pay the full Non-member Fee for the data and verify         the student's need for data.
                   
        For further information on application materials and program         rules, please visit the LDC Data Scholarship         page.
       
        Students can email their applications to the LDC Data Scholarship program. Decisions will be         sent by email from the same address.
       
        The deadline for the Fall 2012 program cycle is September 17,         2012, 11:59PM EST.
       

   
               

Spotlight
            on HAVIC

     

         
      As part
        of our 20th anniversary celebration, the coming newsletters will         include features that provide an overview of the broad range of         LDC’s activities. To begin, we'll examine the Heterogeneous         Audio Visual Internet Collection (HAVIC), one of the many         projects handled by LDC’s Collection/Annotation Group led by         Senior Associate Director Stephanie Strassel.
       
      Under the
        supervision of Senior Research Coordinator Amanda
          Morris, the HAVIC team is developing a large corpus of           unconstrained multimedia data drawn from user-generated videos           on the web and annotated for a variety of features. The HAVIC           corpus has been designed with an eye toward providing           increased challenges for both acoustic and video processing           technologies, focusing on multi-dimensional variation inherent           in user-generated content. Over the past three years the           corpus has provided training, development and test data for           the NIST TRECVID Multimedia Event Detection (MED) Evaluation           Track, whose goal is to assemble core detection technologies           into a system that can search multimedia recordings for           user-defined events based on pre-computed             metadata.
       
      For each
        MED evaluation, LDC and NIST have collaborated to define many         new events, including things like “making a cake” or “assembling         a shelter”. Each event requires an Event Kit, consisting of a         textual description of the event’s properties along with a few         exemplar videos depicting the event. A large team of LDC data         scouts search for videos that contain each event, along with         videos that are only indirectly or superficially related to         defined events plus background videos that are unrelated to any         defined event. After finding suitable content, data scouts label         each video for a variety of features including the presence of         audio, visual or text evidence that a particular event has         occurred. This work is done using LDC’s AScout framework,         consisting of a browser plug-in, a database backend and         processing scripts that together permit data scouts to         efficiently search for videos, annotate the multimedia content,         and initiate download and post-processing of the data. Collected         data is converted to MPEG-4 format, with h.264 video encoding         and AAC audio encoding, and the original video resolution and         audio/video bitrates are retained.
       
      To date,
        LDC has collected and labeled well over 100,000 videos as part         of the HAVIC Project, and the corpus will ultimately comprise         thousands of hours of labeled data. Look for portions of the         corpus to appear among LDC’s future releases.
       
           

       
   

     
              New publications
       
     
     

         
      (1)English
          Web Treebank
was developed by the Linguistic Data         Consortium (LDC) with funding through a gift from Google Inc. It         consists of over 250,000 words of English weblogs, newsgroups,         email, reviews and question-answers manually annotated for         syntactic structure and is designed to allow language technology         researchers to develop and evaluate the robustness of parsing         methods in those web domains.
       
      This release
        contains 254,830 word-level tokens and 16,624 sentence-level         tokens of webtext in 1174 files annotated for sentence- and         word-level tokenization, part-of-speech, and syntactic         structure. The data is roughly evenly divided across five         genres: weblogs, newsgroups, email, reviews, and         question-answers. The files were manually annotated following         the sentence-level tokenization guidelines for web text and the         word-level tokenization guidelines developed for English         treebanks in the DARPA GALE project. Only text         from the subject line and message body of posts, articles,         messages and question-answers were collected and annotated.
       
      English Web
        Treebank is distributed via web download.
       
      2012 Subscription
        Members will receive two copies of this data on disc. 2012         Standard Members may request a copy as part of their 16 free         membership corpora. Non-members may license this data by         completing the LDC User Agreement for Non-members. The agreement can         be faxed to +1 215 573 2175 or scanned and emailed to this         address. The first fifty copies of this publication are being         made available at no charge. After the first fifty copies are         distributed, the non-member fee of US$175 applies.
       
           
   

   
   

*
     

   
      (2) GALE
          Phase 2 Arabic Broadcast Conversation Parallel Text Part 2
        was developed by LDC. Along with other corpora, the parallel         text in this release comprised training data for Phase 2 of the         DARPA GALE (Global Autonomous Language Exploitation) Program.         This corpus contains Modern Standard Arabic source text and         corresponding English translations selected from broadcast         conversation (BC) data collected by LDC between 2004 and 2007         and transcribed by LDC or under its direction.
       
      GALE Phase
        2 Arabic Broadcast Conversation Parallel Text Part 2 includes 29         source-translation document pairs, comprising 169,488 words of         Arabic source text and its English translation. Data is drawn         from eight distinct Arabic programs broadcast between 2004 and         2007 from Aljazeera, a regional broadcast programmer based in         Doha, Qatar; and Nile TV, an Egyptian broadcaster. The programs         in this release focus on current events topics.
       
      The files
        in this release were transcribed by LDC staff and/or         transcription vendors under contract to LDC in accordance with         the Quick Rich Transcription guidelines         developed by LDC. Transcribers indicated sentence boundaries in         addition to transcribing the text. Data was manually selected         for translation according to several criteria, including         linguistic features, transcription features and topic features.         The transcribed and segmented files were then reformatted into a         human-readable translation format and assigned to translation         vendors. Translators followed LDC's Arabic to English         translation guidelines. Bilingual LDC staff performed quality         control procedures in the completed translations.
       
      GALE Phase
        2 Arabic Broadcast Conversation Parallel Text Part 2 is         distributed via web download.
       
      2012 Subscription
        Members will receive two copies of this data on disc. 2012         Standard Members may request a copy as part of their 16 free         membership corpora. Non-members may license this data for         US$1750.
     

   
   

*
   

         
      (3)
        Spanish
          TimeBank 1.0
was developed by researchers at Barcelona Media and consists of         Spanish texts in the AnCora corpus annotated with         temporal and event information according to the TimeML specification language.
       
      Spanish TimeBank
        1.0 contains stand-off annotations for 210 documents with over         75,800 tokens (including punctuation marks) and 68,000 tokens         (excluding punctuation). The source documents are news stories         and fiction from the AnCora corpus.
       
      The AnCora
        corpus is the largest multilayer annotated corpus of Spanish and         Catalan. AnCora contains 400,000 words in Spanish and 275,000         words in Catalan. The AnCora documents are annotated on many         linguistic levels including structure, syntax, dependencies,         semantics and pragmatics. That information is not included in         this release, but it can be mapped to the present annotations.         The corpus is freely available from the Centre de Llenguatge i Computació (CLiC).
       
      Spanish TimeBank
        1.0 is distributed by web download.
       
      2012 Subscription
        Members will receive two copies of this data on disc. 2012         Standard Members may request a copy as part of their 16 free         membership corpora. Non-members may license this data by         completing the LDC User Agreement for Non-members
. The agreement         can be faxed to +1 215 573 2175 or scanned and emailed to this         address. The publication is being made available at no charge.
     
   


In this newsletter:

 

 

 

Top

5-2-3Speechocean January 2012 update

Speechocean - Language Resource Catalogue - New Released (01- 2012)

Speechocean, as a global provider of language resources and data services, has more than 200 large-scale databases available in 80+ languages and accents covering the fields of Text to Speech, Automatic Speech Recognition, Text, Machine Translation, Web Search, Videos, Images etc.

 

Speechocean is glad to announce that more Speech Resources has been released:

 

Chinese and English Mixing Speech Synthesis Database (Female)

The Chinese Mandarin TTS Speech Corpus contains the read speech of a native Chinese Female professional broadcaster recorded in a studio with high SNR (>35dB) over two channels (AKG C4000B microphone and Electroglottography (EGG) sensor). 
The Corpus includes the following categories:
1.    Basic Mandarin sub-corpus: including 5,000 utterances which were carefully designed considering all kinds of linguistic phenomena. All sentences were declarative and extracted from News channels of People's Daily, China Daily, etc. The prompts with negative words were carefully excluded. ONLY suitable length sentences were accepted (7~20 words, in average 14 words). This sub-corpus can be used for R&D of HMM-based TTS, Limit domain TTS and Small-scale concatenative TTS;
2.    Complementary Mandarin sub-corpus: including 10,000 utterances which were carefully designed considering all kinds of linguistic phenomena. All sentences were declarative and extracted from News channels of People's Daily, China Daily, etc. The prompts with negative words are carefully excluded. ONLY suitable length sentences were accepted (7~20 words, average 14 words). This sub-corpus is a complementary corpus for Basic Mandarin sub-corpus and can be used for R&D of Large-scale concatenative TTS;
3.    Mandarin Neutral sub-corpus: including 380 Chinese bi-syllable words which embedded in carrier sentences;
4.    Mandarin ERHUA sub-corpus: including 290 Chinese Erhua syllables which embedded in carrier sentences;
5.    Mandarin Digit-String sub-corpus: including 1250 utterances with 3-digit length which considered the different pronunciation of 1, i.e. “yi1” and “yao1”.
6.    Mandarin Question sub-corpus: including 300 question sentences with common used question mark, for example “吗”, “么”, “呢”, and etc.;
7.    Mandarin exclamatory sub-corpus: including 200 exclamatory sentences with common used exclamatory mark, for example “呀”, “啊”, “吧”, “啦”, and etc.;
8.    Chinese English sentence sub-corpus: including 1,000 sentences which were carefully designed considering bi-phone coverage. All sentences were extracted from News channels of Voice of America (VOA), and etc. The prompts with negative words are carefully excluded. ONLY suitable length sentences were accepted (7~20 words, in average 12 words) and phonetically annotated with SAMPA. This sub-corpus can be used for R&D of HMM-based TTS, Limit domain TTS and Small-scale concatenative TTS;
9.    Chinese English words sub-corpus: including about 6,000 commonly used English words which embedded in carrier sentence;
10.    Chinese English Abbreviation sub-corpus: including about 200 utterances which considered not only the alphabet coverage, but also the combination of character and digit, such as “MP4”;
11.    Chinese English Letter sub-corpus: including 26 carrier utterances with each letter embedded in the Beginning, Middle and End;
12.    Chinese Greek Letter sub-corpus: including 24 carrier utterances with each letter embedded in the Beginning, Middle and End.

All speech data are segmented and labeled on phone level. Pronunciation lexicon and pitch extract from EEG can also be provided based on demands.

 

France French Speech Recognition Corpus (desktop) – 50 speakers

This France French desktop speech recognition database was collected by SpeechOcean in France. This database is one of our databases of Speech Data ----Desktop Project (SDD) which contains the database collections for 30 languages presently. 

It contains the voices of 50 different native speakers who were balanced distributed by age (mainly 16 – 30, 31 – 45, 46 – 60), gender (28 males, 22 females) and regional accents. The script was specially designed to provide material for both training and testing of many classes of speech recognition applications. Each speaker recorded 500 utterances in a quiet office environment through two professional microphones. Each utterance is stored as 44.1K 16Bit uncompressed PCM format and accompanied by an ASCII SAM label file which contains the relevant descriptive information.

A pronunciation lexicon with a phonemic transcription in SAMPA is also included.

 

UK English Speech Recognition Corpus (desktop) – 50 speakers

This UK English desktop speech recognition database was collected by SpeechOcean in England. This database is one of our databases of Speech Data ----Desktop Project (SDD) which contains the database collections for 30 languages presently. 

It contains the voices of 50 different native speakers who were balanced distributed by age (mainly 16 – 30, 31 – 45, 46 – 60), gender (28 males, 22 females) and regional accents. The script was specially designed to provide material for both training and testing of many classes of speech recognition applications. Each speaker recorded 500 utterances in a quiet office environment through two professional microphones. Each utterance is stored as 44.1K 16Bit uncompressed PCM format and accompanied by an ASCII SAM label file which contains the relevant descriptive information.

A pronunciation lexicon with a phonemic transcription in SAMPA is also included.

 

US English Speech Recognition Corpus (desktop) – 50 speakers

This US English desktop speech recognition database was collected by SpeechOcean in America. This database is one of our databases of Speech Data ----Desktop Project (SDD) which contains the database collections for 30 languages presently. 

It contains the voices of 50 different native speakers who were balanced distributed by age (mainly 16 – 30, 31 – 45, 46 – 60), gender (25 males, 25 females) and regional accents. The script was specially designed to provide material for both training and testing of many classes of speech recognition applications. Each speaker recorded 500 utterances in a quiet office environment through two professional microphones. Each utterance is stored as 44.1K 16Bit uncompressed PCM format and accompanied by an ASCII SAM label file which contains the relevant descriptive information.

A pronunciation lexicon with a phonemic transcription in SAMPA is also included.

 

Italian Speech Recognition Corpus (desktop) – 50 speakers

This Italian desktop speech recognition database was collected by SpeechOcean in Italy. This database is one of our databases of Speech Data ----Desktop Project (SDD) which contains the database collections for 30 languages presently. 

It contains the voices of 50 different native speakers who were balanced distributed by age (mainly 16 – 30, 31 – 45, 46 – 60), gender (23 males, 27 females) and regional accents. The script was specially designed to provide material for both training and testing of many classes of speech recognition applications. Each speaker recorded 500 utterances in a quiet office environment through two professional microphones. Each utterance is stored as 44.1K 16Bit uncompressed PCM format and accompanied by an ASCII SAM label file which contains the relevant descriptive information.

A pronunciation lexicon with a phonemic transcription in SAMPA is also included.

 

For more information about our Database and Services please visit our website www.Speechocen.com or visit our on-line Catalogue at http://www.speechocean.com/en-Product-Catalogue/Index.html

If you have any inquiry regarding our databases and service please feel free to contact us:

Xianfeng Cheng mailto: Chengxianfeng@speechocean.com

Marta Gherardi mailto: Marta@speechocean.com

 

 

Top

5-2-4Appen ButlerHill

 

Appen ButlerHill 

A global leader in linguistic technology solutions

RECENT CATALOG ADDITIONS—MARCH 2012

1. Speech Databases

1.1 Telephony

1.1 Telephony

Language

Database Type

Catalogue Code

Speakers

Status

Bahasa Indonesia

Conversational

BAH_ASR001

1,002

Available

Bengali

Conversational

BEN_ASR001

1,000

Available

Bulgarian

Conversational

BUL_ASR001

217

Available shortly

Croatian

Conversational

CRO_ASR001

200

Available shortly

Dari

Conversational

DAR_ASR001

500

Available

Dutch

Conversational

NLD_ASR001

200

Available

Eastern Algerian Arabic

Conversational

EAR_ASR001

496

Available

English (UK)

Conversational

UKE_ASR001

1,150

Available

Farsi/Persian

Scripted

FAR_ASR001

789

Available

Farsi/Persian

Conversational

FAR_ASR002

1,000

Available

French (EU)

Conversational

FRF_ASR001

563

Available

French (EU)

Voicemail

FRF_ASR002

550

Available

German

Voicemail

DEU_ASR002

890

Available

Hebrew

Conversational

HEB_ASR001

200

Available shortly

Italian

Conversational

ITA_ASR003

200

Available shortly

Italian

Voicemail

ITA_ASR004

550

Available

Kannada

Conversational

KAN_ASR001

1,000

In development

Pashto

Conversational

PAS_ASR001

967

Available

Portuguese (EU)

Conversational

PTP_ASR001

200

Available shortly

Romanian

Conversational

ROM_ASR001

200

Available shortly

Russian

Conversational

RUS_ASR001

200

Available

Somali

Conversational

SOM_ASR001

1,000

Available

Spanish (EU)

Voicemail

ESO_ASR002

500

Available

Turkish

Conversational

TUR_ASR001

200

Available

Urdu

Conversational

URD_ASR001

1,000

Available

1.2 Wideband

Language

Database Type

Catalogue Code

Speakers

Status

English (US)

Studio

USE_ASR001

200

Available

French (Canadian)

Home/ Office

FRC_ASR002

120

Available

German

Studio

DEU_ASR001

127

Available

Thai

Home/Office

THA_ASR001

100

Available

Korean

Home/Office

KOR_ASR001

100

Available

2. Pronunciation Lexica

Appen Butler Hill has considerable experience in providing a variety of lexicon types. These include:

Pronunciation Lexica providing phonemic representation, syllabification, and stress (primary and secondary as appropriate)

Part-of-speech tagged Lexica providing grammatical and semantic labels

Other reference text based materials including spelling/mis-spelling lists, spell-check dictionar-ies, mappings of colloquial language to standard forms, orthographic normalization lists.

Over a period of 15 years, Appen Butler Hill has generated a significant volume of licensable material for a wide range of languages. For holdings information in a given language or to discuss any customized development efforts, please contact: sales@appenbutlerhill.com

3. Named Entity Corpora

Language

Catalogue Code

Words

Description

Arabic

ARB_NER001

500,000

These NER Corpora contain text material from a vari-ety of sources and are tagged for the following Named Entities: Person, Organization, Location, Na-tionality, Religion, Facility, Geo-Political Entity, Titles, Quantities

English

ENI_NER001

500,000

Farsi/Persian

FAR_NER001

500,000

Korean

KOR_NER001

500,000

Japanese

JPY_NER001

500,000

Russian

RUS_NER001

500,000

Mandarin

MAN_NER001

500,000

Urdu

URD_NER001

500,000

3. Named Entity Corpora

Language

Catalogue Code

Words

Description

Arabic

ARB_NER001

500,000

These NER Corpora contain text material from a vari-ety of sources and are tagged for the following Named Entities: Person, Organization, Location, Na-tionality, Religion, Facility, Geo-Political Entity, Titles, Quantities

English

ENI_NER001

500,000

Farsi/Persian

FAR_NER001

500,000

Korean

KOR_NER001

500,000

Japanese

JPY_NER001

500,000

Russian

RUS_NER001

500,000

Mandarin

MAN_NER001

500,000

Urdu

URD_NER001

500,000

4. Other Language Resources

Morphological Analyzers – Farsi/Persian & Urdu

Arabic Thesaurus

Language Analysis Documentation – multiple languages

 

For additional information on these resources, please contact: sales@appenbutlerhill.com

5. Customized Requests and Package Configurations

Appen Butler Hill is committed to providing a low risk, high quality, reliable solution and has worked in 130+ languages to-date supporting both large global corporations and Government organizations.

We would be glad to discuss to any customized requests or package configurations and prepare a cus-tomized proposal to meet your needs.

6. Contact Information

Prithivi Pradeep

Business Development Manager

ppradeep@appenbutlerhill.com

+61 2 9468 6370

Tom Dibert

Vice President, Business Development, North America

tdibert@appenbutlerhill.com

+1-315-339-6165

                                                         www.appenbutlerhill.com

Top



 Organisation  Events   Membership   Help 
 > Board  > Interspeech  > Join - renew  > Sitemap
 > Legal documents  > Workshops  > Membership directory  > Contact
 > Logos      > FAQ
       > Privacy policy

© Copyright 2024 - ISCA International Speech Communication Association - All right reserved.

Powered by ISCA