LINGUIST List 32.2012

Thu Jun 10 2021

FYI: The Speech in Cantonese and English (SpiCE) Bilingual Speech Corpus

Editor for this issue: Everett Green <>

Date: 09-Jun-2021
From: Khia A. Johnson <>
Subject: The Speech in Cantonese and English (SpiCE) Bilingual Speech Corpus
E-mail this message to a friend

Khia A. Johnson and the University of British Columbia Speech-in-Context Lab ( are excited to announce the publication of "Speech in Cantonese and English (SpiCE)," an audio corpus of conversational Cantonese-English bilingual speech recorded in Vancouver, Canada during 2018-2020. SpiCE includes high-quality recordings of 34 early bilinguals in both languages, along with manually-corrected orthographic transcripts and force-aligned phone level annotations. SpiCE is an open-access corpus, available at

For more information, see the corpus documentation:

SpiCE was funded by the UBC Public Scholars Initiative and a UBC Arts Graduate Research Award to Khia A. Johnson and by a Social Sciences and Humanities Research Council of Canada (SSHRC) Insight Grant to Molly Babel.

Linguistic Field(s): Cognitive Science; Computational Linguistics; Phonetics; Phonology; Psycholinguistics; Text/Corpus Linguistics

Subject Language(s): Chinese, Yue (yue)
                            English (eng)

Page Updated: 10-Jun-2021