LINGUIST List 15.3192
Sat Nov 13 2004
Sum: Phonetically Transcribed Italian Corpus
Editor for this issue: Jessica Boynton <jessica@linguistlist.org>
Message 1: Phonetically Transcribed Italian Corpus
From: Christina Villafana <cmv2@georgetown.edu>
Subject: Phonetically Transcribed Italian Corpus
Regarding query http://www.linguistlist.org/issues/15/15-3137.html
Thanks to the following people who responded to my recent query:
Caren Brinckmann, Saarland University
Federico Albano Leoni, Università di Napoli
Giuliana Fiorentino, Università Roma Tre
The AVIP (Archivio delle Varietà di Italiano Parlato) corpus, a joint
project involving the Laboratorio Linguistica of the Scuola Normale
Superiore in Pisa and the Linguistics Department of the Università di
Napoli Federico II, contains a collection of Italian map-task dialogues.
75 minutes of spontaneous speech are phonetically segmented and labelled.
There is a short description on LINGUIST List:
And a description of the project is available at:
This corpus is freely available via ftp: http://ftp.cirass.unina.it/avip/
with documentation under http://ftp.cirass.unina.it/avip/doc_app/
(mostly in Italian).
Another corpus, API (Archivio di Parlato Italiano), is freely available on
DVD. To request a copy, email Paola Petrone, CIRASS, Università di Napoli
(petrone@unina.it). A query generator is included on the DVD.
From what I have understood, most of the phonetic transcriptions are in
SAMPA or X-SAMPA, and the transcription files are split into speaker
turns, so they need to be concatenated before further processing.
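A minimal sketch of that concatenation step, in Python. It rests on assumptions that are not taken from the corpus documentation: each speaker turn is a plain-text file of whitespace-separated SAMPA symbols, and sorting the file names gives the turn order. The function name concatenate_turns and the "*.txt" pattern are hypothetical and would need adapting to the actual file layout.

```python
from pathlib import Path


def concatenate_turns(turn_dir, pattern="*.txt"):
    """Join the symbols of every turn file in turn_dir into one list.

    Assumption: one file per speaker turn, symbols separated by
    whitespace, and file names that sort into turn order.
    """
    tokens = []
    for path in sorted(Path(turn_dir).glob(pattern)):
        # split() with no argument also drops newlines and repeated spaces
        tokens.extend(path.read_text(encoding="utf-8").split())
    return tokens
```

If the turn files carry timestamps or speaker labels alongside the symbols, those fields would have to be stripped before the split.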
This information has been extremely helpful, but I am still looking for an
easy way to get phoneme frequencies for Standard Italian, so if anyone has
further information, please let me know!
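For anyone attempting the same count, here is a rough Python sketch of how relative phoneme frequencies could be computed once a transcription is available as a flat list of symbols. The function name phoneme_frequencies is hypothetical, and the input shown is a made-up toy string, not data from AVIP or API.

```python
from collections import Counter


def phoneme_frequencies(tokens):
    """Return {symbol: relative frequency}, most frequent symbol first."""
    counts = Counter(tokens)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {symbol: n / total for symbol, n in counts.most_common()}


# Toy input for illustration only (not corpus data).
freqs = phoneme_frequencies("k a z a m a r e".split())
```

Because spontaneous-speech transcriptions reflect what speakers actually produced, frequencies obtained this way describe the recorded varieties rather than an idealised Standard Italian.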
Department of Linguistics
Linguistic Field(s): Phonetics; Text/Corpus Linguistics