LINGUIST List 15.3192
Sat Nov 13 2004
Sum: Phonetically Transcribed Italian Corpus
Editor for this issue: Jessica Boynton <jessica
linguistlist.org>
To post to LINGUIST, use our convenient web form at
http://linguistlist.org/LL/posttolinguist.html.
Directory
1. Christina
Villafana,
Phonetically Transcribed Italian Corpus
Message 1: Phonetically Transcribed Italian Corpus
Date: 13-Nov-2004
From: Christina Villafana <cmv2
georgetown.edu>
Subject: Phonetically Transcribed Italian Corpus
Regarding query http://www.linguistlist.org/issues/15/15-3137.html
Thanks to the following people who responded to my recent query:
Caren Brinckmann, Saarland University
Federico Albano Leoni, Universita' di Napoli
Kristie McCrary
Giuliana Fiorentino, Universita' Roma Tre
The AVIP (Archivio delle Varietà di Italiano Parlato) corpus, a joint
project with the Laboratorio Linguistica of the Scuola Normale Superiore in
Pisa and the Linguistics Department of the Universita' di Napoli Federico
II has a collection of Italian map-task dialogues. 75 minutes of
spontaneous speech are phonetically segmented and labelled.
There is a short description on LINGUIST List:
http://linguistlist.org/issues/13/13-611.html#2
And a description of the project is available at:
http://www.cirass.unina.it/ricerca/studi%20parlato/raccolta%20corpora/raccoltacorpora.htm
This corpus is freely available via ftp: http://ftp.cirass.unina.it/avip/
with documentation under http://ftp.cirass.unina.it/avip/doc_app/
(mostly in Italian).
Another corpus, API (Archivio di Parlato Italiano), is available freely on
DVD by contacting Paola Petrone, CIRASS, Università di Napoli,
petrone
unina.it via email. There is a query generator included on the DVD.
From what I have understood, most of the phonetic transcriptions are done
in SAMPA or X-SAMPA, and those files with transcriptions are chopped into
speaker turns and so therefore need to be concatenated.
This information has been extremely helpful, but I am still looking for an
easy way to get phoneme frequencies for Standard Italian, so if anyone has
further information, please let me know!
Christina Villafana
Department of Linguistics
Georgetown University
Washington DC
cmv2
georgetown.edu
Linguistic Field(s): Phonetics; Text/Corpus Linguistics
Respond to list|Read more issues|LINGUIST home page|Top of issue