FYI: PhonItalia: a Phonological Lexicon for Italian
Announcing the availability of PhonItalia, the first comprehensive lexical database to provide phonological representations for Italian word-forms.
Each of the 120,000 entries is provided with a wide range of information in addition to the phonological representation of the word. This includes syllable boundary and stress markings, uniqueness points, neighborhood estimates, and other measures, including written word frequency and part-of-speech markers provided by the CoLFIS orthographic corpus.
Using data derived from this core lexicon, an additional range of databases has also been compiled to provide positional frequency-of-use statistics for Italian phonemes, syllables, syllable onsets and codas, plus character and phoneme bigrams.
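As a rough illustration of what positional frequency and bigram statistics mean, the following minimal Python sketch counts phonemes by position and adjacent phoneme pairs over a tiny toy word list. The transcriptions, the function names, and the unweighted (type-based) counting are all assumptions made for this example; they are not taken from PhonItalia and do not describe how its derived databases were actually compiled.

```python
from collections import Counter

# Toy entries: hypothetical phoneme sequences, NOT taken from PhonItalia.
toy_lexicon = [
    ["k", "a", "s", "a"],
    ["k", "a", "n", "e"],
    ["m", "a", "n", "o"],
]

def positional_counts(lexicon):
    """Count how often each phoneme occurs at each (0-based) position."""
    counts = Counter()
    for phonemes in lexicon:
        for position, phoneme in enumerate(phonemes):
            counts[(position, phoneme)] += 1
    return counts

def bigram_counts(lexicon):
    """Count adjacent phoneme pairs within each word."""
    counts = Counter()
    for phonemes in lexicon:
        counts.update(zip(phonemes, phonemes[1:]))
    return counts

positions = positional_counts(toy_lexicon)
bigrams = bigram_counts(toy_lexicon)

print(positions[(0, "k")])   # words with "k" in first position
print(bigrams[("a", "n")])   # occurrences of the pair "a" + "n"
```

The same counting could be weighted by written word frequency rather than by word type, by adding each entry's frequency instead of 1.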
PhonItalia and all derived databases are freely available for non-commercial research use under a Creative Commons license, and can be downloaded in Excel (.xlsx) and tab-delimited text (.txt) formats at the following URL:
Further information on the methods and details of the database, additional summarizing lexical statistics, and a demonstration of an application of the data to aphasic speech errors are also available in a companion publication.
Goslin, J., Galluzzi, C., & Romani, C. (2013). PhonItalia: a Phonological Lexicon for Italian. Behavior Research Methods. DOI: 10.3758/s13428-013-0400-8