Featured Linguist!

Jost Gippert: Our Featured Linguist!

"Buenos dias", "buenas noches" -- this was the first words in a foreign language I heard in my life, as a three-year old boy growing up in developing post-war Western Germany, where the first gastarbeiters had arrived from Spain. Fascinated by the strange sounds, I tried to get to know some more languages, the only opportunity being TV courses of English and French -- there was no foreign language education for pre-teen school children in Germany yet in those days. Read more

Donate Now | Visit the Fund Drive Homepage

Amount Raised:


Still Needed:


Can anyone overtake Syntax in the Subfield Challenge ?

Grad School Challenge Leader: University of Washington

Publishing Partner: Cambridge University Press CUP Extra Publisher Login
amazon logo
More Info

Software Details

Title: ELRA - Language Resources Catalogue - Update 09/06
Submitter: Helene Mazo
Description: Our on-line catalogue has moved to the following address:
http://catalog.elra.info. Please update your bookmarks.

We are happy to announce that new Written Language Resources are now
available in our catalogue.

*** ELRA-L0072 PAROLE-SIMPLE-CLIPS PISA Italian Lexicon ***
PAROLE-SIMPLE-CLIPS is a four-level, general purpose lexicon that has been
elaborated over three different projects. The PAROLE-SIMPLE-CLIPS Pisa
Italian Lexicon comprises a total of 387,267 phonetic units, 53,044
morphological units (53,044 lemmas), 37,406 syntactic units (28,111 lemmas)
and 28,346 semantic units (19,216 lemmas). The PAROLE-SIMPLE-CLIPS Pisa
Italian Lexicon was encoded at the semantic level, in full accordance with
the international standards set out in the PAROLE-SIMPLE model and based on
EAGLES. Syntactic and semantic encoding were performed jointly with Thamus
(Consortium for Multilingual Documentary Engineering), which is responsible
for 25,000 extra entries (to be released soon).
This lexicon is subdivided into five different subsets:
L0072-01 Full lexicon
L0072-02 Phonetic layer
L0072-03 Morphological layer
L0072-04 Syntactic layer
L0072-05 Semantic layer
For more information, see:

*** ELRA-W0043 PAROLE Italian Corpus ***
The PAROLE Italian Corpus comprises 3,135,651 words collected from four
different domains: newspapers (2,179,800 words), periodicals (143,810
words), books (564,964 words), miscellaneous (247,077 words). Data are
morphosyntactically annotated and lemmatized.
For more information, see:

*** ELRA-W0044 Italian Syntactic-Semantic Treebank (ISST) ***
For more information, see:

For more information on the catalogue, please contact Valérie Mapelli
Linguistic Field(s): Computational Linguistics
Text/Corpus Linguistics

Language Specialty: Italian

LL Issue: 17.2698
Date Posted: 21-Sep-2006

Search Again

Back to Software Index