
Software Details

Title: ELRA - Language Resources Catalogue - Update 03/07
Submitter: Helene Mazo
Description: ELRA is happy to announce that three new speech-related resources
are now available in its catalogue. In addition, the years 2005 and 2006 of the
text corpus of 'Le Monde' (ELRA-W0015) are now available.

ELRA-S0235 LC-STAR Hebrew (Israel) phonetic lexicon
The LC-STAR Hebrew (Israel) phonetic lexicon comprises 109,580 words,
including a set of 62,431 common words, a set of 47,149 proper names
(including person names, family names, cities, streets, companies and brand
names) and a list of 8,677 special application words. The lexicon is
provided in XML format and includes phonetic transcriptions in SAMPA.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=984&language=en

ELRA-S0236 LC-STAR English-Hebrew (Israel) Bilingual Aligned Phrasal lexicon
The LC-STAR English-Hebrew (Israel) Bilingual Aligned Phrasal lexicon
comprises 10,520 phrases from the tourist domain. It is based on a list of
short sentences obtained by translating a US English phrasal corpus of
10,449 phrases. The lexicon is provided in XML format.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=985&language=en

ELRA-S0237 LC-STAR US English phonetic lexicon
The LC-STAR US English phonetic lexicon comprises 102,310 words, including
a set of 51,119 common words, a set of 51,111 proper names (including
person names, family names, cities, streets, companies and brand names) and
a list of 6,807 special application words. The lexicon is provided in XML
format and includes phonetic transcriptions in SAMPA.
For more information, see:
http://catalog.elra.info/product_info.php?products_id=986&language=en

ELRA-W0015 Text corpus of 'Le Monde'
Corpus from the 'Le Monde' newspaper. Years 1987 to 2002 are available in
ASCII text format. Years 2003 to 2006 are available in XML format. Each
month consists of some 10 MB of data (circa 120 MB per year).
For more information, see:
http://catalog.elra.info/product_info.php?products_id=438&language=en


For more information on the catalogue, please contact Valérie Mapelli at
mapelli@elda.org.

Our on-line catalogue has moved to the following address:
http://catalog.elra.info. Please update your bookmarks.
Linguistic Field(s): Computational Linguistics

LL Issue: 18.918
Date Posted: 27-Mar-2007
