The aim of this volume is to showcase the range of corpus-based linguistic
research currently under way on languages other than English.
The papers included report on work carried out on Arabic, Bulgarian, Czech,
Dutch, French, German, Biblical Greek, Biblical Hebrew, Medieval Irish,
Korean, Romanian and Swedish, including a number of regional and social
variants. They also address a range of areas as diverse as corpus design,
corpus annotation, register analysis, syntax, and quantitative linguistics.
The papers in this volume will leave the reader in no doubt that
corpus-based research is now being conducted for a whole "rainbow of
languages".
Contents
1. The PARIS 7 annotated corpus for French: some experimental results
Anne Abeillé, Lionel Clément, Alexandra Kinyon, François Toussenel
2. Lexical frequency of contemporary Canadian French based on a large corpus
Martin Beaudoin and Michel Simard
3. The Corpus of Electronic Texts: A digital lexicon of Medieval Irish and
an Irish prosopography
Beatrix Färber
4. A corpus of written Italian: a defined and a dynamic model
R. Rossini Favretti, F. Tamburini and C. De Santis
5. A reusable corpus needs syntactic annotations: Prague Dependency Treebank
Eva Hajičová and Petr Sgall
6. Variation across Korean text registers
Beom-mo Kang, Hung-gyu Kim and Myung-hoe Huh
7. A tagset for the morphosyntactic tagging of Arabic
Shereen Khoja, Roger Garside and Gerry Knowles