Corpora English and German
|Submitter Email:||click here to access email|
For Query: Linguist 11.1877
a long while back I asked for information on German and English corpora which are tagged for grammatical functions, as well as for accessible parallel English-German corpora. Here is a summary of the replies I got.
ENGLISH GRAMMATICALLY TAGGED CORPORA
Joybrato Mukherjee (firstname.lastname@example.org) drew my attention to the International Corpus of English, which can be ordered at the following website (which also allows you to download a very nice demo version):
GERMAN GRAMMATICALLY TAGGED CORPORA
George Smith (email@example.com) drew my attention to the NEGRA and TIGER projects, which can be reached via the following websites:
PARALLEL CORPORA GERMAN-ENGLISH
Anatol Stefanowitsch (firstname.lastname@example.org) drew my attention to a small web-accessible parallel corpus at the University of Chemnitz:
Some people have their own collections of parallel texts, which they may or may not be willing to share with others (there may be copyright issues here).
The two that agreed to be mentioned here are
- Raphael Salkie (R.M.Salkie@bton.ac.uk), who has a collection of parallel texts from websites, literature, manuals, EU- documents, political writing and speeches coming to about 800.000 words in each language.
- Anatol Stefanowitsch, who has a small collection of parallel texts from news magazines (about 15,000 words), and who is in the process of assembling a larger parallel corpus of narrative writing.
Martin Frost (Martin@sinequa.com) drew my attention to the following websites:
Thanks also to Klaus Abels, Petra Steiner, and Monika Budde for other helpful hints.
Take care now,
|Original Query:||Read original query|
Sums main page