Publishing Partner: Cambridge University Press CUP Extra Wiley-Blackwell Publisher Login

FYI: New Falko German Learner Corpus Release

Author: Marc Reznicek

Linguistic Field(s): Text/Corpus Linguistics
Language Acquisition

Subject Language(s): German

FYI Body: The error-annotated German learner corpus Falko has released a new
subcorpus: FalkoEssayL2WHIGv2.0 including 195 argumentative essays by
advanced learners of German (117,189 tokens).

For each text two full-text target hypotheses (a minimal morphosyntactic
normalization and an extended semantic-pragmatic version) have been manually

Each representation has been POS-tagged and lemmatized (Treetagger &
rfTagger). rfTagger morphological annotation has been integrated as well.

On this basis, tags indicating differences between the learner text and its
POS and lemma annotations and the respective target hypotheses (POS & lemma)
have been added.

The corpus is freely available under the following link:

The annotation guidelines can be found here:

Back   FYI main page