

LINGUIST List 23.3863

Mon Sep 17 2012

FYI: New Falko German Learner Corpus Release

Editor for this issue: Brent Miller <brent@linguistlist.org>

Date: 17-Sep-2012
From: Marc Reznicek <marc.reznicek@staff.hu-berlin.de>
Subject: New Falko German Learner Corpus Release

A new subcorpus of the error-annotated German learner corpus Falko has been
released: FalkoEssayL2WHIGv2.0, which contains 195 argumentative essays by
advanced learners of German (117,189 tokens).

For each text, two full-text target hypotheses (a minimal morphosyntactic
normalization and an extended semantic-pragmatic version) have been manually
annotated.

Each representation has been POS-tagged and lemmatized (TreeTagger and
RFTagger). RFTagger morphological annotation has been integrated as well.

On this basis, tags have been added that mark differences between the
learner text (with its POS and lemma annotations) and the respective target
hypotheses (with their POS and lemma annotations).
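
As a rough illustration of what such difference tags capture, here is a
minimal Python sketch that aligns a learner sentence with a target
hypothesis token by token and labels each mismatch. It is not the Falko
pipeline: the function name diff_tags, the tag names CHA/DEL/INS, and the
example sentences are assumptions made for this illustration only, and the
alignment uses Python's standard difflib module rather than whatever method
the corpus builders used.

```python
from difflib import SequenceMatcher

# Illustrative tag names only (hypothetical, not taken from the Falko tagset).
# They describe the edit needed to turn the learner text into the target:
#   CHA = token(s) changed, DEL = learner token(s) removed,
#   INS = token(s) added in the target hypothesis.
TAG_NAMES = {"replace": "CHA", "delete": "DEL", "insert": "INS"}


def diff_tags(learner, target):
    """Align two token lists and return one tag per differing span."""
    matcher = SequenceMatcher(a=learner, b=target, autojunk=False)
    tags = []
    for op, i1, i2, j1, j2 in matcher.get_opcodes():
        if op != "equal":
            tags.append((TAG_NAMES[op], learner[i1:i2], target[j1:j2]))
    return tags


# Invented example sentences (not from the corpus).
learner = "Ich habe gestern ins Kino gegangen".split()
target = "Ich bin gestern ins Kino gegangen".split()
print(diff_tags(learner, target))
```

The same function could be run over parallel POS or lemma sequences instead
of word forms to get differences on those layers.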

The corpus is freely available at the following link:

http://korpling.german.hu-berlin.de/falko-suche

The annotation guidelines can be found here:
http://www.linguistik.hu-berlin.de/institut/professuren/korpuslinguistik/forschung/falko/Falko-Handbuchv2.0.pdf



Linguistic Field(s): Language Acquisition; Text/Corpus Linguistics

Subject Language(s): German (deu)



Page Updated: 17-Sep-2012
