LINGUIST List 22.4529|
Sat Nov 12 2011
FYI: German SALSA Corpus Release 2.0
Editor for this issue: Brent Miller
To post to LINGUIST, use our convenient web form at http://linguistlist.org/LL/posttolinguist.cfm.
1. Josef Ruppenhofer ,
German SALSA Corpus Release 2.0
Message 1: German SALSA Corpus Release 2.0
From: Josef Ruppenhofer <josefrcoli.uni-saarland.de>
Subject: German SALSA Corpus Release 2.0
E-mail this message to a friend
The second and final release of the SALSA corpus, a German corpus
with semantic role annotations in the Berkeley FrameNet paradigm is
available for download at http://www.coli.uni-
The corpus was created by the SALSA project at Saarland University
under the direction of Manfred Pinkal. Work on the corpus was
supported by funds from a Leibniz prize awarded to Manfred Pinkal
and by the German Science Foundation (DFG; grants PI 154/9-3, PI
The frame semantic annotations are applied on top of the TIGER
treebank, a syntactically annotated German newspaper corpus. Salsa
release 2 references TIGER version 2.1.
More information on TIGER and FrameNet can be found here:
SALSA uses the frames of FrameNet releases 1.2 and 1.3 for the
German annotation, wherever available and appropriate. In addition,
SALSA has developed a number of ''proto-frames'', i.e., predicate-
specific frames, to provide coverage for predicate instances currently
not covered by FrameNet. The total size of the annotation is roughly
20.000 verbal target instances and, new in Salsa release 2, more than
17.000 nominal target instances.
More information on SALSA can be found on the website:
The annotation scheme is described in:
A. Burchardt, K. Erk, A. Frank, A. Kowalski, S. Pado and M. Pinkal. The
SALSA Corpus: a German Corpus Resource for Lexical Semantics. In:
Proceedings of LREC 2006, Genoa, Italy.
If you have any questions, feel free to send an email to
Linguistic Field(s): Computational Linguistics; Semantics; Text/Corpus Linguistics
Read more issues|LINGUIST home page|Top of issue
Page Updated: 12-Nov-2011
While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed
on its pages, it cannot vouch for their contents.