* * * * * * * * * * * * * * * * * * * * * * * *
LINGUIST List logo Eastern Michigan University Wayne State University *
* People & Organizations * Jobs * Calls & Conferences * Publications * Language Resources * Text & Computer Tools * Teaching & Learning * Mailing Lists * Search *
* *
LINGUIST List 22.4529

Sat Nov 12 2011

FYI: German SALSA Corpus Release 2.0

Editor for this issue: Brent Miller <brentlinguistlist.org>

To post to LINGUIST, use our convenient web form at http://linguistlist.org/LL/posttolinguist.cfm.
        1.     Josef Ruppenhofer , German SALSA Corpus Release 2.0

Message 1: German SALSA Corpus Release 2.0
Date: 11-Nov-2011
From: Josef Ruppenhofer <josefrcoli.uni-saarland.de>
Subject: German SALSA Corpus Release 2.0
E-mail this message to a friend

The second and final release of the SALSA corpus, a German corpus
with semantic role annotations in the Berkeley FrameNet paradigm is
available for download at http://www.coli.uni-

The corpus was created by the SALSA project at Saarland University
under the direction of Manfred Pinkal. Work on the corpus was
supported by funds from a Leibniz prize awarded to Manfred Pinkal
and by the German Science Foundation (DFG; grants PI 154/9-3, PI

The frame semantic annotations are applied on top of the TIGER
treebank, a syntactically annotated German newspaper corpus. Salsa
release 2 references TIGER version 2.1.

More information on TIGER and FrameNet can be found here:


SALSA uses the frames of FrameNet releases 1.2 and 1.3 for the
German annotation, wherever available and appropriate. In addition,
SALSA has developed a number of ''proto-frames'', i.e., predicate-
specific frames, to provide coverage for predicate instances currently
not covered by FrameNet. The total size of the annotation is roughly
20.000 verbal target instances and, new in Salsa release 2, more than
17.000 nominal target instances.

More information on SALSA can be found on the website:


The annotation scheme is described in:

A. Burchardt, K. Erk, A. Frank, A. Kowalski, S. Pado and M. Pinkal. The
SALSA Corpus: a German Corpus Resource for Lexical Semantics. In:
Proceedings of LREC 2006, Genoa, Italy.

If you have any questions, feel free to send an email to


Linguistic Field(s): Computational Linguistics; Semantics; Text/Corpus Linguistics

Read more issues|LINGUIST home page|Top of issue

Page Updated: 12-Nov-2011

Supported in part by the National Science Foundation       About LINGUIST    |   Contact Us       ILIT Logo
While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed on its pages, it cannot vouch for their contents.