LINGUIST List 22.4529

Sat Nov 12 2011

FYI: German SALSA Corpus Release 2.0

Editor for this issue: Brent Miller <brentlinguistlist.org>


        1.     Josef Ruppenhofer , German SALSA Corpus Release 2.0


Message 1: German SALSA Corpus Release 2.0
Date: 11-Nov-2011
From: Josef Ruppenhofer <josefrcoli.uni-saarland.de>
Subject: German SALSA Corpus Release 2.0
E-mail this message to a friend

The second and final release of the SALSA corpus, a German corpuswith semantic role annotations in the Berkeley FrameNet paradigm isavailable for download at http://www.coli.uni-saarland.de/projects/salsa/corpus/.

The corpus was created by the SALSA project at Saarland Universityunder the direction of Manfred Pinkal. Work on the corpus wassupported by funds from a Leibniz prize awarded to Manfred Pinkaland by the German Science Foundation (DFG; grants PI 154/9-3, PI154/8-1).

The frame semantic annotations are applied on top of the TIGERtreebank, a syntactically annotated German newspaper corpus. Salsarelease 2 references TIGER version 2.1.

More information on TIGER and FrameNet can be found here:

http://www.ims.uni-stuttgart.de/projekte/TIGER/https://framenet.icsi.berkeley.edu/fndrupal/

SALSA uses the frames of FrameNet releases 1.2 and 1.3 for theGerman annotation, wherever available and appropriate. In addition,SALSA has developed a number of ''proto-frames'', i.e., predicate-specific frames, to provide coverage for predicate instances currentlynot covered by FrameNet. The total size of the annotation is roughly20.000 verbal target instances and, new in Salsa release 2, more than17.000 nominal target instances.

More information on SALSA can be found on the website:

http://www.coli.uni-saarland.de/projects/salsa/

The annotation scheme is described in:

A. Burchardt, K. Erk, A. Frank, A. Kowalski, S. Pado and M. Pinkal. TheSALSA Corpus: a German Corpus Resource for Lexical Semantics. In:Proceedings of LREC 2006, Genoa, Italy.

If you have any questions, feel free to send an email to

salsa-mitcoli.uni-sb.de

Linguistic Field(s): Computational Linguistics; Semantics; Text/Corpus Linguistics

Page Updated: 12-Nov-2011