* * * * * * * * * * * * * * * * * * * * * * * *
LINGUIST List logo Eastern Michigan University Wayne State University *
* People & Organizations * Jobs * Calls & Conferences * Publications * Language Resources * Text & Computer Tools * Teaching & Learning * Mailing Lists * Search *
* *


LINGUIST List 24.2250

Fri May 31 2013

FYI: 40474 Split Compounds from GermaNet Available

Editor for this issue: Brent Miller <brentlinguistlist.org>

Date: 31-May-2013
From: Verena Henrich <verena.henrichuni-tuebingen.de>
Subject: 40474 Split Compounds from GermaNet Available
E-mail this message to a friend

We are happy to announce the availability of 40474 German nominal compounds from GermaNet release 8.0 that have been split into their constituent parts, i.e., modifier and head. This dataset has been constructed semi-automatically and all compound splits have been manually post-corrected.

The list of split compounds is freely available for download at
http://www.sfs.uni-tuebingen.de/GermaNet/compounds.shtml

For many applications, it is helpful to have information about the parts of the compound, as usually the semantic interpretation is based on the meaning of its parts. What makes compound splitting for German a challenging task is the fact that compounding, which is a very productive word formation process in German, is not always simple string concatenation. It often involves the presence of intervening linking elements or the elision of word-final characters in the modifier constituent of a compound.

For more information about GermaNet, please consult the project website: http://www.sfs.uni-tuebingen.de/GermaNet/


Linguistic Field(s): Computational Linguistics; Lexicography; Semantics; Text/Corpus Linguistics

Subject Language(s): German (deu)

Read more issues|LINGUIST home page|Top of issue



Page Updated: 31-May-2013

Supported in part by the National Science Foundation       About LINGUIST    |   Contact Us       ILIT Logo
While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed on its pages, it cannot vouch for their contents.