FYI: UBY: A Large-Scale Lexical-Semantic Resource
We are pleased to announce the release of UBY 1.0 - a large-scale
lexical-semantic resource for natural language processing (NLP) based
on the ISO standard Lexical Markup Framework (LMF).
UBY combines a wide range of information from expert-constructed
and collaboratively constructed resources for English and German.
Currently, UBY holds structurally and semantically interoperable
versions of nine resources in two languages:
* English WordNet, Wiktionary, Wikipedia, FrameNet and VerbNet,
* German Wikipedia, Wiktionary and GermaNet, and multilingual OmegaWiki.
A subset of these resources is linked at the word sense level. There
are monolingual sense alignments between VerbNet–FrameNet and
VerbNet–WordNet as well as between WordNet–Wikipedia and WordNet–Wiktionary.
In addition, UBY provides cross-lingual sense alignments between
WordNet and German OmegaWiki, also including the inter-language
links given in Wikipedia and OmegaWiki.
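As a rough illustration of what linking at the word sense level means, the following Java sketch stores alignments as symmetric links between sense keys and looks up the senses of one resource that are aligned with a sense of another. It is not the UBY-API: the class SenseAlignmentSketch, its methods, the "resource:senseId" key format and the sense IDs are all hypothetical and were chosen only for this example.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

/** Illustrative sketch only; this class is hypothetical and not part of the UBY-API. */
final class SenseAlignmentSketch {

    // Maps a sense key ("resource:senseId") to the keys of all senses aligned with it.
    private final Map<String, Set<String>> links = new HashMap<>();

    /** Builds a sense key from a resource name and a resource-local sense ID. */
    static String key(String resource, String senseId) {
        return resource + ":" + senseId;
    }

    /** Records a symmetric alignment between two senses. */
    void align(String senseA, String senseB) {
        links.computeIfAbsent(senseA, k -> new TreeSet<>()).add(senseB);
        links.computeIfAbsent(senseB, k -> new TreeSet<>()).add(senseA);
    }

    /** Returns the senses of targetResource that are directly aligned with the given sense. */
    List<String> alignedIn(String sense, String targetResource) {
        List<String> result = new ArrayList<>();
        for (String other : links.getOrDefault(sense, Collections.<String>emptySet())) {
            if (other.startsWith(targetResource + ":")) {
                result.add(other);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        SenseAlignmentSketch alignments = new SenseAlignmentSketch();
        // The sense IDs below are made-up placeholders, not real identifiers.
        String vn = key("VerbNet", "vn-sense-1");
        String fn = key("FrameNet", "fn-sense-1");
        String wn = key("WordNet", "wn-sense-1");
        alignments.align(vn, fn);
        alignments.align(vn, wn);
        System.out.println(alignments.alignedIn(vn, "WordNet"));
        System.out.println(alignments.alignedIn(wn, "VerbNet"));
    }
}
```

In this sketch only direct links are followed, so a VerbNet sense aligned with both a FrameNet and a WordNet sense does not by itself make those two senses aligned with each other.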
All resources in UBY are represented according to our LMF lexicon model, UBY-LMF.
UBY-LMF captures lexical information at a fine-grained level by
employing a large number of Data Categories from ISOCat.
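To make the idea of a uniform representation more concrete, here is a minimal Java sketch of how entries from different source resources might share one structure. The names Lexicon, LexicalEntry and Sense follow the core classes of the LMF standard, but the fields, the lookup method and the sample data are assumptions made for this illustration; they are not the UBY-LMF model or the UBY-API.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative sketch only; a simplified, hypothetical model, not UBY-LMF itself. */
final class UniformLexiconSketch {

    /** One sense of a lexical entry. */
    static final class Sense {
        final String id;
        final String definition;

        Sense(String id, String definition) {
            this.id = id;
            this.definition = definition;
        }
    }

    /** A lemma with its part of speech and senses. */
    static final class LexicalEntry {
        final String lemma;
        final String partOfSpeech;
        final List<Sense> senses = new ArrayList<>();

        LexicalEntry(String lemma, String partOfSpeech) {
            this.lemma = lemma;
            this.partOfSpeech = partOfSpeech;
        }
    }

    /** All entries converted from one source resource in one language. */
    static final class Lexicon {
        final String name;
        final String language;
        final List<LexicalEntry> entries = new ArrayList<>();

        Lexicon(String name, String language) {
            this.name = name;
            this.language = language;
        }

        /** Returns every entry whose lemma equals the given string. */
        List<LexicalEntry> lookup(String lemma) {
            List<LexicalEntry> hits = new ArrayList<>();
            for (LexicalEntry entry : entries) {
                if (entry.lemma.equals(lemma)) {
                    hits.add(entry);
                }
            }
            return hits;
        }
    }

    public static void main(String[] args) {
        // Sample data is made up; IDs and definitions are placeholders.
        Lexicon english = new Lexicon("WordNet", "en");
        LexicalEntry run = new LexicalEntry("run", "verb");
        run.senses.add(new Sense("wn-sense-1", "placeholder definition"));
        english.entries.add(run);

        Lexicon german = new Lexicon("GermaNet", "de");
        LexicalEntry laufen = new LexicalEntry("laufen", "verb");
        laufen.senses.add(new Sense("gn-sense-1", "placeholder definition"));
        german.entries.add(laufen);

        // The same lookup code works for both lexicons because they share one model.
        for (Lexicon lexicon : List.of(english, german)) {
            String lemma = lexicon.language.equals("en") ? "run" : "laufen";
            for (LexicalEntry entry : lexicon.lookup(lemma)) {
                System.out.println(lexicon.name + ": " + entry.lemma
                        + " (" + entry.partOfSpeech + "), senses: " + entry.senses.size());
            }
        }
    }
}
```

The point of the sketch is only that once heterogeneous resources are converted into one model, a single piece of client code can query all of them in the same way.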
Highlights of UBY:
* The union of a wide range of heterogeneous resources in a single,
* The linking at the word sense level between a subset of the resources.
UBY is complemented by a Java API, the UBY-API, and conversion
tools (e.g., for converting the resources to UBY-LMF).
The UBY-API and conversion tools are available at Google Code.
Highlights of the UBY-API:
* Unified access to the various information types in the nine resources.
* Easy cross-resource access to the various information types in the
A tutorial showing the use of the UBY-API can be found at
A Web Interface for exploring and visualizing UBY is currently being
developed and will soon be available at http://www.ukp.tu-
This project was initiated under the auspices of Prof. Dr. Iryna
Gurevych, Ubiquitous Knowledge Processing Lab (UKP), Technische Universität Darmstadt.
We are grateful for the generous financial support from the
Volkswagen Foundation and the German Research Foundation.
Please direct any questions or suggestions to
Dr. Judith Eckle-Kohler
Ubiquitous Knowledge Processing Lab (UKP-TUDA)
FB 20 Computer Science Department
Technische Universität Darmstadt
Hochschulstr. 10, D-64289 Darmstadt, Germany
phone [+49] (0)6151 16-6166, fax -5455, room S2/02/B115
Web Research at TU Darmstadt (WeRC) www.werc.tu-darmstadt.de