LINGUIST List 24.4016|
Mon Oct 14 2013
Software: Computational Linguistics: DKPro WSD 0.9.1
Editor for this issue: Andrew Lamont
From: Tristan Miller <millerukp.informatik.tu-darmstadt.de>
Subject: Computational Linguistics: DKPro WSD 0.9.1
E-mail this message to a friend
We are pleased to announce the first major release of DKPro WSD, version 0.9.1.
DKPro WSD is a modular, extensible Java framework for word sense disambiguation. It provides UIMA components which encapsulate corpus readers, linguistic annotators, lexical semantic resources, disambiguation algorithms, and evaluation and reporting tools.
To obtain the software, please visit its websites on Google Code:
More about DKPro WSD
DKPro WSD supports rapid design and evaluation of word sense disambiguation algorithms and pipelines. With the framework you can:
* evaluate any number of algorithms simultaneously
* compute common evaluation metrics: coverage, precision, recall, F1
* produce confusion matrices to compare pairs of algorithms
* filter or break down results by part of speech, etc.
* show results with and without user-specific backoff algorithms
* rescore results with a user-specified sense clustering
* visualize disambiguation algorithms in action
* perform parameter sweeping to find optimal system configurations
* run related tasks such as entity linking and word sense induction
There is out-of-the-box support for many different data sets (Senseval, SemEval, SemCor, AIDA CoNLL-YAGO, TAC KBP, MASC, etc.) and lexical semantic resources (WordNet, UBY, Wiktionary, etc.).
DKPro WSD is free software, released under the Apache License and GNU General Public License.
Changes since the preview release (0.9.0):
* upgraded to DKPro Core 1.5.0, uimaFIT 2.0.0, UBY 0.4.0, and TWSI 1.0.1
* added module for word sense induction
* moved Wikipedia-specific graph algorithms to a separate module
Linguistic Field(s): Computational Linguistics
Read more issues|LINGUIST home page|Top of issue
Page Updated: 14-Oct-2013
While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed
on its pages, it cannot vouch for their contents.