LINGUIST List 27.3994
Fri Oct 07 2016
Software: Computational Linguistics; Text/Corpus Linguistics: MITRE Annotation Toolkit 3.1
Editor for this issue: Amanda Foster <amandalinguistlist.org>
Date: 07-Oct-2016
From: Samuel Bayer <sam mitre.org>
Subject: Computational Linguistics; Text/Corpus Linguistics: MITRE Annotation Toolkit 3.1
All -
My colleagues and I are pleased to announce the availability of version 3.1 of the MITRE Annotation Toolkit.
- What is it?
MAT is a suite of cooperating, loosely-coupled tools for supporting the human annotator in constructing models and corpora for computational linguistics and analysis, especially using the iterative 'tag a little, learn a little' workflow. MAT includes MITRE's open source Java Carafe CRF trainer/tagger; a Web-based annotation and process control UI; a flexible and powerful scorer; an XML-configurable experiment harness; a workflow manager; and more.
MAT is platform-independent, open source, and compatible (with some effort) with multiple trainer/tagger engines and hand annotation tools.
You can learn about MAT 3.1, and download it, at
http://mat-annotation.sf.net
- What's new in version 3.1?
This release introduces cross-validation in the experiment engine; a much more streamlined way of declaring annotation sets; the ability to infer tasks in the UI from annotated documents; and significant UI enhancements, including a guided mode for filling in annotation popups and much better management of overlapping annotations. For additional details, visit the documentation at
http://mat-annotation.sf.net/current_docs/html/index.html and select "Upgrade and release notes" from the documentation sidebar.
MAT 3.1 is completely backward compatible with MAT 3.0. If you are upgrading from MAT 2.0, you may have to perform some conversions; see the release notes for details.
- How stable is it?
For many years, MAT has served as the underpinning of MITRE's Identification Scrubber Toolkit (MIST), an open-source package for deidentification of free-text medical documents. As MIST, it has been used by a wide range of researchers in this area. MAT itself has been used extensively by MITRE in its internal research and development projects, and by a variety of MITRE's sponsors.
- Is it supported?
Unfortunately, no. MITRE doesn't (currently) have the resources to provide public open-source support. There is a public mailing list on SourceForge, which the MITRE developers subscribe to, but we likely won't have the resources to answer.
On the other hand, MAT has been under development for almost ten years, has seen a good deal of use, and comes with a ton of documentation.
If you download MAT, and you like it, please let us know.
Cordially,
Sam Bayer
The MITRE Corporation
sam mitre.org
Linguistic Field(s): Computational Linguistics; Text/Corpus Linguistics
Page Updated: 07-Oct-2016