* * * * * * * * * * * * * * * * * * * * * * * *
LINGUIST List logo Eastern Michigan University Wayne State University *
* People & Organizations * Jobs * Calls & Conferences * Publications * Language Resources * Text & Computer Tools * Teaching & Learning * Mailing Lists * Search *
* *
LINGUIST List 20.3020

Tue Sep 08 2009

Software: ELRA: Distribution Agreement signed for BioLexicon

Editor for this issue: Hannah Morales <hannahlinguistlist.org>

To post to LINGUIST, use our convenient web form at http://linguistlist.org/LL/posttolinguist.html.
        1.    Helene Mazo, ELRA: Distribution Agreement signed for BioLexicon

Message 1: ELRA: Distribution Agreement signed for BioLexicon
Date: 08-Sep-2009
From: Helene Mazo <mazoelda.org>
Subject: ELRA: Distribution Agreement signed for BioLexicon
E-mail this message to a friend

ELRA together with the European Bioinformatics Institute (EBI, Hinxton,
UK), Istituto di Linguistica Computazionale-Consiglio Nazionale Ricerche
(ILC-CNR, Pisa, Italy), and the National Centre for Text Mining (NaCTeM,
University of Manchester, UK) has signed a Language Resources distribution
agreement for a large-scale English language terminological resource in the
biomedical domain: BioLexicon.

Biological terminology is a frequent cause of analysis errors when
processing literature written in the biology domain, due largely to the
high degree of variation in term forms, to the frequent mismatches between
labels of controlled vocabularies and ontologies on the one hand and the
forms actually occurring in text on the other, and to the lack of detailed
formal information on the linguistic behavior of domain terms. For example,
'retro-regulate' is a terminological verb often used in molecular biology
but it is not included in conventional dictionaries.

BioLexicon is a linguistic resource for the biology domain, tailored to
cope with these problems. It contains information on:
- terminological nouns, including nominalised verbs and proper names (e.g.
gene names)
- terminological adjectives
- terminological adverbs
- terminological verbs
- general English words frequently used in the biology domain

Existing information on terms was integrated, augmented, complemented and
linked, through processing of massive amounts of biomedical text, to yield
inter alia over 2.2M entries, and information on over 1.8M variants and on
over 2M synonymy relations. Moreover, extensive information is provided on
how verbs and nominalised verbs in the domain behave at both syntactic and
semantic levels, supporting thus applications aiming at discovery of
relations and events involving biological entities in text. This
comprehensive coverage of biological terms makes BioLexicon a unique
linguistic resource within the domain. It is primarily intended to support
text mining and information retrieval in the biomedical domain, however its
standards-based structure and rich content make it a valuable resource for
many other kinds of application.

On behalf of ELRA, ELDA will act as the distribution agency by
incorporating the BioLexicon in the ELRA Language Resources catalogue. With
these resources, ELRA is willing to extend the current catalogue, by
offering specialized resources and thus allow a better coverage of the
language. For more information on BioLexicon (catalogue reference:
ELRA-S0373): http://catalog.elra.info/product_info.php?products_id=1113

For more information on the ELRA catalogue, please contact Valérie Mapelli,

For more information on ELRA & ELDA, please contact Khalid Choukri,
choukrielda.org; Hélène Mazo, mazoelda.org

55-57, rue Brillat Savarin
75013 Paris (France)
Tel.: +33 1 43 13 33 33
Fax: +33 1 43 13 33 30

ELRA is a non-profit making organisation founded by the European Commission
in 1995, with the mission of providing a clearing house for language
resources and promoting Human Language Technologies (HLT). To find out more
about ELRA, please visit http://www.elra.info.

ELDA is ELRA's operational body. ELDA identifies, collects, markets, and
distributes language resources, along with the dissemination of general
information in the field of HLT. ELDA also participates in some evaluation
projects and campaigns, has considerable knowledge and skills in HLT
applications and has participated in many French, European and
international projects. To find out more about ELDA, please visit

To find out more about BioLexicon partners, please visit:
EBI: http://www.ebi.ac.uk
ILC-CNR: http://www.ilc.cnr.it
NaCTeM: http://www.nactem.ac.uk

Linguistic Field(s): Computational Linguistics

Read more issues|LINGUIST home page|Top of issue

Please report any bad links or misclassified data

LINGUIST Homepage | Read LINGUIST | Contact us

NSF Logo

While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed
on its pages, it cannot vouch for their contents.