Software Details
| Title: | ELRA: Distribution Agreement signed for BioLexicon |
|---|---|
| Submitter: | Helene Mazo |
| Description: | ELRA together with the European Bioinformatics Institute (EBI, Hinxton, UK), Istituto di Linguistica Computazionale-Consiglio Nazionale Ricerche (ILC-CNR, Pisa, Italy), and the National Centre for Text Mining (NaCTeM, University of Manchester, UK) has signed a Language Resources distribution agreement for a large-scale English language terminological resource in the biomedical domain: BioLexicon. Biological terminology is a frequent cause of analysis errors when processing literature written in the biology domain, due largely to the high degree of variation in term forms, to the frequent mismatches between labels of controlled vocabularies and ontologies on the one hand and the forms actually occurring in text on the other, and to the lack of detailed formal information on the linguistic behavior of domain terms. For example, 'retro-regulate' is a terminological verb often used in molecular biology but it is not included in conventional dictionaries. BioLexicon is a linguistic resource for the biology domain, tailored to cope with these problems. It contains information on: - terminological nouns, including nominalised verbs and proper names (e.g. gene names) - terminological adjectives - terminological adverbs - terminological verbs - general English words frequently used in the biology domain Existing information on terms was integrated, augmented, complemented and linked, through processing of massive amounts of biomedical text, to yield inter alia over 2.2M entries, and information on over 1.8M variants and on over 2M synonymy relations. Moreover, extensive information is provided on how verbs and nominalised verbs in the domain behave at both syntactic and semantic levels, supporting thus applications aiming at discovery of relations and events involving biological entities in text. This comprehensive coverage of biological terms makes BioLexicon a unique linguistic resource within the domain. It is primarily intended to support text mining and information retrieval in the biomedical domain, however its standards-based structure and rich content make it a valuable resource for many other kinds of application. On behalf of ELRA, ELDA will act as the distribution agency by incorporating the BioLexicon in the ELRA Language Resources catalogue. With these resources, ELRA is willing to extend the current catalogue, by offering specialized resources and thus allow a better coverage of the language. For more information on BioLexicon (catalogue reference: ELRA-S0373): http://catalog.elra.info/product_info.php?products_id=1113 For more information on the ELRA catalogue, please contact Valérie Mapelli, mapelli@elda.org For more information on ELRA & ELDA, please contact Khalid Choukri, choukri@elda.org; Hélène Mazo, mazo@elda.org ELDA 55-57, rue Brillat Savarin 75013 Paris (France) Tel.: +33 1 43 13 33 33 Fax: +33 1 43 13 33 30 ELRA is a non-profit making organisation founded by the European Commission in 1995, with the mission of providing a clearing house for language resources and promoting Human Language Technologies (HLT). To find out more about ELRA, please visit http://www.elra.info. ELDA is ELRA's operational body. ELDA identifies, collects, markets, and distributes language resources, along with the dissemination of general information in the field of HLT. ELDA also participates in some evaluation projects and campaigns, has considerable knowledge and skills in HLT applications and has participated in many French, European and international projects. To find out more about ELDA, please visit http://www.elda.org. To find out more about BioLexicon partners, please visit: EBI: http://www.ebi.ac.uk ILC-CNR: http://www.ilc.cnr.it NaCTeM: http://www.nactem.ac.uk |
| Linguistic Field(s): |
Computational Linguistics Semantics Lexicography |
| LL Issue: | 20.3020 |
| Date Posted: | 08-Sep-2009 |


