LINGUIST List 30.2476

Tue Jun 18 2019

Disc: (Free webinar) The OED and Historical Text Collections: Discovering New Words

Editor for this issue: Everett Green <>

Date: 17-Jun-2019
From: Charlotte Crouch <>
Subject: (Free webinar) The OED and Historical Text Collections: Discovering New Words
E-mail this message to a friend

Dr Tanja Säily, tenure-track assistant professor in English Language at the University of Helsinki, and Dr Eetu Mäkelä, tenure-track assistant professor in Human Sciences–Computing Interaction at the University of Helsinki, will be presenting their research on neologism use in the Corpus of Early English Correspondence and how the OED, as well as large historical text collections, are semi-automatically consulted within the project.

Join Dr Säily and Dr Mäkelä for an overview of the project and also for a discussion around the possibilities of using the OED in digital humanities research in general.

This session will cover:
- An overview of the neologism research project, with examples of new vocabulary and its social embedding in the Corpus of Early English Correspondence
- Semi-automatic workflows for moving between the OED and historical text collections
- Building a pipeline from the corpus, through automated spelling normalization, lemmatization, filtering, comparison, and analysis to results
- The barriers faced with regard to historical spelling variation, noise from optical character recognition and material bias
- A discussion of the solutions applied, including statistical and neural machine translation, look-ups against comparison corpora and user interfaces for manual checking of results
- What the researchers wished they knew before they started
- What the future looks like for this and other projects using the OED
- Q&A session – bring your questions or send them to the panellists in advance:

Who is this session for?
- Researchers working in similar projects – using the OED or not
- Anyone interested in historical sociolinguistics, historical lexicology, or historical lexicography
- People interested in the complexities of applying computational methods to historical data

Linguistic Field(s): Applied Linguistics
                            Computational Linguistics
                            Discourse Analysis
                            General Linguistics
                            Historical Linguistics
                            Language Acquisition
                            Language Documentation
                            Text/Corpus Linguistics

Page Updated: 18-Jun-2019