FYI: Corpus of Late Modern English Texts


Author: Hendrik De Smet

Linguistic Field(s): Historical Linguistics; Text/Corpus Linguistics

Subject Language(s): English

FYI Body: A substantially upgraded version of the Corpus of Late Modern English Texts is now available online. CLMET3.0, compiled by Hendrik De Smet, Hans-Jürgen Diller and Jukka Tyrkkö, contains about 34 million words of text, covering British English from 1710 to 1920. The new version of the corpus is genre-balanced and part-of-speech-tagged. The corpus is freely available. More detailed information and instructions for downloading the corpus can be found at http://perswww.kuleuven.be/~u0044428.