LINGUIST List 24.5199

Mon Dec 16 2013

Confs: Text/Corpus Linguistics/USA

Editor for this issue: Xiyan Wang <>

Date: 13-Dec-2013
From: Stefan Th. Gries <>
Subject: Quantitative Corpus Linguistics with R
E-mail this message to a friend

Quantitative Corpus Linguistics with R Short Title: QCLWR

Date: 11-Aug-2014 - 15-Aug-2014 Location: Santa Barbara, CA, USA Contact: Stefanie Wulff Contact Email: < click here to access email > Meeting URL:

Linguistic Field(s): Text/Corpus Linguistics

Meeting Description:

The corpus bootcamp is a 30-hours hands-on introduction to quantitative corpus linguistics for both graduate students and seasoned researchers. Note: this corpus linguistics bootcamp can be taken together with the bootcamp 'Statistics for linguistics with R', which takes place the week after this bootcamp. Using the open source software and programming language R, we will learn

- how to generate frequency lists and search for words and patterns;
- how to process corpora and perform corpus-linguistic searches in ways that typical corpus software does not support;
- how to write small functions for recurrent corpus-linguistic tasks.

Data to be dealt with include plain text corpora, corpora with SGML or XML annotation, ICE-GB files, and others. The content of this corpus linguistics bootcamp is based on Gries (2009e) but (i) structured differently to accommodate the workshop format of the bootcamp and (ii) provides functions and new examples not discussed in it.

Page Updated: 16-Dec-2013