|Full Title:||Quantitative Corpus Linguistics with R|
|Location:||Santa Barbara, CA, USA|
|Start Date:||11-Aug-2014 - 15-Aug-2014|
|Meeting Email:||click here to access email|
The corpus bootcamp is a 30-hours hands-on introduction to quantitative corpus linguistics for both graduate students and seasoned researchers. Note: this corpus linguistics bootcamp can be taken together with the bootcamp 'Statistics for linguistics with R', which takes place the week after this bootcamp. Using the open source software and programming language R, we will learn
- how to generate frequency lists and search for words and patterns;
- how to process corpora and perform corpus-linguistic searches in ways that typical corpus software does not support;
- how to write small functions for recurrent corpus-linguistic tasks.
Data to be dealt with include plain text corpora, corpora with SGML or XML annotation, ICE-GB files, and others. The content of this corpus linguistics bootcamp is based on Gries (2009e) but (i) structured differently to accommodate the workshop format of the bootcamp and (ii) provides functions and new examples not discussed in it.
|Linguistic Subfield:||Text/Corpus Linguistics|
|Calls and Conferences main page|