|Full Title:||Corpus Linguistics with R|
|Location:||Atlanta, GA, USA|
|Start Date:||01-Aug-2013 - 06-Aug-2013|
|Meeting Description:||The corpus bootcamp is a 30-hour hands-on introduction to quantitative corpus linguistics for both graduate students and seasoned researchers. Note: this corpus linguistics bootcamp can be taken together with the bootcamp 'Statistics for linguistics with R', which takes place the following week. Using the open-source software and programming language R, we will learn:
- how to generate frequency lists and search for words and patterns;
- how to process corpora and perform corpus-linguistic searches in ways that typical corpus software does not support;
- how to write small functions for recurrent corpus-linguistic tasks.
The data covered include plain-text corpora, corpora with SGML or XML annotation, ICE-GB files, and others. The content of this corpus linguistics bootcamp is based on Gries (2009e), but it (i) is structured differently to accommodate the workshop format of the bootcamp and (ii) provides functions and new examples not discussed there.
|Linguistic Subfield:||Text/Corpus Linguistics|