LINGUIST List 24.5199
Mon Dec 16 2013
Confs: Text/Corpus Linguistics/USA
Editor for this issue: Xiyan Wang
<xiyanlinguistlist.org>
Date: 13-Dec-2013
From: Stefan Th. Gries <stgries
linguistics.ucsb.edu>
Subject: Quantitative Corpus Linguistics with R
E-mail this message to a friend
Quantitative Corpus Linguistics with R
Short Title: QCLWR
Date: 11-Aug-2014 - 15-Aug-2014
Location: Santa Barbara, CA, USA
Contact: Stefanie Wulff
Contact Email:
< click here to access email >
Meeting URL:
http://www.linguistics.ucsb.edu/faculty/stgries/ucsbbootcamps2014.pdf
Linguistic Field(s): Text/Corpus Linguistics
Meeting Description:
The corpus bootcamp is a 30-hours hands-on introduction to quantitative corpus linguistics for both graduate students and seasoned researchers. Note: this corpus linguistics bootcamp can be taken together with the bootcamp 'Statistics for linguistics with R', which takes place the week after this bootcamp. Using the open source software and programming language R, we will learn
- how to generate frequency lists and search for words and patterns;
- how to process corpora and perform corpus-linguistic searches in ways that typical corpus software does not support;
- how to write small functions for recurrent corpus-linguistic tasks.
Data to be dealt with include plain text corpora, corpora with SGML or XML annotation, ICE-GB files, and others. The content of this corpus linguistics bootcamp is based on Gries (2009e) but (i) structured differently to accommodate the workshop format of the bootcamp and (ii) provides functions and new examples not discussed in it.
Page Updated: 16-Dec-2013