LINGUIST List 25.2076

Sat May 10 2014

Summer Schools: Bootcamp: Corpus linguistics with R / Santa Barbara, California, USA

Editor for this issue: Malgorzata Cavar <>

Date: 10-May-2014
From: Stefan Th. Gries <>
Subject: Bootcamp: Corpus linguistics with R / Santa Barbara, California, USA
E-mail this message to a friend

Bootcamp: Corpus linguistics with R

Host Institution: University of California, Santa Barbara
Coordinating Institution: University of Florida

Dates: 11-Aug-2014 - 15-Aug-2014
Location: Santa Barbara, California, USA

Focus: 5-day bootcamp on corpus linguistics with R
Minimum Education Level: Undergraduate

The corpus bootcamp is a 30-hours hands-on introduction to quantitative corpus linguistics for both graduate students and seasoned researchers. Using the open source software and programming language R, we will learn
- how to generate frequency lists and search for words and patterns;
- how to process corpora and perform corpus-linguistic searches in ways that typical corpus software does not support;
- how to write small functions for recurrent corpus-linguistic tasks.
Data to be dealt with include plain text corpora, corpora with SGML or XML annotation, ICE-GB files, and others. The participants will also get small functions and scripts they can use for their own corpus-linguistic tasks (concordancing, generating n-grams of words or characters, and others). The content of this corpus linguistics bootcamp is based on Gries (2009e) but (i) structured differently to accommodate the workshop format of the bootcamp, and (ii) provides functions and examples not discussed in it.

Linguistic Field(s): Text/Corpus Linguistics

Tuition: 650.00 USD
Tuition Explanation: Tuition and housing.

Registration: 01-May-2014 to 20-May-2014

Contact Person: Stefanie Wulff

Apply by Email:

Registration Instructions:
See website/pdf for contact information.

Page Updated: 10-May-2014