Academic Paper


Title: Morphosyntactic annotation of CHILDES transcripts
Author: Kenji Sagae
Institution: University of Southern California
Author: Eric Davis
Institution: Carnegie Mellon University
Author: Alon Lavie
Institution: Carnegie Mellon University
Author: Brian MacWhinney
Institution: Carnegie Mellon University
Author: Shuly Wintner
Institution: University of Haifa
Linguistic Field: Computational Linguistics; Language Acquisition; Psycholinguistics
Abstract: Corpora of child language are essential for research in child language acquisition and psycholinguistics. Linguistic annotation of the corpora provides researchers with better means for exploring the development of grammatical constructions and their usage. We describe a project whose goal is to annotate the English section of the CHILDES database with grammatical relations in the form of labeled dependency structures. We have produced a corpus of over 18,800 utterances (approximately 65,000 words) with manually curated gold-standard grammatical relation annotations. Using this corpus, we have developed a highly accurate data-driven parser for the English CHILDES data, which we used to automatically annotate the remainder of the English section of CHILDES. We have also extended the parser to Spanish, and are currently working on supporting more languages. The parser and the manually and automatically annotated data are freely available for research purposes.

This article appears in the Journal of Child Language, Vol. 37, Issue 3.


