Academic Paper
| Title: | The use of film subtitles to estimate word frequencies |
| Author: | Boris New |
| Institution: | Université Paris V - Descartes |
| Author: | Marc Brysbaert |
| Institution: | University of London |
| Author: | Jean Véronis |
| Institution: | Université de Provence |
| Author: | Christophe Pallier |
| Institution: | Centre National de la Recherche Scientifique |
| Linguistic Field: | Sociolinguistics; Text/Corpus Linguistics |
| Abstract: | We examine the use of film subtitles as an approximation of word frequencies in human interactions. Because subtitle files are widely available on the Internet, they may present a fast and easy way to obtain word frequency measures in language registers other than text writing. We compiled a corpus of 52 million French words, coming from a variety of films. Frequency measures based on this corpus compared well to other spoken and written frequency measures, and explained variance in lexical decision times in addition to what is accounted for by the available French written frequency measures. |
This article appears in Applied Psycholinguistics Vol. 28, Issue 4, which you can read on Cambridge's site or on LINGUIST.


