* * * * * * * * * * * * * * * * * * * * * * * *
LINGUIST List logo Eastern Michigan University Wayne State University *
* People & Organizations * Jobs * Calls & Conferences * Publications * Language Resources * Text & Computer Tools * Teaching & Learning * Mailing Lists * Search *
* *
LINGUIST List 18.3088

Mon Oct 22 2007

Sum: Corpus Linguistics and Frequency

Editor for this issue: Dan Parker <danlinguistlist.org>


To post to LINGUIST, use our convenient web form at http://linguistlist.org/LL/posttolinguist.html.
Directory
        1.    Peyton Todd, Corpus Linguistics and Frequency


Message 1: Corpus Linguistics and Frequency
Date: 22-Oct-2007
From: Peyton Todd <peytontoddmindspring.com>
Subject: Corpus Linguistics and Frequency
E-mail this message to a friend

Query for this summary posted in LINGUIST Issue: 18.2793
Many thanks to Roger Levy, Maria Giagkou, Balint Tanos, Aida Zitouni, Holly 
Jacobson, Cedric Krummes, Karen Englander, Gill Philip, Martin Volk, N.
Wiedenmann, and Josh Viau for their answers to my recent query regarding
sources of information about corpus linguistics and frequency. In further
expression of my gratitude, and as a boon to others sharing my interest, I
provide below a summary of the replies I received.

Peyton Todd

BOOKS AND ARTICLES:

1. Baker, Paul (2006). Using Corpora in Discourse Analysis. London:
Continuum, 0-8264-7725-9

2. Biber, Douglas. Dimensions of Register Variation using
Multifeature/multidimensional analysis.

3. Hunston, S. & G.Francis, Pattern Grammar (J. Benjamins)

4. Meyer, Charles F. (2002). English Corpus Linguistics: An Introduction .
Cambridge University Press. (ISBN: 052100490X)

5. Roland, Douglas, Frederic Dick, and Jeffrey L. Elman (2007). Frequency
of basic English grammatical structures: A corpus analysis. Journal of
Memory and Language 57(3):348-379.

6. Sinclair, John. Reading Concordances.

7. Sinclair, John. Trust the Text.

Also, 'the works of Joan Bybee', listed at http://www.unm.edu/~jbybee/

HANDS-ON SEARCHES:

1. Bank of English (= Collins, below)

2. British National Corpus: http://www.natcorp.ox.ac.uk/

3. Collins WordbanksOnline concordance sampler
http://www.collins.co.uk/Corpus/CorpusSearch.aspx

4. Introductory website:
http://www.essex.ac.uk/linguistics/clmt/w3c/corpus_ling/content/introduction.html

5. Linguistic Data Consortium (LDC) at the University of Pennsylvania.WebSearch

6. Phrases in English: http://pie.usna.edu/, which uses the BNC

7. http://childes.psy.cmu.edu/

8. http://www.natcorp.ox.ac.uk/

9. TIGER-Search (freely available from the University of Stuttgart)

10.The Penn Treebank (for English).

SOFTWARE

1. The TextSTAT (free):
http://www.niederlandistik.fu-berlin.de/textstat/software-en.html

2. The WordSmith Tools (not free, but inexpensive):
http://www.lexically.net/wordsmith/

3. AntConc: downloadable for free at:
http://www.antlab.sci.waseda.ac.jp/software.html

4. ConcApp: available from www.edict.com.hk/PUB/concapp/


A LIST SERVER FOR FURTHER QUERIES:


1. Corporauib.no


And an INDIVIDUAL:

Prof. Dr. Dietmar Zaefferer,
Ludwig-Maximilians-University at Munich, Germany
(who is very friendly) who has data on all languages of the world
(Computational Linguistics)

Linguistic Field(s): Text/Corpus Linguistics



Read more issues|LINGUIST home page|Top of issue




Please report any bad links or misclassified data

LINGUIST Homepage | Read LINGUIST | Contact us

NSF Logo

While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed
on its pages, it cannot vouch for their contents.