* * * * * * * * * * * * * * * * * * * * * * * *
LINGUIST List logo Eastern Michigan University Wayne State University *
* People & Organizations * Jobs * Calls & Conferences * Publications * Language Resources * Text & Computer Tools * Teaching & Learning * Mailing Lists * Search *
* *
LINGUIST List 18.2686

Fri Sep 14 2007

Qs: English Type Frequencies by POS

Editor for this issue: Dan Parker <danlinguistlist.org>

We'd like to remind readers that the responses to queries are usually best posted to the individual asking the question. That individual is then strongly encouraged to post a summary to the list. This policy was instituted to help control the huge volume of mail on LINGUIST; so we would appreciate your cooperating with it whenever it seems appropriate.

In addition to posting a summary, we'd like to remind people that it is usually a good idea to personally thank those individuals who have taken the trouble to respond to the query.

To post to LINGUIST, use our convenient web form at http://linguistlist.org/LL/posttolinguist.html.
        1.    Richard Hudson, English Type Frequencies by POS

Message 1: English Type Frequencies by POS
Date: 12-Sep-2007
From: Richard Hudson <dickling.ucl.ac.uk>
Subject: English Type Frequencies by POS
E-mail this message to a friend

Does anyone know where I can find the proportion of English lemmas that are

More precisely, I'm looking for figures for lemmas in some large dictionary
or corpus classified by word class (aka part of speech), and if possible
also by token frequency; so ideally I'd like a table which shows nouns (and
maybe other word classes) as a percentage of the lemmas in a given
frequency range. My assumption is that the percentage of nouns in rare
vocabulary is higher than in common vocabulary, but I'd like to know
whether this is true.

If I learn anything significant I'll summarise back to the list.

Dick Hudson (dickling.ucl.ac.uk)

Linguistic Field(s): Computational Linguistics

Read more issues|LINGUIST home page|Top of issue

Please report any bad links or misclassified data

LINGUIST Homepage | Read LINGUIST | Contact us

NSF Logo

While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed
on its pages, it cannot vouch for their contents.