Summary Details


Query:   Vocabulary Statistics
Author:  Richard Hudson
Linguistic Field(s):   Text/Corpus Linguistics

Summary:   A few weeks ago I broadcast a double query about the statistics of English
vocabulary. My first question was about the number of morphemes compared
with the number of lemmas, but nobody offered an answer.

My second question was more successful. This was about the proportion of
lemmas in each of the main word classes, and how this proportion varied
with token frequency; I was particularly keen to check a guess that the
proportion of nouns was greater among rare lemmas than among common ones. I
received data from Gwillim Law and Jasper Holmes. It turns out that my
guess was right. I've presented and summarised the data at
HTTP://WWW.PHON.UCL.AC.UK/HOME/DICK/NOUNINESS/NOUNINESS.HTM. If anyone has
comments or further data (including data on other languages), I should of
course be most interested to hear from them.

LL Issue: 20.413
Date Posted: 09-Feb-2009