Featured Linguist!

Jost Gippert: Our Featured Linguist!

"Buenos dias", "buenas noches" -- this was the first words in a foreign language I heard in my life, as a three-year old boy growing up in developing post-war Western Germany, where the first gastarbeiters had arrived from Spain. Fascinated by the strange sounds, I tried to get to know some more languages, the only opportunity being TV courses of English and French -- there was no foreign language education for pre-teen school children in Germany yet in those days. Read more



Donate Now | Visit the Fund Drive Homepage

Amount Raised:

$34638

Still Needed:

$40362

Can anyone overtake Syntax in the Subfield Challenge ?

Grad School Challenge Leader: University of Washington


Publishing Partner: Cambridge University Press CUP Extra Publisher Login
amazon logo
More Info


New from Oxford University Press!

ad

What is English? And Why Should We Care?

By: Tim William Machan

To find some answers Tim Machan explores the language's present and past, and looks ahead to its futures among the one and a half billion people who speak it. His search is fascinating and important, for definitions of English have influenced education and law in many countries and helped shape the identities of those who live in them.


New from Cambridge University Press!

ad

Medical Writing in Early Modern English

Edited by Irma Taavitsainen and Paivi Pahta

This volume provides a new perspective on the evolution of the special language of medicine, based on the electronic corpus of Early Modern English Medical Texts, containing over two million words of medical writing from 1500 to 1700.


Summary Details


Query:   sum:frequency
Author:  Marcial Terradez
Submitter Email:  click here to access email
Linguistic LingField(s):   Discourse Analysis
Pragmatics

Summary:   Some weeks ago, I made a query on linguist list about frequency
vocabularies on English, French, German and Spanish. Many people
responded with helpful comments, which are summarised below.

Thanks to everybody who wrote to me. Your suggestions and information
are very important for my work.


My name is Erik Willis and I attend Brigham Young University as a
Masters student in Spanish. One of our professors is very active in
frequency counts, his name is Orlando Alba. (Orlando_Alba@byu.edu) I
know his teacher Humberto Lopez Morales was very active in that field
also. Their respective corpora are based on the Dom. Rep. Puerto Rico
and I believe Mexico and were based on availability (lexico
disponible). Hasta ahora no creo que tengan algo en el net. El que
mejor conoce los recursos del net el Francisco Marcos Marin en la
Autonoma de Madrid. No tengo su e-mail. I am also working with
frequency counts but at a phonological level. I am looking at written
and oral narratives which I believe has not been done. Ojala podamos
ayudarnos mutuamente con bibliografias etc.

Erik Willis
willisew@itsnet.com

- -------------------------------------------------
Estimado Marcial:
Hay varios recuentos existentes ya, entre ellos:

Helen Eaton, ca. 194?. (Me olvido del ti'tulo, pero es algo
como: Frecuency counts in 5 European languages. No se' quie'n lo
publico' originariamente, pero la Dover Press lo volvio' a publicar en
'paperback' por eso de los 60s o 70s.
Luis Fernando Lara en el Colegio de Me'xico ha hecho mucho en
este sentido (con base en textos seleccionados de un total de [creo] 2
millones de palabras de texto corrido). E'l esta' en el DEM
[diccionario del espan~ol de Me'xico], y actualmente es el director
del CELL [centro de estudios de lingu"i'stica y literatura] de El
Colegio de Me'xico (e-mail: lara@colmex.mx, aunque no estoy 100%
seguro del prefijo). E'l te puede asesorar mucho al respecto.
Tb. hay muchos investigadores del ana'lisis de corpus en la propia
Espan~a, aunque no me acuerdo en estemomento de sus nombres.
Yo a mediano plazo emprendere' un proyecto con propo'sito
similar, pero con un corpus de gigapalabras, para poder investigar el
uso de formas de palabras (por ej., el futuro del subjuntivo, etc.)
con algo de detalle, asi' como los nombres propios, etc. Sin embargo,
no tengo mucho hecho al respecto hasta la fecha.

Jim
James L. Fidelholtz e-mail: jfidel@siu.cen.buap.mx
A'rea de Ciencias del Lenguaje o: jfidel@cca.pue.udlap.mx
Instituto de Ciencias Sociales y Humanidades
Universidad Auto'noma de Puebla, Me'xico
- -------------------------------------------------------------
Estimado Marcial,

Un colega mio de la Universidad de Oviedo acaba de publicar un
diccionario de frecuencias del castellano. Su direccion es: Jose
Ramon Alameda <jalameda@sci.cpd.uniovi.es> En cuanto al diccionario
que Ud. va a recopilar, Ud. piensa etiquetear las palabras. Es
decir, va a distinguir entre en numero de casos de 'casa' que son del
sustantivo 'casa' y los que viene del verbo 'casar'? -
--------------------------------------------------
David Eddington
Mississippi State University

I used two frequency lists in research I
conducted almost 20 years ago: one is the Keniston List, 2000 words
divided into groups of 500 for frequency of words in print in
Peninsular Spanish. The other is Rodriquez and Bou for frequency of
words in print for Puerto Rican Spanish.

Joel Walters
Department of English
Bar-Ilan University Ramat Gan, Israel -
----------------------------------------------------

I produced the frequency list for Longman's Dictionary. Both the
paper and assorted frequency lists are available from my web page (see
below).

If you have troubel accessign the paper, feel free to email me again
and I'll send it,

Happy surfing,

Adam

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
Adam Kilgarriff =20
Senior Research Fellow tel: (44) 1273 642919 =20
Information Technology Research Institute (44) 1273 642900=20
University of Brighton fax: (44) 1273 642908
Lewes Road =20
Brighton BN2 4GJ email: Adam.Kilgarriff@itri.bton.ac.uk
UK http://www.itri.bton.ac.uk/~Adam.Kilgarriff
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
- -------------------------------------------------------------------
Entra por ftp anonimo en ftp-lsi.upc.es
cambia al directorio pub/lluisp

alli encontraras los ficheros

spanish.freq (frecuencias de palabras en espa=F1ol
sacadas de un corpus de 3M de palabras)

wsj.freq (frecuencias de palabras en ingles sacadas
de 1.1M de palabras del WSJ)

tienes que uudecodear y gzunzipar los ficheros


suerte

Lluis Padro
- -----------------------------------------------
Hola Marcial:

Aunque es muy probable que ya las tengas, te envio las referencias
que tengo a mano sobre frecuencias lexicas del castellano, por si
te pueden ayudar:

PATTERSON, William; y URRUTIBEHEITY, Hector, _The Lexical Structure of
Spanish_, Mouton, La Haya-Par=EDs, 1975.=20

JUILLAND, Alphonse; y CHANG-RODRIGUEZ, Eugenio, _Frequency dictionary
of Spanish words_, Mouton, Londres-La Haya-Par=EDs, 1964.

PATTERSON, William T., "On the genealogical structure of the Spanish
vocabulary",en ???, pp. 309-339.

GARCIA HOZ, Victor, _Estudios experimentales sobre el vocabulario_,
CSIC, Madrid, 1977.

______________________
Javier Gomez Guinovart <uvifejgg@cesga.es>
http://www.uvigo.es/departamentos/dep/h06/webh06/sli/index.html
Univ. de Vigo - Fac. de Humanidades - Apartado 874 - E-36200 Vigo
Tel: +34+86+812360 - Fax: +34+86+812380
- ---------------------------------------------------------
I have a copy of:

An English-French-German-Spanish Word Frequency Dictionary Subtitle: A
correlation of the first 6000 words in four single-language frequency
lists
Compiled by Helen S. Eaton, Teachers College, Columbia Univ;
visiting instructor, Univ of New Mexico;
Diplomee, Sorbonne, Universite de Paris
441 pages, paperback, Dover Publications, Inc, New York.
copyright 1940, 1967 by Helen S. Eaton

There are separate indexes for English, French, German and Spanish
words.
Appendix II is a conceptual analysis of substantives, verbs and
adjectives
in the list.

Pub in Canada by General Publ Co Ltd, 30 Lesmill Road, Don Mills,
Toronto, Ontario
Pub in UK by Constable and Co, Ltd, 10 Orange St, London, W.C. 2
Pub in US by Dover Publications Inc, 180 Varick St, New York, NY 10014
LCCN: 61-4487

/s/ Israel Cohen
New Dimension Software Ltd
izzy@telaviv.ndsoft.com

LL Issue: 8.930
Date Posted: 26-Jun-1997
Original Query: Read original query


Back

Sums main page