|
|
E-mail this message to a friend
|
|
Title:
|
Phonological Constraints and Free Variation in Compounding: A corpus study of English and Estonian noun compounds
|
|
Author:
|
Mary Sepp
|
|
Email:
|
click here to access email
|
|
Homepage:
|
http://www1.bmcc.cuny.edu/faculty/fp.jsp?f=msepp
|
|
Degree Awarded:
|
City University of New York
, Linguistics Program
|
|
Degree Date:
|
2006
|
|
Linguistic Subfield(s):
|
Text/Corpus Linguistics
|
|
Subject Language(s):
|
English
|
|
Director(s):
|
Martin Chodorow
|
|
|
Abstract:
|
|
This research was designed to examine the patterns of variation in the
phonological and/or orthographic form of Estonian and English noun
compounds. Estonian noun compounds generally occur in one of two forms:
N1(nominative)+ N2 , as in kool + meister (“schoolmaster”), or N1(genitive)
+ N2, as in kooli + õpetaja (“schoolteacher”). Some Estonian compounds
vary freely in form – e.g., veebsepp/veebisepp (“webmaster”). English noun
compounds exhibit orthographic variation, as they may be written in three
ways: closed (“bookstore”), hyphenated (“dot-com”), or open (“space
station”). Many English compounds also vary freely – e.g.,
cellphone/cell-phone/cell phone. The principal goal of this study was to
use statistical data derived from corpora to determine which variables best
account for the choice of variant compound forms.
The 1,094 Estonian compounds used in this research came from a one million
word corpus of Estonian literary and news texts. Data on variation of form
were obtained from Google searches of the World Wide Web. Results showed a
strong preference for genitive forms, and it was posited that this
preference is due to general principles of ease of pronunciation and ease
of perception.
Phonology is also a factor in the distribution of English compounds. A
number of phonological variables were examined in the current study: number
of syllables, presence of compound stress, vowel sequences across internal
lexical boundaries, and double consonants across internal lexical
boundaries. Frequency data for these variables were extracted from a
fourteen million word English corpus. Results of multiple regression
analyses showed that the number of syllables in the compound is a stronger
predictor of orthographic form than the other phonological features that
were tested. Phonology was not assumed to be the only influence, however;
lexical features were also examined. Results indicated a substantial
contribution of the second constituent in predicting whether the compound
would be open or closed, and a lesser, though important, contribution of
the first constituent. A regression analysis combining phonological and
lexical variables accounted for about 68% of the variance in the
orthography of 707 high frequency English noun compounds.
|
|
|
|
|
Page Updated: 27-Nov-2009

Please report any bad links or misclassified data
LINGUIST Homepage | Read
LINGUIST | Contact us

While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed on its pages, it cannot vouch for their contents.
|
|