LINGUIST List 18.267|
Thu Jan 25 2007
Qs: Categorisation of Semantic Relations/American English Corpus
Editor for this issue: Kevin Burrows
We'd like to remind readers that the responses to queries are usually best posted to the individual asking the question. That individual is then strongly encouraged to post a summary to the list. This policy was
instituted to help control the huge volume of mail on LINGUIST; so we would appreciate your cooperating with it whenever it seems appropriate.
In addition to posting a summary, we'd like to remind people that it is usually a good idea to personally thank those individuals who have taken the trouble to respond to the query.
To post to LINGUIST, use our convenient web form at http://linguistlist.org/LL/posttolinguist.html.
Categorisation of Semantic Relations
Large Amer English Corpus with Parts of Speech
Message 1: Categorisation of Semantic Relations
From: Jo Wiltshire <jo.wiltshiregmail.com>
Subject: Categorisation of Semantic Relations
I am just commencing a PhD in Psychology looking at children's
understanding of semantic relations and how this relates to their
understanding of analogy. My first problem is how to choose the relations
which we should look at. For example, previous research has looked at
relations such as 'part of', 'lives in', 'is made of', etc. But I don't simply want
to pick the first relations that come in to my head.
Can you recommend any books, papers or other resources which attempt
to list and categorise relations in a meaningful way? Several authors appear
to have demonstrated how most relations can be put in to five or six basic
categories, however, this tends to end up with quite abstract categories
which aren't suitable for tasks with children. What I really need is something
which categorises and organises relations in a more fine grained way.
Message 2: Large Amer English Corpus with Parts of Speech
From: Christopher Maloof <cjm62georgetown.edu>
Subject: Large Amer English Corpus with Parts of Speech
We are looking for a large word frequency list for English which
distinguishes parts of speech. We're currently using counts from an
automatically-tagged corpus of 50 million words of AP newswire text, but
this isn't enough to reliably distinguish counts of moderately infrequent
words, especially for inflected forms like past participles.
Does anyone know of any large, POS-tagged corpus of American English, or a
list of word frequency counts from such a corpus? We're more concerned
about size than balance; a very large list from a single source (such as
newswire text) would be fine.
Please e-mail any suggestions to Chris Maloof at cjm62georgetown.edu.
Thanks very much,
Brain and Language Lab, Georgetown University
Respond to list|Read more issues|LINGUIST home page|Top of issue
Please report any bad links or misclassified data
LINGUIST Homepage | Read
LINGUIST | Contact us
While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed
on its pages, it cannot vouch for their contents.