LINGUIST List 8.889

Wed Jun 18 1997

Qs: Arabic/Korean, Morphology, Theta Grid

Editor for this issue: Ann Dizdar <annlinguistlist.org>


We'd like to remind readers that the responses to queries are usually best posted to the individual asking the question. That individual is then strongly encouraged to post a summary to the list. This policy was instituted to help control the huge volume of mail on LINGUIST; so we would appreciate your cooperating with it whenever it seems appropriate.

Directory

  1. M.-L. Hannan [TAO], Arabic and Korean corpora
  2. Daniel Nettle, Appeal: Inflectional Morphology Database
  3. F.Baube, Theta Grid DB for English ?

Message 1: Arabic and Korean corpora

Date: Thu, 12 Jun 1997 15:36:51 -0400
From: M.-L. Hannan [TAO] <hannanciti.doc.ca>
Subject: Arabic and Korean corpora

Hello,

I am a researcher in computational linguistics with a Canadian reserach
team. We are currently looking for large corpora of electronic text in
both Arabic and Korean (around 3MB of each) for the purposes of our
research in building statistical language models. The subject of the
text is not important, but since a large quantity is required, a
literature source might be ideal.

I'd be very grateful if anyone could point me towards any ftp-able
archives that might be pertinent.

Thank you in advance,

- Marie-Louise Hannan
 Reseracher, Group TAO
 Centre for Information Technology Innovation
 Montreal, Canada
Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue

Message 2: Appeal: Inflectional Morphology Database

Date: Fri, 13 Jun 1997 09:40:49 +0100 (BST)
From: Daniel Nettle <daniel.nettlemerton.oxford.ac.uk>
Subject: Appeal: Inflectional Morphology Database


Appeal: Inflectional Morphology Research Project

or

What's the point of inflectional morphology anyway?

I am currently interested in looking at how the distinctions expressed
morphologically in languages with rich systems of morphological
marking are realised in languages which lack it, if they are. In a
language such as Mangarayi (Merlan 1980) where inflectional morphology
is highly developed, morphology takes over almost all the marking of
grammatical relations from syntax, but it also does much more. Nominal
morphology replaces the category of PP, and allows pronouns in many
sentences to be dropped, whilst verbal inflection replaces not only
tense and aspect marking words but some adverbs. The questions which
obviously arise are: (a) diachronically, why do some languages come to
achieve morphologically what others do syntactically; and (b)
synchronically, what are the systemic consequences of having more or
less developed inflection.

I am appealing for help in building up a cross-linguistic database
relevant to these issues. The data required are a set of simple
sentences translated into a large number of different languages. The
languages I am interested are listed below. They are drawn from a
standard cross-linguistic sampling frame (after Nichols 1992). I would
like data from as many of them as possible, though I appreciate that
in practice only a small proportion can be done.

To respond for a particular language requires both competence in the
language (or access to native speakers) AND linguistic expertise on it
(since phonological transcription and interlinear glosses are
required). I will be extremely grateful if readers can contribute on
languages with which they are familiar. I will post results on the
Internet and also make the data available to anyone who contributes
and is interested. The questionnaire is available from my website at:
http://users.ox.ac.uk/~mert0362
 - or by e-mailing me at: 
 Daniel.Nettlemerton.ox.ac.uk
 - or by post from:
 Daniel Nettle, Merton College, Oxford, OX1 4JD, UK.

Many thanks in anticipation to all!

Daniel Nettle

List of languages follows:
African languages
Amharic	
Dizi			
Fulfulde
Gbeya
Hausa	
Ik		
!Kung	
Logbara
Luganda
Maasai
Mandinka
Nama (Hottentot)
Nera		
Orig (Kordofanian)
Oromo	
Sandawe			
Songhai
Yoruba

South and South-East Asian Languages
Acehnese
Fur
Gurung
Kota		
Temiar
Thai		

Languages from New Guinea and Oceania
Abelam	
Alamblak	
Amele	
Arapesh	
Asmat	
Awtuw	
Drehu 
Hua			
Kate			
Kewa		
Kiwai		
Kobon	
Kombai	
Nasioi 
Ponapean 
Salt-Yui	
Sentani	
Suena	
Sulka
Telefol	
Vanimo	
West Futuna
Yali			
Yessan-Mayo	
Yimas		

Australian Languages
Djingili		
Dyirbal		
Garawa		
Gunwinggu		
Kuniyanti / Gooniyandi
Malak-Malak	
Maung		
Nunggubuyu	
Nyigina		
Tiwi				
Ungarinjin / Ngarinjin		
Uradhi		
Warndarang	
Western Desert / Pintupi
Yukulta		


Central and South American Languages
Axininca Campa
Canela-Kraho	
Cashinahua	
Cayuvava		
Chontal	
Guarani
Hixkaryana		
Huallaga Quechua
Huave	
Jaqaru		
Jivaro	
Mixe			
Mixtec	
Nambiquara
Pipil			
Piraha	
Tarascan	
Tepehua	
Tzutujil	
Yagua	
Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue

Message 3: Theta Grid DB for English ?

Date: Fri, 13 Jun 1997 15:59:52 +0300 (EEST)
From: F.Baube <fredkirjasto.kaarina.fi>
Subject: Theta Grid DB for English ?


Is there a database, in electronic form and in the public domain,
of the theta grids (argument structures) of some English verbs ?

It need not be "complete", a few hundred common verbs would do nicely.

This would be for a non-commercial software project.

If people wish to reply directly to me, I can summarize to the list.

Thank you,

Fred Baube

- 
F.Baube(tm) 
G.U. MSFS '88 
fredkirjasto.kaarina.fi
#include <disclaimer.h>
Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue