LINGUIST List 4.414

Sat 29 May 1993

Qs: Object agr., German dictionaries, Text processing

Editor for this issue: <>


Directory

  1. Dan Everett, Object agreement
  2. , query: German, German- English, and German-Russian dictionaries
  3. David Lewis, major resources and lists of resources for text processing

Message 1: Object agreement

Date: Fri, 28 May 93 12:55:36 -0Object agreement
From: Dan Everett <deverpogo.isp.pitt.edu>
Subject: Object agreement

I am looking for nonergative languages in which verbs agree with
the direct object but not the subject. (The intransitive subject should
also not trigger agreement on the verb, else the language would count
as ergative.)

I know of only one language that operates this way (in New Guinea,
described by Foley, I believe). Anyone know of any others?

Individual responses should be sent to me directly. I will post a summary
of any responses I receive.

Dan Everett
Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue

Message 2: query: German, German- English, and German-Russian dictionaries

Date: Fri, 28 May 93 19:59:01 +0query: German, German- English, and German-Russian dictionaries
From: <ursula.doleschalwu-wien.ac.at>
Subject: query: German, German- English, and German-Russian dictionaries

I am posting this on behalf of my colleague Sergej Krylov:

I would like to find out if there exist anywhere computerized dictionaries
(German monolingual, German-English and German-Russian). I am interested in
freeware as well as shareware or commercial versions.
Ursula Doleschal Tel.: ++43-1-31336 4115
Inst. f. Slawische Sprachen Fax: ++43-1-31336 744
Wirtschaftsuniv. Wien
Augasse 9 Austria
1090 Wien Europe
Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue

Message 3: major resources and lists of resources for text processing

Date: Fri, 28 May 93 13:19 EDT
From: David Lewis <lewisresearch.att.com>
Subject: major resources and lists of resources for text processing


Hello, As part of a tutorial I'm presenting on natural language
processing for information retrieval, I would like to prepare two
brief lists:

 1. A list of "lists" of resources for natural language processing
of text. I count electronic mailing lists, and archives of those
lists, as such lists of lists when they frequently discuss language
processing resources. I will also list network servers (FTP archives,
Bitnet LISTSERVs, etc.) as "lists" but even better would be to be able
to provide pointers to regularly maintained files on such archives.

My current list is the following:

 Free electronic mailing lists:
 NL-KR
 IRLIST
 LINGUIST
 EMPIRICISTS
 CORPORA-LIST
 LN
 HUMANIST

 Commercial electronic mailing lists:
 Computists Communique

 Regularly Maintained Lists of Resources:
 /pub/catalog at anonymous FTP site clr.nmsu.edu

Suggestions are welcome. This list of "lists" is intended to be a
one-shot effort that will become immediately out-of-date, but will
have pointed a group of people to maintained resources with a longer
expected lifetime.

 2. A second list of the 10 to 20 most useful, easily available,
non-commercial resources for content-based processing of large (at
least megabyte scale) bodies of text, and how to get them. (If it
takes more than 50 words to describe how to get them, they fail on
"easily available".) I want to include only resources that are
robust, available at nominal cost (at least to academics), and have
actually been successfully used by multiple research or application
groups. Some of the ones I intend to include on the list based on
personal experience are:

 --Treebank corpus (tagged corpus)
 --PC-Kimmo and Englex (morphological analyzer)
 --Perl (programming language)
 --SMART (information retrieval system)
 --JUMAN (Japanese morphological analyzer / segmenter)

I welcome suggested additions to this list, from resource developers
and particularly from resource users based on your personal
experience. Please do not be offended if you are a resource developer
and I do not choose to list your resource. Brevity is my goal and
there will be significant omissions.

 I will post the resulting two lists back to all newsgroups that
this query is going to, so there's no need to write me asking for
copies. These two lists will be one shots that will not be
maintained, and will hopefully be soon forgotten in favor of better,
permanently maintained lists.

Many thanks, David Lewis

David D. Lewis
AT&T Bell Laboratories email: lewisresearch.att.com
600 Mountain Ave.; Room 2C-408 ph. 908-582-3976
Murray Hill, NJ 07974; USA dept. fax. 908-582-7550
Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue