LINGUIST List 8.1675

Mon Nov 24 1997

FYI: E. Asian Lang, ELRA Resources, AutolexicaList

Editor for this issue: Martin Jacobsen <martylinguistlist.org>

Directory

Judson Murray, E. Asian Lang. Courses and Lang. Teacher Training Course

Chris Corcoran, AutolexicaList Electronic List

Message 1: E. Asian Lang. Courses and Lang. Teacher Training Course

Date: Tue, 18 Nov 1997 16:41:18 -0500
From: Judson Murray <murray.45osu.edu>
Subject: E. Asian Lang. Courses and Lang. Teacher Training Course

The Ohio State University, Summer Programs East Asian Concentration (SPEAC): Training Program for Teachers of Japanese and Intensive Chinese and Japanese Language Programs

The Ohio State University Department of East Asian Languages and Literatures, in conjunction with the OSU National Foreign Language Resource Center, is offering intensive summer programs for teachers of Japanese and learners of Chinese and Japanese during the summer of 1998. The Training Program for Teachers of Japanese is an intensive seven-week (June 22 - August 7) training program which develops participants' Japanese teaching skills through lectures, master classes, workshops, and hands-on teaching. Professor Mari Noda of The Ohio State University, co-author of Japanese: The Spoken Language, is director of the Training Program for Teachers of Japanese. The ten-week (June 22 - August 28) intensive Chinese (Levels I and IV) and Japanese (Levels I, II, and IV) language programs allow learners of Chinese and Japanese to complete one level of language study during the summer quarter. Applicants who submit their application materials prior to March 13, 1998 will receive priority consideration for admission and fellowships. The final deadline is April 6, 1998.

For more information and/or an application, please contact Judson Murray, SPEAC Coordinator, Foreign Language Center, The Ohio State University, 276 Dieter Cunz Hall, 1841 Millikin Rd., Columbus, OH 43210, tel: 614-292-4361, fax: 614-292-2682, e-mail: murray.45osu.edu, or e-mail Mari Noda, Director of SPEAC at noda.1osu.edu. Please visit us on the web: http://deall.ohio-state.edu/SPEAC/

Message 2: ELRA new resources

Date: Wed, 19 Nov 1997 18:56:22 +0100 (MET)
From: Valerie Mapelli <info-elracalva.net>
Subject: ELRA new resources

EUROPEAN LANGUAGE RESOURCES ASSOCIATION ELRA News =====================================

*** ANNOUCEMENT OF NEW RESOURCES AVAILABLE FROM ELRA ***

ELRA is happy to announce the update of its catalogue of Language Resources for Language Engineering and Research.

************************************* * ELRA-S0034 Verbmobil * *************************************

This resource consists of spontaneous speech recorded in a dialog task (appointment scheduling). The German corpus has a total of 13,910 utterances (turns). The BAS edition of the German part is fully labelled and segmented into phonemic/phonetic SAM-PA by the MAUS system and partly segmented manually.

New corpora available via ELRA (for the complete list, please contact ELRA or visit ELRA or BAS Web sites):

VM CD 4.0 - VM40 (1 CD-ROM, original edition) 72 Dialogues, 181 Appointments, 1,588 Turns.

VM CD 4.1 - VM41 (1 CD-ROM, new edition) 72 Dialogues 181 Appointments 1,588 Turns

This new edition contains the transliterations of all dialogues, signal files with PhonDat 2 Header structure, software and speaker documentation and partitur files*. All files were evaluated according to BAS guidelines.

VM CD 5.0 - VM50 (1 CD-ROM, original edition) 101 Dialogues, 256 Appointments, 2,154 Turns.

VM CD 5.1 - VM51 (1 CD-ROM, new edition) 101 Dialogues, 256 Appointments 2,154 Turns.

This new edition contains the transliterations of all dialogues, signal files with PhonDat 2 Header structure, software and speaker documentation and partitur files*. All files were evaluated according to BAS guidelines.

VM CD 6.0 - VM60 (1 CD-ROM, original edition) American/English and 'Denglish'**. 146 Dialogues, 191 Appointments, 1,828 Turns.

VM CD 6.1 - VM61 (1 CD-ROM, new edition) American/English and 'Denglish'**. 146 Dialogues, 191 Appointments 1,828 Turns. This new edition contains the transliterations of all dialogues, signal files with PhonDat 1 Header structure, software and speaker documentation. All files were evaluated according to BAS guidelines.

VM CD 7.0 - VM70 (1 CD-ROM, original edition) 68 Dialogues, 238 Appointments, 1,739 Turns.

VM CD 7.1 - VM71 (1 CD-ROM, new edition) 68 Dialogues, 238 Appointments, 1,739 Turns. This new edition contains the transliterations of all dialogues, signal files with PhonDat 2 Header structure, software and speaker documentation and partitur files*. All files were evaluated according to BAS guidelines.

VM CD 8.0 - VM80 (1 CD-ROM, original edition) American/English 167 Dialogues, 167 Appointments, 1,181 Turns.

VM CD 8.1 - VM81 (1 CD-ROM, new edition) American/English 167 Dialogues, 167 Appointments, 1,181 Turns. This new edition contains the transliterations of all dialogues, signal files with PhonDat 1 Header structure, software and speaker documentation. All files were evaluated according to BAS guidelines.

VM CD 12.0 - VM120 (1 CD-ROM, original edition) 207 Dialogues, 207 Appointments, 2,154 Turns.

VM CD 12.1 - VM121 (1 CD-ROM, new edition) 207 Dialogues, 207 Appointments, 2,154 Turns. This new edition contains the transliterations of all dialogues, signal files with PhonDat 2 Header structure, software and speaker documentation and partitur files*. All files were evaluated according to BAS guidelines.

VM CD 13.0 - VM13.0 (original edition) American/English and 'Denglish'** - 90 speakers - 1714 turns - 200 spontaneous dialogues.

VM CD 13.1 - VM13.1 (new edition) American/English and 'Denglish'** - 90 speakers - 1714 turns - 200 spontaneous dialogues - transliteration.

VM CD 14.0 - VM14.0 (original edition) 97 speakers - 1891 turns - 156 spontaneous dialogues - transliteration.

VM CD 14.1 - VM14.1 (new edition) 97 speakers - 1891 turns - 156 spontaneous dialogues - transliteration - PhonDat 2 headers - Partitur Files*.

* partitur files : files describing the different parts which constitute the corpus - word order, phrase order, etc. ** 'Denglish' : English spoken by Germans.

Price for ELRA members: 76 ECU per CD Price for non members: 152 ECU per CD

*********************************************** * ELRA-S0044 SPINA Corpus ("Robots Commands") * ***********************************************

This German corpus contains read speech of 22 different speakers (6 male, 16 female). The corpus consists of 10 robot command sentences and 62 robot command words. Each speaker reads the whole corpus 5 times, except one speaker who reads the sentence corpus 16 times and the word corpus 51 times. The speakers were recorded at two different sites in Germany (University of Goettingen, University of Bochum). The corpus contains a total of 10,810 recorded utterances.

All speakers are within the age of 25-30. Two speakers are non-native speakers. One file gives information about the speakers (speaker ID, recording site, sex).

The task for the speaker was to read carefully but fluently. If an error occurred, the recording was interrupted by the supervisor and the sentence was repeated. The signal files are raw files without any header, 16 bit per sample, linear, most significant byte first, 16 kHz sample frequency.

The orthography of the corpus is given in two distinct files which contain the prompted words and the prompted sentences as an ordered list.

The recording conditions are as follows:

Microphone: AKG acoustics, C414B-TL, condensator microphone omnidirectional, built-in attenuator and high pass filter switched off, distance to mouth 50 cm. Environment: Studio Quality, echo cancelled room, about 121 qqm Preamplifier: John Hardy, M-1 Sampling rate: 48 kHz to DAT recorder, filtered to 16 kHz Resolution: 16 Bit, most significant byte first

The speech data were digitally filtered to 8 kHz cut-off frequency and downsampled to 16 kHz.

The corpus consists of 1 volume, total size 266,361 KB uncompressed data.

The signal of each utterance is stored in a separate file. Symbolic information like segmentations or labelling (e.g. Phonological Segmentation of words or Word Segmentation of sentences) are stored in files with the same prefix but with different extensions.

Price for ELRA members: 76 ECU Price for non members: 152 ECU

*********************************************************** * ELRA-S0045 German Pronunciation Rules Set - PHONRUL 9.0 * ***********************************************************

PHONRUL is a collection of computer-readable underspecifying pronunciation rules of standard German. This set describes the most common known effects in German pronunciation if deviating from the so-called canonic or citation form of words. The knowledge of this rule set was derived from empirical analysis of speech corpora as well as from a multitude of publications about German phonetics. The set does not contain any dialect-specific rules, however the line between Standard German and dialects is indistinct. Presently, this rule set is used at the University of Munich to aid automatic segmentation and labelling of unknown speech utterances.

The rule set, in its present form, consists of approximately 1,500 complex rules which expand to 5,546 simple replacement rules. The rule set was designed for extended German SAM-PA, but can be translated into other alphabets (e.g. Worldbet, IPA) without much effort.

Price for ELRA members: o for research use: 76 ECU o for commercial use: 482 ECU Price for non members: o for research use: 152 ECU o for commercial use: 964 ECU

******************************************** For more information, please contact: ELRA/ELDA 87, Avenue d'Italie 75013 PARIS Tel: +33 1 45 86 53 00 Fax: +33 1 45 86 44 88 E-mail: info-elracalva.net http://www.icp.grenet.fr/ELRA/home.html ********************************************

Message 3: AutolexicaList Electronic List

Date: Fri, 21 Nov 1997 01:04:59 -0600
From: Chris Corcoran <cmcorcormidway.uchicago.edu>
Subject: AutolexicaList Electronic List

This is to announce the creation of an electronic list for the discussion of Autolexical Grammar. The list will discuss broad theoretical issues as well as the nitty gritty of particular autolexical analyses.

Autolexical syntax is a variety of non-transformational generative grammar in which fully autonomous systems of rules characterize the various dimensions of linguistic representation. These components, or modules, are coordinated by means of the lexicon and a set of interface principles that limit the degree of structural discrepancy between the autonomous representations given by the various modular grammars. The number, identity and content of the components remains a matter of debate, but most studies in the framework have assumed at least a syntactic module, a morphological module, and a semantic module, and have assumed that each of these components is a context free phrase structure grammar. There is neither movement nor deletion in autolexical syntax, the effects of the former being modeled as discrepancies of order or constituency between two autonomous representations, and of the latter as the presence of an element in one dimension to which there corresponds no element at some other dimension.

The model has been employed to handle otherwise paradoxical mismatches between morphological structure and syntax (viz. incorporation), between morphophonological structure and syntax (viz. cliticization), and lately between semantic structure and syntax to handle most of the problems that originally motivated Transformational Grammar, but without the need to invoke movement or deletion. The two most important references are:

Sadock, Jerrold M., 1991, Autolexical Syntax: A Theory of Parallel Grammatical Representations, Chicago: University of Chicago Press.

Schiller, Eric, Steinberg, Elisa, and Barbara Need, eds. 1996. Autolexical Theory: Ideas and Methods. Berlin: Mouton de Gruyter.

To subscribe to the list, send a message with no subject line to Majordomolisthost.uchicago.edu with the following command in the body of your email message:

subscribe autolexical

For non-computerized help with the list, contact Chris Corcoran <cmcorcormidway.uchicago.edu>