Editor for this issue: <>
News from the nameserver With the size of the full e-mail address list fast approaching 200Kb, it is necessary to ask linguists not just to ADD their (new) addresses, but also to REMOVE their old and invalid addresses. There are a considerable number of people with addresses in two different universities, and while these may both valid in some cases, there are certainly a number of invalid addresses in the list. The value of such a list depends on the information it contains being kept up to date, so I would ask linguists who notice addresses in the list they know *for certain* to be invalid to send LINGUISTS a REMOVE command: REMOVE bloggs, bert: bloggerMail to author|Respond to list|Read more issues|LINGUIST home page|Top of issuedeckchair.sunshine-state.edu. It might well be worthwhile checking up on your own address with LIST, as there are very many addresses in the list that people did not send themselves. The list has a long history predating the nameserver. Norval Smith PS: a few hints: 1. begin your message at the margin 2. one command a line 3. follow the instructions available with HELP 4. do not try to deal with the server as if was human - it isn't! 5. do not use uppercase letters 6. (for UK people) do not use backward addresses. Other people cannot use them, and I just have to reverse them. 7. be sure of your address before you send it. For example anything ending in ..bitnet.edu or ..edu.bitnet just can't be right
[From Humanist Discussion Group, Vol. 4, No. 1221] NATURAL LANGUAGE SOFTWARE REGISTRY The Natural Language Software Registry is a catalogue of software implementing core natural language processing techniques, whether available on a commercial or noncommercial basis. The current version includes + speech signal processors, such as the Computerized Speech Lab (Kay Electronics) + morphological analyzers, such as PC-KIMMO (Summer Institute for Linguistics) + parsers, such as Alveytools (University of Edinburgh) + knowledge representation systems, such as Rhet (University of Rochester) + multicomponent systems, such as ELU (ISSCO), PENMAN (ISI), Pundit (UNISYS), SNePS (SUNY Buffalo), + applications programs (misc.) This document is available on-line via anonymous ftp to tira.uchicago.edu (IP 128.135.96.31), by email to registryMail to author|Respond to list|Read more issues|LINGUIST home page|Top of issuetira.uchicago.edu, and by physical mail to the address below. If you have developed a piece of software for natural language processing that other researchers might find useful, you can include it by returning the description form, available from the same sources. Elizabeth Hinkelman, Director (registry
tira.uchicago.edu) NL Software Registry Center for Information and Language Studies 1100 East 57th Street Chicago, IL 60637, USA ----------------------- Authors: Person to contact for software (if different): Institution: Department: Street: City/State/Zip: Country: Phone (with country & area codes): Email network & address: Name of system: Type of system: research system / commercial product / other (specify) Primary task of system: linguistic analysis / test of linguistic theory (specify) / text generation / machine translation / text proofing / database interface / other (specify) Components: phonological analyzer/generator morphological analyzer/generator parser/generator semantic interpreter knowledge representation discourse structure pragmatic features other (specify) Components available as independent modules: (subsequent questions may need a separate answer for each) Components can be extended by: the developer / computational linguist / linguist / programmer / experienced user / new user Data components are: firmly embedded in program / independent of program Data provided: (give size, features and language as in the examples) 120,000 entry wordlist for French 5,000 word LFG lexicon of basic Swahili w/ affixes, English gloss 15 rule transformational grammar for Dutch cross-serial dependencies 200 node knowledge base for AIDS case histories w/ 10 30-node cases. Data components can be extended by: the developer / computational linguist / linguist / programmer / experienced user / new user Character set used for language data: programmable (describe) fixed, 16-bit -- Unicode fixed, 8-bit -- ISO (specify, eg ASCII+Latin II) / proprietary ASCII fixed, 7-bit -- ISO (specify, eg US ASCII) / extended ASCII (specify) other (specify) Range of applicable natural languages: (give theoretical or technical limits) Approximate number of examples processed successfully, as a power of 10: Specify example type: words / sentences / paragraphs / other Its coverage level is now: demonstration / small research / large research / production quality / high volume Size of system: lines of source code, kilobytes of executable Programming language: Operating system or hardware: Is there a stable version of the system? Is there continuing development? Summarize the main goals and ideas. Indicate what makes the project a useful and interesting tool for research applications. List documents in which the software is described: User documentation: System documentation: Available support: upgrades / source code / consulting / other Format for software distribution: Price: Restrictions on use: If you are willing to have the software reviewed, please send us a version along with this information. We are also interested in reports and documentation, even for software not reviewed. NL Software Registry Center for Information and Language Studies 1100 East 57th Street Chicago, IL 60637, USA registry
tira.uchicago.edu [End Linguist List, Vol. 2, No. 130]