LINGUIST List 9.1323

Thu Sep 24 1998

Books: Computers & Ling

Editor for this issue: Scott Fults <scottlinguistlist.org>


Links to the websites of all LINGUIST's supporting publishers are available at the end of this issue.

Directory

  1. L. M. Schmitt, Book on SED and AWK in Language Research

Message 1: Book on SED and AWK in Language Research

Date: Wed, 23 Sep 1998 16:38:44 +0900
From: L. M. Schmitt <lotharu-aizu.ac.jp>
Subject: Book on SED and AWK in Language Research


Combining the Bourne-shell, sed and awk in the UNIX environment
for language analysis
================================================================

Lothar M. Schmitt(1) and Kiel T. Christianson(2)

1. The University of Aizu
School of Computer Science and Engineering,
Aizu-Wakamatsu City, Fukushima Prefecture, 965-80, Japan.
E-mail: lotharu-aizu.ac.jp

2. Michigan State University
Dept. of Linguistics and Germanic, Slavic, Asian and African
Languages
East Lansing, Michigan 48824, USA.
================================================================

Abstract
========
We show how to construct tools for language analysis in research
and teaching using the Bourne-shell, sed and awk under UNIX.
Applications include the following:
searches for words, phrases, grammatical patterns and phonemic
patterns in text;
statistical evaluation of texts in regard to such searches;
transformation of phonetic, phonemic or typographic
transcriptions;
comparison of texts in various respects;
lexical-etymological analysis;
concordance;
assistance in translating text;
assistance in learning languages;
assistance in teaching languages;
and text processing and formatting.
The latter includes the generation of on-line dictionaries for
the internet from files that were
generated with what-you-see-is-what-you-get editors representing
only the linear structure of the dictionary (i.e., the book).
All of the above can be achieved with particularly simple and
short code.
In that regard, we illustrate how sed and awk can be combined in
the pipe mechanism of UNIX to create very powerful processing
devices.
Our notes include a short introduction to programming the
Bourne-shell and rather short, but complete descriptions of sed
and awk customized in regard to language analysis.


****************************************

Availability: 1) ERIC Document service (soon), 2) E-mail LMS,
3) Possible posting.
Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue

The following contributing LINGUIST publishers have made their backlists available on the World Wide Web:

1998 Contributors

  • Addison Wesley Longman
  • Blackwell Publishers
  • Cambridge University Press
  • CSLI Publications
  • Edinburgh University Press
  • Garland Publishing
  • Holland Academic Graphics (HAG)
  • John Benjamins Publishing Company
  • Lawrence Erlbaum Assoc.
  • MIT Press--Books Division
  • MIT Working Papers in Linguistics
  • Mouton de Gruyter
  • Oxford University Press
  • Francais Pratique
  • Pacific Linguistics
  • Routledge
  • Summer Institute of Linguistics