LINGUIST List 22.1037

Wed Mar 02 2011

Sum: Oral History Project: Digitizing a Corpus of Text

Editor for this issue: Danielle St. Jean <daniellelinguistlist.org>

1. Wayles Browne, Oral History Project: Digitizing a Corpus of Text

Message 1: Oral History Project: Digitizing a Corpus of Text
Date: 28-Feb-2011
From: Wayles Browne <ewb2cornell.edu>
Subject: Oral History Project: Digitizing a Corpus of Text

Query for this summary posted in LINGUIST Issue: 22.442
In January, I (Wayles Browne) posted a Query for Michelle Chesner of
Columbia University in which she asked for examples of digital corpora.
Columbia is working on a project that would digitize the results of an
oral history project involving Yiddish, undertaken a number of years
ago. The goal is to create a digital database of the findings of this
survey that would include maps from the original study. In the Query,
she was trying to establish whether anything comparable had been
undertaken in any other language.

Michelle Chesner, who posed the original question, has now
summarized the results as follows:

I would like to thank the members of The LINGUIST List (and,
specifically, Wayles Browne, Georg Böhm, Aurelija Tamosiunaite, and
Shaul Moshe Seidler-Feller) for their help with this project.

Below is a summary of the responses I received:

Lithuanian language vocabulary database for "lexical items that differ in
various dialects" (in Lithuanian): http://www.mch.mii.lt/tarmes/Tarmes/

"Semitisches Tonarchiv," recordings of published texts in mostly Arabic
dialects: http://www.semarch.uni-hd.de/index.php43?&lang=en

I was unaware of this organization and website:
http://www.oeaw.ac.at/icltt/ (currently under construction)

EYDES (www.eydes.de) has been working with the same material that
we are going to be working with, and we hope to do cross-linking
between the two websites (EYDES includes the audio recordings of the
interviews; we have the paper transcriptions).

Thank you once again for all your help.


Michelle Chesner
Norman E. Alexander Librarian for Jewish Studies
Columbia University
304 International Affairs (420 W. 118th St.)
New York, NY 10027
Linguistic Field(s): Computational Linguistics
                            Language Documentation
                            Text/Corpus Linguistics

Subject Language(s): Yiddish, Eastern (ydd)
                            Yiddish, Western (yih)



Page Updated: 02-Mar-2011
