LINGUIST List 21.4172
|
Wed Oct 20 2010
Qs: Rosetta Project Seeks Volunteers for New Project
Editor for this issue: Danielle St. Jean
<danielle linguistlist.org>
|
We'd like to remind readers that the responses to queries are usually best posted to the individual asking the question. That individual is then strongly encouraged to post a summary to the list. This policy was instituted to help control the huge volume of mail on LINGUIST; so we would appreciate your cooperating with it whenever it seems appropriate. In addition to posting a summary, we'd like to remind people that it is usually a good idea to personally thank those individuals who have taken the trouble to respond to the query. To post to LINGUIST, use our convenient web form at http://linguistlist.org/LL/posttolinguist.cfm.
|
Directory
1. Laine Stranahan ,
Rosetta Project Seeks Volunteers for New Project
Message 1: Rosetta Project Seeks Volunteers for New Project
|
Date: 19-Oct-2010
From: Laine Stranahan <laine longnow.org>
Subject: Rosetta Project Seeks Volunteers for New Project
E-mail this message to a friend
If you have 10-30 minutes and a keyboard or a microphone, please consider making a submission to The Rosetta Project's latest volunteer-based linguistic documentation project: http://www.rosettaproject.org/300-languages The 300 Languages Project is a special effort by The Rosetta Project (www.rosettaproject.org), part of The Long Now Foundation (www.longnow.org), to begin the construction of a universal corpus of human language by collecting parallel text and audio in the world's 300 most widely-spoken languages. The resulting collection will contain thousands of volunteer-contributed public domain text documents and audio recordings which will be made available to researchers and the public alike via The Internet Archive, a free online digital library. The 300 Languages Project seeks to develop an extensible protocol and a set of scalable, low-cost (i.e., volunteer-based) methods and standards for language documentation via the building of a "seed corpus" - a corpus which starts small but is designed to grow. The 300 Languages Project is collecting translations and recordings of three important texts: the Swadesh List, the Universal Declaration of Human Rights, and Genesis chapters 1-3. These texts were chosen primarily for usefulness in research (e.g., the Swadesh list) and breadth of existing translation (Genesis and the UDHR). The 300 Languages Project is made possible through the support and sponsorship of Distinguished Career Professor and speech technology expert Dr. James K. Baker and is conducted in partnership with the ALLOW initiative of the Center for Innovations in Speech and Language at theLanguage Technologies Institute.
Linguistic Field(s):
Computational Linguistics
Language Documentation
Text/Corpus Linguistics
Read more issues|LINGUIST home page|Top of issue
|
|
Page Updated: 20-Oct-2010
|
|
About LINGUIST
|
Contact Us
While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed
on its pages, it cannot vouch for their contents.
|
|