LINGUIST List 24.874
|
Mon Feb 18 2013
Internships: Lexical Data Processing
Editor for this issue: Brent Miller
<brent linguistlist.org>
|
Date: 15-Feb-2013
From: Jonathan Pool <pool panlex.org>
Subject: Lexical Data Processing
E-mail this message to a friend
University or Organization: The Long Now Foundation
Department: PanLex Project
Web Address: http://panlex.org
Type of Work: Archiving
Linguistic Field(s):
Computational Linguistics
Language Documentation
Lexicography
Translation
Duration: 24-Jun-2013 to 08-Aug-2013
Voluntary
Internship Location: San Francisco, California, USA
Minimum Education Level: BA
Special Qualifications: Some experience with a programming or scripting language. Some experience with regular expressions. Knowledge of linguistic concepts. Fluency in English. Knowledge of one language other than English. A laptop computer. Optional additional qualifications: Knowledge of two or more languages other than English. Familiarity with writing systems, character encodings, and Unicode. Familiarity with one or more non-Latin scripts. Experience with lexicography.
Description:
Training: lexical database organization, language identification, character encoding, character normalization, Unicode compliance, lemmatic standardization, lexical classification, standards for lexical data and documentation, and lexical resource parsing with regular expressions.
Work: validating legacy lexical data in a variety of formats, languages, and scripts; converting lexical data into a unified structure; adding content to an 18-million-word, 7000-language lexical database.
Mentored individual and teamwork environment.
Application Deadline: 17-Jun-2013
Web Address for Applications: http://panlex.org/help/intern.shtml
Contact Information:
Dr. Jonathan Pool
Email: pool panlex.org
Phone: 510-225-1717
Fax: 510-225-1170
Read more issues|LINGUIST home page|Top of issue
|
|
Page Updated: 18-Feb-2013
|
|
About LINGUIST
|
Contact Us
While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed
on its pages, it cannot vouch for their contents.
|
|