
FYI: Taiwan Mandarin Spoken Wordlist

Author: Shu-Chuan Tseng

Linguistic Field(s): Text/Corpus Linguistics

FYI Body: The "Taiwan Mandarin Spoken Wordlist" was derived from the
transcripts of 85 Taiwan Mandarin conversations collected and
processed at Academia Sinica, totaling 42 hours of recorded speech.
The recordings were made between 2001 and 2003, and the speakers
ranged in age from 14 to 63. The transcripts were first processed
automatically by the CKIP word segmentation and POS tagging system.
The output of word segmentation, POS tagging, and character-to-Pinyin
conversion, including the treatment of homographs, was then manually
corrected and edited. The resulting wordlist contains 16,683 word types
and 405,435 word tokens, equivalent to 607,016 syllables.
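
For readers less familiar with the type/token distinction, the three figures above can be illustrated with a minimal Python sketch. It is not the CKIP pipeline or the procedure used at Academia Sinica: the sample utterances, the hyphen-joined Pinyin notation, and the function name `summarize` are all assumptions made for illustration.

```python
from collections import Counter

# Hypothetical sample data: utterances already segmented into words,
# each word written as Pinyin syllables joined by "-" (an assumed notation).
utterances = [
    ["wo3", "zhi1-dao4", "ni3"],
    ["ni3", "zhi1-dao4", "ma5"],
]

def summarize(utterances):
    """Return (word types, word tokens, syllables) for segmented utterances."""
    counts = Counter(word for utt in utterances for word in utt)
    types = len(counts)              # distinct word forms
    tokens = sum(counts.values())    # running words
    # Each token contributes one syllable per hyphen-separated part.
    syllables = sum(len(word.split("-")) * n for word, n in counts.items())
    return types, tokens, syllables

types, tokens, syllables = summarize(utterances)
print(types, tokens, syllables)
```

By the same arithmetic, the published figures imply about 1.5 syllables per word token (607,016 / 405,435).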

The Wordlist can be downloaded at

