FYI: Taiwan Mandarin Spoken Wordlist
| Author: |
Shu-Chuan Tseng
|
| Linguistic Field(s): |
Text/Corpus Linguistics
|
| FYI Body: |
The ''Taiwan Mandarin Spoken Wordlist'' was derived from the
transcripts of 85 Taiwan Mandarin conversations collected and processed at Academia Sinica, with a total of 42 hours of speech recording. The recording took place from 2001 to 2003 and the speakers' age ranged from 14 to 63. The transcripts were automatically processed by the CKIP word segmentation and POS tagging system. The results of word segmentation, POS tagging, and character-Pinyin conversion as well as homographs were then manually corrected and edited. As a result, the wordlist consists of 16,683 word types and 405,435 word tokens, equivalent to 607,016 syllables. The Wordlist can be downloaded at http://mmc.sinica.edu.tw/resources_e_01.htm |
Business Plan,Business Ideas,Advanced Energy,High Technology,Healthy Diets,Healthy Foods,Games Guides,Games Cheats,Travel Guides,Travel Tips,Study Skills,Study Tips,Health Tips,Health Guides,Jewelry Stores,Jewellery UK Online,Digital Camera Reviews,Digital Camera Buying Guide,Replica Handbags,Replica Bags,Jackets on Sale,Jackets Clearance,WoW Gold,Cheap WoW Gold,Buy WoW Gold,WOW Gold,Swtor Credits

