* * * * * * * * * * * * * * * * * * * * * * * *
LINGUIST List logo Eastern Michigan University Wayne State University *
* People & Organizations * Jobs * Calls & Conferences * Publications * Language Resources * Text & Computer Tools * Teaching & Learning * Mailing Lists * Search *
* *
LINGUIST List 22.3844

Tue Oct 04 2011

FYI: Mandarin Conversational Corpus Wordlist

Editor for this issue: Brent Miller <brentlinguistlist.org>


To post to LINGUIST, use our convenient web form at http://linguistlist.org/LL/posttolinguist.cfm.
Directory
        1.     Shu-Chuan Tseng , Mandarin Conversational Corpus Wordlist


Message 1: Mandarin Conversational Corpus Wordlist
Date: 03-Oct-2011
From: Shu-Chuan Tseng <tsengscgate.sinica.edu.tw>
Subject: Mandarin Conversational Corpus Wordlist
E-mail this message to a friend

The Mandarin Conversational Corpus Wordlist is generated from the
transcripts of 30 free conversations between strangers, 29 topic-specific
conversations between friends/family members, and 26 map task
dialogues between friends/family members, recorded in Taiwan. The
wordlist contains automatically segmented words and their frequency, part
of speech, and size in syllables - in total 405K word tokens in
approximately 42 hours of recording. You can download the wordlist at
http://mmc.sinica.edu.tw/home_c.htm

Linguistic Field(s): Text/Corpus Linguistics


Read more issues|LINGUIST home page|Top of issue



Page Updated: 04-Oct-2011

Supported in part by the National Science Foundation       About LINGUIST    |   Contact Us       ILIT Logo
While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed on its pages, it cannot vouch for their contents.