Editor for this issue: Karen Milligan <karen
linguistlist.org>
Dear all, Does anyone have or know of sources for information on the history of corpora either for dictionary-making or for lingustic pursuit? Thanks. Gao Yongwei Fudan University, Shanghai, ChinaMail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue
Has anyone out there done any research on how people will say (more exactly, will expect to hear) years such as 2001, 2005, 2010, 2015, 2020, 2037, etc.? I am working on a speech recognition/synthesis application that needs to "speak back" to the user certain years beyond 2000. Here are the top candidates: (1) "two thousand" not followed by "and", e.g. two thousand five, two thousand thirty-seven (2) "two thousand and...", e.g. two thousand AND five, two thousand AND thirty-seven (3) "twenty...", e.g. twenty oh five, twenty thirty-seven To my ear, it seems that the further into the future the year is, the better way (3) sounds, e.g. "twenty thirty-seven", instead of "two thousand (and) thirty-seven". What about 2001? Will people want to say this date as in the movie title? Should it be "two thousand one" or "two thousand AND one"? Although I'm a native speaker of English, I can't make up my mind, and folks here at work don't agree on this issue. My own hunch is that people will resort to the shortest way possible, way (3), but this isn't based on any serious study. If anyone has done any sort of survey on this topic, your help would be much appreciated! Thanks, James Giangola Software Engineer, Dialog Research & Design jamesgMail to author|Respond to list|Read more issues|LINGUIST home page|Top of issuenuance.com Nuance Communications 1380 Willow Rd. Menlo Park, CA 94025 www.nuance.com