Editor for this issue: Ann Dizdar <dizdar
tam2000.tamu.edu>
Dear LINGUISTs & NLPASIAns, Thank you very much for sending a lot of valuable information. This is the final summary about the languages which don't have delimiters between 'words'. I am tempted to send all comments, but I give up because of the troublesome amount. And also, I eliminated the languages which do have delimiters [YES group] as the same reason. If you are interested in keeping it at your hand, please consult the previous summary (LINGUIST VOL-6-1269). To update as I did, you can just 1) remove Flemish, 2)add West-Frisian (according to Henk Wolf. Thank you.) to [YES]:Latin/Greek variations; and 3) move Turkish, Kazakh(??), Azerbaijani(??), Uzbek(??), Kirghiz(??) and Turkmen(??) feom [YES] to [Partly NO]. Here, (??) indicates a "closely akin" language to Turkish which produces VERY long words by agglutination and no space inside it (also by Henk Wolf). I don't have any definite conclusions, but following are observed: 1) Delimiter-less languages are minority in the world languages. Especially [Yes] group are very rare. There are only 3 languages (Chinese, Japanese Tibettan) - 2% of 158 languages. If we include [Partly NO] group (like many indian languages), they are 6% of total 158 languages. ["(?)" is counted as 0.5]. (Chinese, Japanese & Tibettan, all three have script to write (traditionally) in Up-to-Down direction, but they do so even now with some extent. This may be a factor of this result, but I'm not sure....) 2) There is NO strong correlation between delimiter-less-ness and language typology: We can observe various types of languages in [YES]/[Partly NO] group, e.g., agglutinating (Japanese, Tamil, Turkish, etc.), isolating (Chinese), and inflectional (Sanskrit, etc.). (How about the polisynthetic??) Also, a language type can be observed in both [YES] and [(Partly)NO], such as sanskrit([Prtly NO]) and Russian([YES]) for inflecting. Between [NO] and [YES] (or at least [Partly NO]), the same holds for agglutinating and isolating. For example, Chinese[NO] and Vietnamese[YES]; Japanese[NO] and Hungalian[YES] (or Tamil[Partly NO]). 3) So, it is NOT quite right to say (and often I listened) that "the language L does not have a space between words because L is agglutinating." 4) Latin/Greek and Cyrillic-based languages are big majority (70% in our list; 54% Latin/Greek, 16% Cyrillic), and they have space as a delimiter between words. It seems no exceptions in modern languages. (But many exceptions in classic/medival languages.) Several people suggested that some languages have 'moderately' long (verbal/nominal) compounds (e.g. German, Dutch, West-Frisian, etc.) (vs. above languages with VERY long words). I am not sure that these compounds are non-lexical, i.e., productive and semantically transparent (i.e., syntactic compounds). Could someone tell me if these German etc. have prominently syntactic compounds to make the word "pretty long"? Or, are they mostly lexical compounds? Also, if you know some other languages in our list have this (i.e., syntax compounds are prominent) property, please just send the name of the language. I will make and post a summary of this new question as a different topic again. We have still many (?)-items in our list. So, if you are knowledgeable about these, please let me know (I will wait for in a long run...). I will send an addendum to this summary some time later. Finally, I want to express my sincere gratitude to our (37) contributers to compose this final summary. The name of contributers are listed at the end of this summary. (I hope I didn't miss any name. If it happens, I sincerely apologize.) Hideo Fujii University of Massachusetts at Amherst SUMMARY: Languages Without Delimiters Between 'Words' (in total 158 languages) ========================================================== Q: Does the language have word-boundary delimiters? A.[NO]:(3) Chinese, Japanese, Tibetan B.[Partly NO -Words delimited, but need analysis to reach lexical level]:(14) Latin/Greek Variations: Turkish, Turkmen(*2*)(??), Uzbek(*2*)(??) Cyrillic-baed: Azerbaijani(??), Kazakh(??), Kirghiz(??) Devanagari Variations: Burmese, Khmer, Lao(?), Sanskrit, Thai Others: Kannada(?), Malayalam(?), Tamil C.[Vertually YES - Easily distinguishable by character form]: (10) Arabic Variations: (10) D.[YES]: (131) Latin/Greek Variations: (86) Cyrillic Variations: (25) Hebrew Variations: ( 3) Devangari Variations: ( 8) Others: ( 9) *1* Kurdish also uses Cyrllic, Roman and Armenian. *2* Moldavian, Turkmen, Uzbek, Mongolian used (or still is using) Cyrillic until recently. List of Contributers ==================== Shanley Allen <allenMail to author|Respond to list|Read more issues|LINGUIST home page|Top of issuempi.nl> the Babesther <han
minerva.cis.yale.edu> Rita Bhandari <bhandari
semlab1.sbs.sunysb.edu> Doug Cooper <doug
chulkn.car.chula.ac.th> Peter Daniels <pdaniels
press-gopher.uchicago.edu> Boris Fridman Mintz <fridman
ucol.mx> Stefan Frisch <frisch
babel.ling.nwu.edu> Hideo Fujii <fujii
mackay.cs.umass.edu> Keith Goeringer <keg
violet.berkeley.edu> Henry Groover <hgroover
qualitas.com> Mark Hansell-Mai Hansheng <mhansell
carleton.edu> Susantha Herath <herath
u-aizu.ac.jp> Matthew Hurst <matth
cogsci.ed.ac.uk> Hiroaki Kitano <6500hiro
ucsbuxa.ucsb.edu> Wolfram Kahl <kahl
hermes.informatik.unibw-muenchen.de> Jee Eun Kim <jeeeunk
microsoft.com> Wenchao Li <wcli
vax.ox.ac.uk> Stuart Luppescu <sl70
musuko.spc.uchicago.edu> Greg Lyons <lcgal
mahidol.ac.th> Duncan MacGregor <aa735
freenet.carleton.ca> Stavros Macrakis <macrakis
osf.org> James Magnuson <magnuson
psych.rochester.edu> Mark A. Mandel <Mark
ccgate.dragonsys.com> Alec McAllister <ECL6TAM
lucs-01.novell.leeds.ac.uk> Philippe Mennecier <ferry
cimrs1.mnhn.fr> Nicholas Ostler <nostler
chibcha.demon.co.uk> Peter Paul <Peter.Paul
arts.monash.edu.au> Gnani Perinpanayagam <gnani
sun3.oulu.fi> Ellen F. Prince <ellen
central.cis.upenn.edu> Steve Seegmiller <SEEGMILLER
apollo.montclair.edu> Dan I. Slobin <slobin
cogsci.Berkeley.EDU> Achim Stenzel <achim
tiger.toppoint.de> Jan-Olof Svantesson <Jan-Olof.Svantesson
ling.lu.se> Joseph Tomei <jtomei
lilim.ilcs.hokudai.ac.jp> Shravan Vasishth <shravan
lisa.lang.osaka-u.ac.jp> Allan C Wechsler <Wechsler
world.std.com> Henk Wolf <H.A.Y.Wolf
stud.let.ruu.nl>