Dear List Subscribers,

In March I posted a request for any information available on medical dictionaries and text available electronically. This is the list of responses I received. THANK YOU TO ALL for the time you took to help!

Regards,
Gillian Smith
CirNoet@AOL.com

******************************

We have developed a bilingual dictionary (English to Spanish) of the most frequently used words in the medical lexis, in cooperation with the computer department of the California State University at Fullerton. The instrument is called R1*TUTOR and is written in C++. The work is the beginning of machine translation of abstracts of medical literature. We hope to present the work at ACL94 at Las Cruces in June-July, 1994. The vector of English medical terms came from 6 different published studies we have been working on since 1985. The material is electronically available but IS NOT in the public domain. The methodology of finding the most frequently used medical terms is available, and if you are interested in creating your own work we will be more than happy to supply you with the tools.

Prof. R. M. Chandler-Burns
College of Medicine
Autonomous University of Nuevo Leon
Monterrey, MEXICO

****************************

I don't know if this qualifies as a corpus, but do you think that Medline might suit your purposes? There is an enormous amount of text in it. I'm not sure if any string searches can be done on it, but it is certainly richly indexed by keywords.

-Greg Dubs
Genetics Dept.
Stanford University

****************************

The best person I know to talk to would be Catherine Macleod, who worked on this type of electronic dictionary. She can be reached at macleod@cs.nyu.edu. Good luck with your project!

Leslie Barrett

****************************

These folks have more than 90 references on lexicography in medicine:

INFOLINGUA (ISSN=1198-1083)
A series of extensive and fully indexed bibliographies in
A.I. - LINGUISTICS - INFORMATICS - COMMUNICATIONS - EDUCATION

**************************************************************************

COMPUTATIONAL MORPHOLOGY : Morphological Analysis and Generation, Lemmatization : BIBLIOGRAPHY, by Conrad F. SABOURIN
1994, 492p, ISBN=2-921173-01-8
prepaid US$ 80
INFOLINGUA inc., P.O. Box 187 Snowdon, Montreal, Canada, H3X 3T4
Number of references : Total = 2350, morphological analysis = 1300, morphological generation = 290, lemmatization = 260, etc.

COMPUTATIONAL PARSING : Syntactic Analysis, Semantic Analysis, Semantic Interpretation, Parsing Algorithms, Parsing Strategies : BIBLIOGRAPHY, by Conrad F. SABOURIN
1994, 2 volumes, 1029p, ISBN=2-921173-02-6,2-921173-03-4
prepaid US$ 150
INFOLINGUA inc., P.O. Box 187 Snowdon, Montreal, Canada, H3X 3T4
Number of references : Total = 5180, syntactic analysis = 1110, semantic analysis = 710, semantic interpretation = 260, parsing algorithm = 200, parsing strategies = 70, etc.

COMPUTATIONAL LEXICOLOGY AND LEXICOGRAPHY : Dictionaries, Thesauri, Term Banks ; Analysis, Transfer and Generation Dictionaries ; Machine Readable Dictionaries ; Lexical Semantics ; Lexicon Grammars : BIBLIOGRAPHY, by Conrad F. SABOURIN
1994, 2 volumes, 1031p, ISBN=2-921173-04-2,2-921173-05-0
prepaid US$ 150
INFOLINGUA inc., P.O. Box 187 Snowdon, Montreal, Canada, H3X 3T4
Number of references : Total = 5910, dictionaries (production) = 1380, thesauri = 680, term banks = 680, analysis dictionaries = 1230, transfer dictionaries = 140, generation dictionaries = 60, lexical database/machine readable dictionaries = 550, lexical semantics = 780, lexicon grammar = 110, etc.

COMPUTATIONAL TEXT UNDERSTANDING : Natural Language Programming, Argument Analysis : BIBLIOGRAPHY, by Conrad F. SABOURIN
1994, 657p, ISBN=2-921173-06-9
prepaid US$ 80
INFOLINGUA inc., P.O. Box 187 Snowdon, Montreal, Canada, H3X 3T4
Number of references : Total = 3830, natural language programming = 110, argument analysis = 80, etc.

COMPUTATIONAL TEXT GENERATION : Generation from Data or Linguistic Structure, Text Planning, Sentence Generation, Explanation Generation : BIBLIOGRAPHY, by Conrad F. SABOURIN with a survey article by Mark T. Maybury
1994, 649p, ISBN=2-921173-07-7
prepaid US$ 80
INFOLINGUA inc., P.O. Box 187 Snowdon, Montreal, Canada, H3X 3T4
Number of references : Total = 2870, text generation from data = 1060, text generation from structure = 730, text planning = 180, sentence generation = 310, explanation generation = 330, etc.

NATURAL LANGUAGE INTERFACES : Interfaces to Databases, to Expert Systems, to Robots, to Operating Systems, and to Question-Answering Systems : BIBLIOGRAPHY, by Conrad F. SABOURIN
1994, 2 volumes, 847p, ISBN=2-921173-08-5,2-921173-09-3
prepaid US$ 130
INFOLINGUA inc., P.O. Box 187 Snowdon, Montreal, Canada, H3X 3T4
Number of references : Total = 4100, interface to database = 1100, to expert system = 70, to question-answering system = 640, to robot = 70 ; conversation system = 300, etc.

MACHINE TRANSLATION : Aids to Translation, Speech Translation : BIBLIOGRAPHY, by Conrad F. SABOURIN and Laurent R. BOURBEAU
1994, 2 volumes, 1168p, ISBN=2-921173-10-7,2-921173-11-5
prepaid US$ 180
INFOLINGUA inc., P.O. Box 187 Snowdon, Montreal, Canada, H3X 3T4
Number of references : Total = 8070, aids to translation = 550, speech translation = 100 ; 60 different natural languages ; 120 systems

LITERARY COMPUTING : Style Analysis, Author Identification, Text Collation, Literary Criticism : BIBLIOGRAPHY, by Conrad F. SABOURIN
1994, 581p, ISBN=2-921173-12-3
prepaid US$ 80
INFOLINGUA inc., P.O. Box 187 Snowdon, Montreal, Canada, H3X 3T4
Number of references : Total = 4060, style analysis = 700, author identification = 340, text collation = 220, literary concordances and indexes = 840, fiction = 670, poetry = 670, theatre = 200, bible/tora/quran = 500, theme analysis = 100, creative text generation = 140, etc.

COMPUTER ASSISTED LANGUAGE TEACHING : Teaching Vocabulary, Grammar, Spelling, Writing, Composition, Listening, Speaking, Translation, Foreign Languages ; Text Composition Aids, Error Detection and Correction, Readability Analysis : BIBLIOGRAPHY, by Conrad F. SABOURIN and Elca TARRAB
1994, 2 volumes, 1066p, ISBN=2-921173-13-1,2-921173-14-X
prepaid US$ 150
INFOLINGUA inc., P.O. Box 187 Snowdon, Montreal, Canada, H3X 3T4
Number of references : Total = 8010, teaching orthography = 130, writing = 1500, composition = 770, grammar = 430, listening/comprehension = 150, reading = 830, speaking = 200, vocabulary = 250, keyboarding = 60, foreign languages = 1900 ; lexical/grammatical error detection/correction = 500, text composition support = 440, etc.

COMPUTER MEDIATED COMMUNICATION : Computer Conferencing, Electronic Mail, Electronic Publishing, Computer Interviewing, Interactive Text Reading, Group Decision Support Systems, Idea Generation Support Systems, Human-Machine Communication, Multi-Media Communication, Hypertext, Hypermedia, Linguistic Games : BIBLIOGRAPHY, by Conrad F. SABOURIN
1994, 2 volumes, 862p, ISBN=2-921173-15-8,2-921173-16-6
prepaid US$ 130
INFOLINGUA inc., P.O. Box 187 Snowdon, Montreal, Canada, H3X 3T4
Number of references : Total = 5680, hypertext = 1500, hypermedia = 440, computer conferencing = 550, electronic mail = 400, electronic publishing = 370, multimodal communication = 100, human-machine communication = 960, computer interviewing = 100, etc.

ELECTRONIC DOCUMENT PROCESSING : Document Editing, Formatting, Typesetting, Coding, Storing, Interchanging, Managing : BIBLIOGRAPHY, by Conrad F. SABOURIN
1994, 551p, ISBN=2-921173-17-4
prepaid US$ 80
INFOLINGUA inc., P.O. Box 187 Snowdon, Montreal, Canada, H3X 3T4
Number of references : Total = 4260, document editing = 2400, formatting = 140, typesetting = 540, coding/mark-up = 420, interchanging = 170, management = 260, etc.

COMPUTATIONAL CHARACTER PROCESSING : Character Coding, Input, Output, Synthesis, Ordering, Conversion ; Text Compression, Encryption, Display ; Hashing ; Literate Programming : BIBLIOGRAPHY, by Conrad F. SABOURIN
1994, 580p, ISBN=2-921173-18-2
prepaid US$ 80
INFOLINGUA inc., P.O. Box 187 Snowdon, Montreal, Canada, H3X 3T4
Number of references : Total = 4120, character coding = 550, input = 900, output = 260, conversion = 360 ; text compression = 240, hashing = 110, etc.

QUANTITATIVE AND STATISTICAL LINGUISTICS : Frequencies of Characters, Phonemes, Words, Grammatical Categories, Syntactic Structures ; Lexical Richness, Word Collocations, Entropy, Word Length, Sentence Length : BIBLIOGRAPHY, by Conrad F. SABOURIN
1994, 508p, ISBN=2-921173-19-0
prepaid US$ 80
INFOLINGUA inc., P.O. Box 187 Snowdon, Montreal, Canada, H3X 3T4
Number of references : Total = 3100, frequencies of characters = 60, phonemes = 90, words = 640, grammatical categories = 90, grammatical features = 250 ; lexical richness = 100, word collocations = 230, entropy = 150, word length = 70, sentence length = 90, etc.

MATHEMATICAL AND FORMAL LINGUISTICS : Grammar Formalisms, Grammar Testing, Logics, Quantifiers : BIBLIOGRAPHY, by Conrad F. SABOURIN
1994, 612p, ISBN=2-921173-20-4
prepaid US$ 80
INFOLINGUA inc., P.O. Box 187 Snowdon, Montreal, Canada, H3X 3T4
Number of references : Total = 3840, formal linguistics = 1470, mathematical linguistics = 1910, grammar formalism = 480, grammar testing = 90, logic = 820, quantifiers = 300, etc.

COMPUTATIONAL SPEECH PROCESSING : Speech Analysis, Recognition, Understanding, Compression, Transmission, Coding, Synthesis ; Text to Speech Systems, Speech to Tactile Displays, Speaker Identification, Prosody Processing : BIBLIOGRAPHY, by Conrad F. SABOURIN
1994, 2 volumes, 1187p, ISBN=2-921173-21-2,2-921173-22-0
prepaid US$ 150
INFOLINGUA inc., P.O. Box 187 Snowdon, Montreal, Canada, H3X 3T4
Number of references : Total = 8290, speech analysis = 1110, speech recognition = 2600, speech understanding = 600, speech coding = 560, speech synthesis = 1500, text-to-speech = 560, speaker identification = 290, prosody processing = 600, etc.

COMPUTATIONAL LINGUISTICS IN INFORMATION SCIENCE : Information Retrieval (Full-Text or Conceptual), Automatic Indexing, Text Abstraction, Content Analysis, Information Extraction, Query Languages : BIBLIOGRAPHY, by Conrad F. SABOURIN
1994, 2 volumes, 1047p, ISBN=2-921173-23-9,2-921173-24-7
prepaid US$ 150
INFOLINGUA inc., P.O. Box 187 Snowdon, Montreal, Canada, H3X 3T4
Number of references : Total = 6390, information retrieval = 2100, full-text = 890, conceptual = 60 ; automatic indexing = 930, text abstraction = 270, content analysis = 530, information extraction = 520, etc.

OPTICAL CHARACTER RECOGNITION AND DOCUMENT SEGMENTATION : Character Preprocessing, Thinning, Isolation, Segmentation, Feature Extraction ; Cursive and Multi-Font Recognition, Writer/Scriptor Identification : BIBLIOGRAPHY, by Conrad F. SABOURIN
1994, 512p, ISBN=2-921173-25-5
prepaid US$ 80
INFOLINGUA inc., P.O. Box 187 Snowdon, Montreal, Canada, H3X 3T4
Number of references : Total = 3700, recognition of cursive characters = 910, hand printed characters = 490, printed characters = 390, multi-font characters = 140 ; on-line recognition = 170, writer identification = 330, document segmentation = 320, etc.

*******************************************************************************

ORDERING INFORMATION

All orders must be prepaid in U.S. dollars.
Payment : Bank draft drawn on a U.S. bank
          INTERNATIONAL money order
Payable to : INFOLINGUA inc.
             P.O. Box 187 Snowdon
             Montreal, Qc, H3X 3T4
             CANADA
Information : email : 73651.2144@compuserve.com
Shipping fees : -Surface mail : free
                -Air mail : add US$ 5 per volume inside North America
                          : add US$ 12 per volume outside North America
Sales taxes : -Canadian residents add GST 7%
Discount : 20% to individuals who collaborated by sending bibliographical information or documents.
Shipping date : March 28, 1994 and after

******************************************************************************