This book "supplies a vocabulary of English words and idiomatic phrases 'arranged … according to the ideas which they express'. The thesaurus, continually expanded and updated, has always remained in print, but this reissued first edition shows the impressive breadth of Roget's own knowledge and interests."
Syntax-Based Collocation Extraction is the first book to offer a comprehensive, up-to-date review of the theoretical and applied work on word collocations. Backed by solid theoretical results, the computational experiments described based on data in four languages provide support for the book’s basic argument for using syntax-driven extraction as an alternative to the current cooccurrence-based extraction techniques to efficiently extract collocational data. The work described in Syntax-Based Collocation Extraction focuses on using linguistic tools for corpus-based identification of collocations. It takes advantage of recent advances in parsing to propose a novel deep syntactic analytic collocation extraction that has applicability to a range of important core tasks in Computational Linguistics. The book is useful for anyone interested in computational analysis of texts, collocation phenomena, and multi-word expressions in general.