This book describes new methodological and technological approaches to
corpus building and presents recent research based on the Norwegian
Newspaper Corpus. This is a large monitor corpus of contemporary Norwegian
language, compiled through daily harvesting of web newspapers. The book
gives an overview of the corpus and its system architecture, and presents tools
used for tasks such as text harvesting, annotation, topic classification and
extraction and frequency profiling of new words and phrases. Among the
innovative technologies is Corpuscle, a corpus query engine and management
system which is flexible enough to handle very large corpora in an efficient way.
The individual research contributions based on the corpus explore different
aspects of Norwegian, including the occurrence of anglicisms, neologisms and
terminology, and the use of metonymy and metaphor in newspaper language.
The book also describes an innovative method of applying correspondence
analysis and implicational analysis to investigate interdependencies between