Every year since 2002, the linguistic development environment NooJ has been enhanced with new features that allow social scientists to develop new applications and explore new domains. The 2011 conference was no exception: the arrival of v3.0 brought many more features and a new range of applications, from the analysis of ancient Arabic and Old English texts to the analysis of conversations held by the Mars500 mission’s astronauts. At the 2011 conference, members of the European Meta-Net CESAR project announced that NooJ will soon be available as open source and will become the de facto standard tool for corpus processing in European social science research.
Today, NooJ is used as a research tool in over 30 academic and research centers worldwide, and NooJ modules are available for over 20 languages. The international NooJ conference is organized every year; 50 participants present their work in the domains of linguistic formalization, corpus processing and Natural Language Processing applications.
The present volume contains a selection of papers from the NooJ 2011 International Conference, which was held from 13 to 15 June 2011 in Dubrovnik, Croatia. The papers address problems in machine translation, information extraction, processing of multi-word units, automatic disambiguation, semantic analysis, and the psychological and literary analysis of various corpora.