CCGbank is a translation of the Penn Treebank into a corpus of Combinatory Categorial Grammar derivations. It pairs syntactic derivations with sets of word-word dependencies which approximate the underlying predicate-argument structure.
CCGbank contains 99.44% of the sentences in the Penn Treebank, for which it corrects a number of inconsistencies and errors in the original annotation.
CCGbank can also be searched with Douglas Rohde's TGrep2, version 1.15 or higher.
Julia Hockenmaier and Mark Steedman juliahrcis.upenn.edu, steedmaninf.ed.ac.uk