LINGUIST List 12.58

Thu Jan 11 2001

FYI: Portuguese-English Parallel Corpus: COMPARA

Editor for this issue: Lydia Grebenyova <>


  1. Diana Maria de Sousa Marques Pinto dos Santos, Portuguese-English Parallel Corpus - COMPARA

Message 1: Portuguese-English Parallel Corpus - COMPARA

Date: Mon, 08 Jan 2001 16:22:59 +0100
From: Diana Maria de Sousa Marques Pinto dos Santos <>
Subject: Portuguese-English Parallel Corpus - COMPARA

We are pleased to announce that COMPARA, the Portuguese-English parallel
corpus, is now available at

COMPARA is open-ended, freely available on the Web, and made for people who
have never used corpora before as well as for experienced corpus users.
COMPARA's criteria for text alignment allow corpus users to investigate
translational discourse changes such as when and where translators have
chosen to join, separate, delete, add and reorder sentences. Users can also
inspect translators' notes, and the corpus admits more than one translation
per source text.

Only six parallel fiction texts have been fully processed so far, but
permission has been obtained to incorporate many more. 

COMPARA is encoded according to the IMS Corpus Workbench system, developed
at the University of Stuttgart, and is distributed on the WWW via the
DISPARA interface, developed by the Computational Processing of Portuguese

We are grateful for any comments and suggestions regarding both COMPARA and
the DISPARA interface. We also welcome the collaboration of all those
interested in contributing to the COMPARA-DISPARA project. 

Ana Frankenberg-Garcia & Diana Santos
Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue