LINGUIST List 36.2864

Wed Sep 24 2025

FYI: New German-English corpus available - MPI EVA Leipzig Corpus

Editor for this issue: Daniel Swanson <daniellinguistlist.org>



Date: 24-Sep-2025
From: Antje Quick <antje.quickuni-leipzig.de>
Subject: New German-English corpus available - MPI EVA Leipzig Corpus
E-mail this message to a friend

We are pleased to announce that the first German-English bilingual corpus is now available on the CHILDES platform.
This longitudinal and dense corpus contains transcripts of spontaneous child–adult interaction involving three bilingual children. At present, only the data from Fion are available, with the remaining children’s corpora to be added in the near future.

The Fion corpus spans the period from age 2;3 to 3;11, comprising 211 hours of recordings with 53,372 child utterances and 120,511 input utterances (excluding utterances containing unintelligible parts such as xxx). When including incomplete and partially intelligible utterances, the totals rise to 108,474 utterances for the child and 184,923 utterances for the input.

The data can be accessed through the CHILDES database here: https://talkbank.org/childes/access/Biling/MPI-EVA-Leipzig.html

We hope this contribution will serve as a useful tool for the community, and we welcome any feedback from researchers making use of the data.

Linguistic Field(s): Language Acquisition
Text/Corpus Linguistics

Subject Language(s): English (eng)
German (deu)




Page Updated: 24-Sep-2025


LINGUIST List is supported by the following publishers: