LINGUIST List 25.2446

Thu Jun 05 2014

FYI: Release of BDPA 1.0 (Benchmark Database of Phonetic Alignments)

Editor for this issue: Uliana Kazagasheva

Date: 05-Jun-2014
From: Jelena Prokic
Subject: Release of BDPA 1.0 (Benchmark Database of Phonetic Alignments)

We are pleased to announce the release of BDPA 1.0, a Benchmark Database of Phonetic Alignments in Historical Linguistics and Dialectology. BDPA 1.0 is a publicly available benchmark database of manually edited multiple and pairwise phonetic alignments, designed as a platform for testing the performance of automatic alignment algorithms. The database consists of a wide variety of alignments drawn from a number of different sources. Currently, BDPA 1.0 offers a total of 750 multiple alignments, covering 8 language families, more than 500 different language varieties, and more than 50 000 words. The data is arranged in such a way that typical problems encountered in phonetic alignment analyses (metathesis, splits and mergers of sounds, diversity of phonetic strings) are represented and can be tested directly.

BDPA is freely available for research purposes and can be searched and downloaded at

Johann-Mattis List and Jelena Prokic
Forschungszentrum Deutscher Sprachatlas
Philipps-Universität Marburg

Linguistic Field(s): Computational Linguistics; Historical Linguistics

Page Updated: 05-Jun-2014