LINGUIST List 31.3547
Wed Nov 18 2020
FYI: November 2020 Newsletter - Linguistic Data Consortium (LDC)
Editor for this issue: Everett Green <everettlinguistlist.org>
Date: 16-Nov-2020
From: Membership Coordinator <ldc
ldc.upenn.edu>
Subject: November 2020 Newsletter - Linguistic Data Consortium (LDC)
E-mail this message to a friend In this newsletter:
Join LDC for Membership Year 2021
Spring 2021 Data Scholarship Application Deadline
New Publications:
Global TIMIT Learner Simple English
LORELEI Ukrainian Representative Language Pack
TAC KBP Event Argument – Comprehensive Training and Evaluation Data 2016-2017
________________________________________
Join LDC for Membership Year 2021
Membership Year 2021 (MY2021) is open and discounts are available for those who keep their membership current and join early. Current MY2020 members who renew their LDC membership before March 1, 2021 will receive a 10% discount off the membership fee. New or returning organizations will receive a 5% discount when joining by March 1.
In addition to receiving new publications, current LDC members also enjoy the benefit of licensing older data at reduced costs from our Catalog of over 850 holdings. Current-year for-profit members may use most data for commercial applications.
For full descriptions of all LDC data sets, browse our Catalog.
Visit Join LDC for details on membership, user accounts and payment.
Spring 2021 Data Scholarship Application Deadline
Applications are now being accepted through January 15, 2021 for the Spring 2021 LDC Data Scholarship program which provides university students with no-cost access to LDC data. Consult the LDC Data Scholarship page for more information about program rules and submission requirements.
________________________________________
New publications:
(1) Global TIMIT Learner Simple English was developed by LDC and Shanghai Jiao Tong University and consists of approximately 12 hours of L1 and L2 English read speech and transcripts. It is comprised of two separate data sets of 50 speakers reading 120 sentences from TIMIT Acoustic-Phonetic Continuous Speech Corpus (LDC93S1) deemed “simple” to read by Chinese learners of English. Among the 120 sentences, 20 sentences were read by all speakers, 40 sentences were read by 10 speakers, and 60 sentences were read by one speaker, for a total of 820 sentence types.
Global TIMIT Learner Simple English is distributed via web download.
2020 Subscription Members will automatically receive copies of this corpus. 2020 Standard Members may request a copy as part of their 16 free membership corpora. Non-members may license this data for a fee.
*
(2) LORELEI Ukrainian Representative Language Pack consists of Ukrainian monolingual text, Ukrainian-English parallel and comparable text, annotations, supplemental resources, and related software tools developed by LDC for the DARPA LORELEI program.
LORELEI Ukrainian Representative Language Pack is distributed via web download.
2020 Subscription Members will automatically receive copies of this corpus. 2020 Standard Members may request a copy as part of their 16 free membership corpora. Non-members may license this data for a fee.
*
(3) TAC KBP Event Argument – Comprehensive Training and Evaluation Data 2016-2017 was developed by LDC and contains training and evaluation data produced in support of the 2016 TAC KBP Event Argument Linking Pilot and Evaluation tasks and the 2017 Event Argument Linking Training Evaluation task.
TAC KBP Event Argument – Comprehensive Training and Evaluation Data 2016-2017 is distributed via web download.
2020 Subscription Members will automatically receive copies of this corpus. 2020 Standard Members may request a copy as part of their 16 free membership corpora. Non-members may license this data for a fee.
Linguistic Field(s): Computational Linguistics
Page Updated: 18-Nov-2020