LINGUIST List 29.4588

Mon Nov 19 2018

Jobs: Applied Linguistics; Computational Linguistics; Text/Corpus Linguistics: Scientist, Université Catholique de Louvain (UCL)

Editor for this issue: Becca Morris <>

Date: 16-Nov-2018
From: Laurence MUNDSCHAU <>
Subject: Applied Linguistics; Computational Linguistics; Text/Corpus Linguistics: Scientist, Université Catholique de Louvain (UCL), Louvain-La-Neuve, Belgium
E-mail this message to a friend

University or Organization: Université Catholique de Louvain (UCL)
Department: Institute for Language and Communication (ILC)
Job Location: Louvain-La-Neuve, Belgium
Web Address:
Job Title: Data Manager

Job Rank: Scientist

Specialty Areas: Applied Linguistics; Computational Linguistics; Text/Corpus Linguistics


UCLouvain is seeking a Data Manager (M/F)
- Part-time position (50%) for a fixed term of 18 months (with the possibility of extension)
- Starting date: Immediate

Current research makes extensive use of written and oral linguistic data in different languages (French, Spanish, English, Dutch, etc.). To be usable, this data must be documented (metadata), anonymised (in order to comply with data protection requirements), annotated (transcription, indexing, thematic analysis, etc.) and deposited in databases that can be searched online. The Data Manager will contribute to these various tasks within the Institute for Language and Communication (ILC), specifically within the Linguistic Research Unit (PLIN) and the Natural Language Processing Centre (CENTAL platform).

Job description
In collaboration with ILC/PLIN researchers, the Data Manager will:
- Supervise the processing chain for the creation of oral and written corpora (data acquisition, documentation of metadata, transcriptions and annotations, inclusion in existing databases, standardisation of the formats used)
- Develop tools for pre-processing and processing data (segmentation, text-to-sound alignment, text-to-text alignment, automatic or semi-automatic annotation, etc.)
- Monitor technological developments to enhance data interoperability (documented and processed according to international standards; cf. Clarin, Ortolang, Olac, etc.) and improve data acquisition (automatic speech recognition, tokenisation, etc.)
- Ensure compliance with legal and ethical agreements relating to data protection (GDPR)
- Represent UCLouvain in different international linguistic data consortia.

Qualifications and skills required:
Applicants must have the following:
- A Master’s degree in Linguistics, specialising in natural language processing, or equivalent
- Programming skills: Perl and/or Python, good knowledge of XML
- Ability to process linguistic data in several languages (French, English, Dutch, Spanish, German, etc.)
- Knowledge of English (B2), particularly academic English (to be able to take part in international meetings and contribute to research publications)
- Ability to work as part of a team, excellent listening skills and ability to analyse needs, adaptability

Letter of application, CV and recent photo in passport format should be sent by e-mail (preferably) to the application email below or by mail to the mailing address below.

Application Deadline: 30-Nov-2018
Mailing Address for Applications:
Anne Catherine Simon
Institut Langage et Communication (UCL/ILC)
Place Blaise Pascal, 1
Boite L3.03.33
Louvain-la-Neuve B-1348
Email Address for Applications:
Contact Information:
Anne Catherine SIMON

Page Updated: 19-Nov-2018