Editor for this issue: Joel Jenkins <joellinguistlist.org>
Dear colleagues,
A team of us at The University of Melbourne and our industry partner Redenlab (https://redenlab.com/) are working on a pipeline for automated parts-of-speech tagging across different languages. We are looking for linguist native speakers of various languages.
THE LANGUAGES FOR WHICH WE ARE SEEKING NATIVE SPEAKERS: Afrikaans, Arabic, Armenian, Asturian, Basque, Belarusian, Bulgarian, Chinese (Traditional), Chinese (Simplified), Croatian, Czech, Danish, Estonian, Farsi/Persian, Finnish, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Japanese, Kazakh, Korean, Kyrgyz, Latvian, Lithuanian, Macedonian, Maltese, Marathi, Norwegian, Polish, Portuguese (Portugal), Romanian, Russian, Serbian, Slovak, Slovenian, Swedish, Tamil, Telugu, Ukrainian, Uyghur, Vietnamese, Welsh, Wolof.
THE WORK: We're looking for linguists to label a 120-word passage for its parts of speech in their native language (estimated max 2 hours), and if available a second native-speaking linguist to double check the labelling.
THE PROJECT: We would then compare the manually-labelled parts of speech with available automated methods. This work will be unpaid, but we will be writing the work into a journal article and will include everyone who does any part-of-speech tagging as a co-author as part of a consortium. We are aiming for the parts-of-speech tagging of the 120-word passage to be completed in the next month, to then write into a paper ready to submit in early 2025.
If you are a linguist (Bachelor's or higher degree in Linguistics) who is a native speaker of any of the above languages, feel free to email me ([email protected]) with 1-2 sentences about your degree and experience in Linguistics and any questions, and I will get back to you with more info and next steps.
Regards,
Loretta (Lottie) Gasparini
PhD Candidate
The University of Melbourne
Email: [email protected]; [email protected]
Linguistic Field(s): Computational Linguistics
Page Updated: 22-Oct-2024
LINGUIST List is supported by the following publishers: