Get Involved at LINGUIST ListGraduate Assistantships and Student Employment
The LINGUIST Team
Graduate Assistantships and Student Employment
The LINGUIST List, with the support of donations from subscribers and the publishing community and the hosting institution Indiana University (College of Arts and Sciences), currently provides support for 4 graduate students, who serve as LINGUIST List editors and participate in the ongoing research projects and activities. Graduate assistants are usually enrolled in one of the Indiana University linguistics programs, although some past GAs have been pursuing degrees in computer or cognitive science, informatics, library science, or a related discipline.
If you are interested in pursuing graduate studies at Indiana University in any of the linguistics programs, please consult the Department of Linguistics website and ask the program coordinators for advice and support. If you are interested in getting involved and working at The LINGUIST List, contact us to discuss options and opportunities.
The LINGUIST List offers summer internships for undergraduate students, even senior highschool students. If you are interested in working with The LINGUIST List over the summer, consider applying for our summer internship program!
Interns at LINGUIST List have the opportunity to participate in the daily operations of the LINGUIST List. They also get involved in interdisciplinary research projects and activities under the supervision of local faculty at Indiana University.
Some of the projects interns may work on include:
- Language and Location: A Map Accessibility Project (LL-MAP): This project began as a joint NSF-sponsored project of Eastern Michigan University and Stockholm University. In LL-MAP, language information is integrated with data from the physical and social sciences by means of a Geographical Information System (GIS). Tasks for this project include map-making (using GlobalMapper and ArcGIS software) and using the LL-MAP Scholar's Workbench to style and upload maps. This project is being redesigned and restarted at Indiana University. We are working on new components and extensions of the GIS system, as well as the integration of the maps and geolinguistic data in other language and linguistic information systems.
- MultiTree: The MultiTree project is a digital library of scholarly hypotheses about language relationships and subgroupings. This information is organized in a searchable database with a web interface, and each hypothesis is presented graphically as a diagram of a family tree. Typical tasks for this project include researching language relationship hypotheses and entering this information into the MultiTree database. The MultiTree data and system is a unique resource for language code and relation information. We are not only extending its data content, but also working on the integration of the data and system in other language and linguistic information systems.
- Interns may also work on the General Ontology for Linguistic Description (GOLD), the Lexical Enhancement via the GOLD Ontology (LEGO) project, or language resource and technology projects related to under-resourced and endangered languages, in addition to assisting with the LINGUIST List mailing list and website. These environments and systems are being reactivated at the new hosting institution. Interns are not only welcome to help us extending and developing the content of these systems and information resources, but also to provide technological asisstance and support for redeveloping and engineering of new technologies for these and new language data sets.
- The LINGUIST List interns also can get involved in other computational linguistics and language technology projects. These projects are related to language data encoding in various formats and based on standard technologies (e.g. XML and various encoding standards and strategies). The projects teach interns how to use common language technologies for the annotation of language corpora, the creation of digital dictionaries and lexicons, or audio and video recordings for language documentation. They also involve the use of common Natural Language Processing (NLP) algorithms and Human Language Technology (HLT) applications. This could also involve the development (programming or coding) of new algorithms, or online tools and interfaces. In general, these tasks incude the creation of language resources, algorithms and technologies for general NLP tasks and HLT applications. This summer our particular focus will be:
- Development of speech and text corpora for under-resourced, endangered or even extinct languages with available audio recordings; development of acoustic and language models, training of Automatic Speech Recognition tools for these languages.
- Engineering of Finite State Morphologies using formal and rule-based frameworks based on XFST, Foma, or OpenFST, and development of machine learning approaches to generate morphologies or probabilistic Weighted Finite State Transducers (WFST) for lexical analysis and generation.
- Grammar engineering and parser development using formalisms like Lexical Functional Grammar (LFG) or alternative rule- or unification-based and probabilistic systems for the syntactic and semantic analysis of low-resourced languages.
- Knowledge representations and processing based on ontologies modeled in Protege using OWL and various reasoning tools for NLP and HLT systems.
The focus in the NLP and HLT projects is strongly related to linguistics, in particular systems based on linguistic knowledge, applications for language documentation and lingusitic analysis, the creation of language resources for various purposes, and the study of languages in general.
The technologies that we work with are for example:
- Data storage: XML, SQL in PostgreSQL and MySQL, Unicode text
- Web development: Apache, PostreSQL, MySQL, CGI, Node, Django, Ruby on Rails, ColdFusion
- NLP tools: CoreNLP, XFST, Foma, OpenFST, XLE, HTK, Sphinx, many more
- Development environments: Eclipse, NetBeans, PyCharm, VIM, etc.
Internships are generally available for a three-month period between May and August, 30 to 40 hours per week, and interns receive a modest stipend. Housing is not provided, although The LINGUIST List team and the local linguistics students and faculty at Indiana University in Bloomington can provide some assistance in locating accommodations. International applicants are encouraged, but must have or obtain an appropriate visa, i.e. one that permits them to work, study, or intern in the US.
Keep an eye out for the announcement of our summer internships, which is posted in the winter!
The LINGUIST List welcomes volunteers with an aptitude in linguistics and technology. If you are in the Bloomington area and able to commit to at least 10 hours per week, contact us about volunteer opportunities during the entire year!
The LINGUIST Team
Learn more about the current LINGUIST crew!
LINGUIST positions attract students from around the world; past and present LINGUIST team members have joined us from many countries.Read more about LINGUIST Listers' experiences or like the LINGUIST List on Facebook or Google+, and follow us on Twitter!
Email Address: linguistlinguistlist.org
The LINGUIST List
Department of Linguistics
Memorial Hall 322
1021 E. 3rd Street
Bloomington, IN 47405-7005
Telephone Number (with voice box and text): +1 812 391-3602
Fax Number: +1 888 908-2629