Editor for this issue: Anne Clarke <anne
linguistlist.org>
University or Organization: Volt Computer Services Department: Assignment at Microsoft Redmond Rank of Job: Software Testers/Linguists/ SPN, JPN and GER Specialty Areas: Text/Corpus Linguistics Required Language(s): German, Standard (Code = GER); Japanese (Code = JPN); Spanish (Code = SPN) Description: Three Linguist openings Microsoft's Natural Interactive Services Division (Year-long assignments) in Redmond, Washington (USA) This manager is looking for three candidates to support the development of language resources in different languages. Native speakers required currently for Spanish, German, and Japanese. This team works with language resources to develop speech and language technologies, solutions, user experiences, and platforms to improve the customer experience. They are bodies of data, including both acoustic and text data. Language Resources are used to develop technologies such as a Microsoft's Lexical Services Platform (LSP), Document Understanding, SemScript/Language Object Model, Speech Recognition technologies, and the Natural User Interface Platform (NUI-P). Position description: For the most part (approx. 80-90%), these new staff will be working with electronic text data such as newspapers, books, magazines, business documents, academic papers, etc. The team licenses the data (gets the legal rights needed to work with copywrited material), catalogs the data into the database (library), and later extracts the data to build test and dev data sets for training and testing natural language technologies. There is a small part of the work (10 - 20%) that is separate from the text data work where they will validate acoustic (speech) corpora. That work involves listening to sound files and comparing the files to transcripts, using Microsoft transcription guidelines. The team utilizes SQL queries, PERL scripts and UNIX tools to build and validate data sets through a Corpora Library or database. The position will require cataloging, packaging, importing text data in to the NISD (Natural Languages Interactive Services Division) corpora library as well as sampling, cleaning and preparing of data sets. They will additionally be responsible for validation of Speech (acoustic) corpora and other data analysis within NISD. Skills required and skills preferred: - The candidates must already be in the USA, as they need them to start within 2-3 weeks. If an H1B work permit is required, we will only be able to process it in that time if the candidates are already in the USA on a different type of visa. Candidates who already have a work permit who can start in 2-3 weeks are welcome, regardless of their location. - Native language skills in one of the following languages: Spanish, German and Japanese. - Knowledge PERL scripts is required. - Experience with SQL queries, UNIX tools, writing and executing macros would be a plus. - Should be detail oriented, very organized, possess excellent communication skills and able to work independently. - Bachelors degree in Linguistics, Computer Science or Engineering, and / or experience with testing preferred. Start: ASAP, not more than 2-3 weeks. Length of assignments: 3 months, extensible to one year. Location: Microsoft Building 119, in Redmond, Washington Core hours: 9 AM to 5PM Emailed resumes in Word format are strongly preferred. Position has been filled. This announcement was accompanied by a donation to the LINGUIST List!Mail to author|Respond to list|Read more issues|LINGUIST home page|Top of issue