* * * * * * * * * * * * * * * * * * * * * * * *
LINGUIST List logo Eastern Michigan University Wayne State University *
* People & Organizations * Jobs * Calls & Conferences * Publications * Language Resources * Text & Computer Tools * Teaching & Learning * Mailing Lists * Search *
* *
LINGUIST List 20.3325

Fri Oct 02 2009

Jobs: Comp Ling, Natural Lang Processing: Software Engineer, Amazon

Editor for this issue: Erica Wicks <ericalinguistlist.org>

The LINGUIST List strongly encourages employers to use non-discriminatory standards in hiring policy. In particular we urge that employers do not discriminate on the grounds of race, ethnicity, nationality, age, religion, gender, or sexual orientation. However, we have no means of enforcing these standards.

Job seekers should pay special attention to language in ads regarding employment requirements and are encouraged to consult our international employment page at http://linguistlist.org/jobs/jobnet.html. This page has been set up so that people can report on the employment standards of various countries.

To post to LINGUIST, use our convenient web form at http://linguistlist.org/posttolinguist.html
        1.    C.C. Scott, English & Computational Linguistics, Natural Language Processing and Inforamtion Retrieval: Software Development Engineer, Amazon, Washington, USA

Message 1: English & Computational Linguistics, Natural Language Processing and Inforamtion Retrieval: Software Development Engineer, Amazon, Washington, USA
Date: 30-Sep-2009
From: C.C. Scott <ccscottamazon.com>
Subject: English & Computational Linguistics, Natural Language Processing and Inforamtion Retrieval: Software Development Engineer, Amazon, Washington, USA
E-mail this message to a friend

University or Organization: Amazon
Job Location: Washington, USA
Web Address: http://www.amazon.com
Job Rank: Software Development Engineer

Specialty Areas: Computational Linguistics; Natural Language Processing, Data Mining, Information Retrieval

Required Language(s): English (eng)


Amazon.com's Darwin team is looking for exceptional software engineers to
develop algorithms and build systems to automatically solve a variety of
Information Retrieval and Data Mining problems related to the Amazon
Product Catalog - one of the company's biggest assets.

Our principal challenge is to improve the shopping experience by detecting
duplicate products for sale in the catalog and merging them. Merchants on
Amazon.com provide information about the products they want to sell. Amazon
attempts to match these product data submissions to items in its catalog so
that it can display offers for the same product on a single page. Poorly
structured or incomplete data makes this problem very challenging and often
results in duplicate products getting created in the catalog. These
duplicate products are shown in search results and end up confusing
customers, leading to a bad customer experience. The Darwin team detects
these duplicate products in the Amazon.com catalog using an innovative mix
of Information Retrieval, Data Mining and Natural Language Processing
algorithms and human intelligence harnessed via the Amazon Mechanical Turk.
We then automatically merge products detected as duplicates together,
improving customer experience and the quality of the catalog.

We are also responsible for a variety of other Catalog-related projects
such as placing Product Advertisements on pages, automatically extracting
important product features from the product description with a view to
improving the discovery (search and browse) experience on the website and
detecting egregious cases of poor quality data provided by sellers.

We are a highly-motivated, co-operative and fun loving team who thrive on
solving challenging problems with innovation. As part of this team you will
be analyzing data, developing new algorithms, building large-scale
distributed software systems in Java using open source technologies such as
Apache Lucene and JBoss and other Amazon.com proprietary technologies.


The ideal candidate will have the following qualifications:

* Advanced degree in Computer Science, Math or related field with 5+
years of experience.
* Past experience in at least one of the following areas - Information
Retrieval, Data Mining, Natural Language Processing or Machine Learning.
* Desire to analyze data while developing solutions to problems.
* Strong desire to build high-performance, highly-available and
scalable distributed systems.
* Strong design and coding skills in Java/C++ on Unix Platforms.
* Familiar with Perl and have a good understanding of SQL.
* Be highly innovative, flexible and self-directed.
* Excellent written and verbal communication skills.

Application Deadline: 31-Dec-2009

Email Address for Applications: ccscottamazon.com
Contact Information:
C.C. Scott
Email: ccscottamazon.com

Read more issues|LINGUIST home page|Top of issue

Please report any bad links or misclassified data

LINGUIST Homepage | Read LINGUIST | Contact us

NSF Logo

While the LINGUIST List makes every effort to ensure the linguistic relevance of sites listed
on its pages, it cannot vouch for their contents.