LINGUIST List 13.2909

Sun Nov 10 2002

Jobs: Modified: Comp Ling, extended deadline, NY

Editor for this issue: Heather Taylor


  1. Andrew Borthwick, Jobs: Modified Re: 13.2555: Computational Ling, extended deadline, NY

Message 1: Jobs: Modified Re: 13.2555: Computational Ling, extended deadline, NY

Date: Wed, 06 Nov 2002 14:03:25 -0500
From: Andrew Borthwick
Subject: Jobs: Modified Re: 13.2555: Computational Ling, extended deadline, NY

University or Organization: ChoiceMaker Technologies, Inc.
Rank of Job: Researcher
Specialty Areas Required: Computational Linguistics, Text/Corpus
 Linguistics, Machine Learning


Java Data Quality/Machine Learning Developer/Researcher

ChoiceMaker Technologies has developed a patent-pending
machine-learning system, ChoiceMaker 2.0, that matches records of
people, businesses, or other entities in large databases filled with
inconsistent information. For instance, ChoiceMaker 2.0 can recognize
that "Arnold Schwarzenegger" and "Arnie Shwarzeneger" are the same
individual. The system can be used to remove duplicate records from a
single database, match records across multiple databases, or search a
database approximately. Clients include the New York City Department
of Health and the U.S. Census Bureau.

Founded in 1998, ChoiceMaker Technologies is a New York City-based
start-up with a highly talented staff that includes three computer
science Ph.D.s. The company has won two Small Business Innovation
Research grants from the National Science Foundation totaling $600,000
to further its ground-breaking work in machine learning approaches to
approximate record matching.

ChoiceMaker seeks a talented computer scientist or computational
linguist, skilled in Java, to perform multiple tasks:

* Customize ChoiceMaker 2.0 for clients, especially to deploy the ML
 matching system on new data and new types of data.
* Perform NSF-funded research into machine learning, data parsing, and
 data standardization techniques that will improve ChoiceMaker 2.0's
 accuracy or convenience.
* Program Java applications, such as user interfaces and data analysis
 programs, that expand ChoiceMaker 2.0's functionality.

Compensation includes a competitive salary, options, and an excellent
benefits package.

Mandatory Qualifications

1. Deep expertise in object-oriented development, including having
 written thousands of lines of Java
2. Machine learning, computational linguistics/natural language
 processing (NLP), or data quality
3. MS or PhD in Computer Science or equivalent experience 

Desired Qualifications

1. Record matching, data de-duplication, data cleaning
2. Artificial intelligence (AI), particularly experimental work
 involving large datasets
3. Server-side Java: J2EE, CORBA, COM, Web services
4. Java GUI: Swing, AWT 
5. Database: JDBC, SQL, Oracle, MS SQL Server, MySQL 
6. XML: SAX, DOM, JDOM, XML Schemas 
7. Multithreaded Java
8. Various: ant, log4j, JUnit, JavaDoc, Collections
9. Design patterns
10. UML
11. Compiler construction
12. Project management
13. C++ 
14. Windows, Linux, UNIX 
15. Eclipse plugin development


Please send your resume to A brief cover
letter describing how you meet the mandatory qualifications is also
helpful. Our web site is No phone calls

Address for Applications:

	Attn: Andrew Borthwick
	ChoiceMaker Technologies, Inc.
	41 East 11th St., 11th Floor
	New York, NY 10003
	United States of America 
	Applications are due by 13-Dec-2002

Contact Information:
	Andrew Borthwick