Natural Language Processing

Faculty

Ralph Grishman   Naomi Sager   Satoshi Sekine

Description

The amount of text which is available in electronic form is growing at an explosive rate. In addition to the web, large quantities of text are being collected for medical, legal, commercial, and scientific applications. But the tools for getting the information we need out of this text are still quite primitive. Our research groups in natural language processing are building systems to to extract specific information from large text collections, and to present it in the user's preferred language.

The Linguistic String Project was one of the pioneers in natural language processing research in the United States. Its current focus is on extracting information from the text in hospital medical records; it has developed systems for English, French, and German.

The Proteus Project focuses on automatically learning the linguistic knowledge needed for information extraction and machine translation. It has developed extraction systems in English and Japanese, and a series of language-independent translation models. It also conducts a wide range of basic research, and develops large-scale dictionaries and other resources for natural language processing.

Research Interests

Ralph Grishman Founder of the Proteus Project. Acquisition of linguistic knowledge, extraction of information from large corpora, machine translation.
Satoshi Sekine Information extraction, syntactic parsing, domain-specific language.
Naomi Sager Director, Linguistic String Project, focusing on extracting information from the text in hospital medical records.

Related Projects

Linguistic String   Proteus  


top | contact webmaster@cs.nyu.edu