Jet (Java Extraction Toolkit)


About JET

JET, the Java Extraction Toolkit, provides a variety of components for language analysis, such as sentence segmentation, name tagging, time expression tagging and normalization, part-of-speech tagging, partial parsing, and coreference analysis. These components can be arranged in pipelines for different applications, and can be used either for interactive analysis of individual sentences or for 'batch' analysis of complete documents. Simple tools are provided for annotating documents and displaying annotated documents. A full set of procedures is also provided for performing information extraction of entities, relations, and events following the ACE [Automatic Content Extraction] specifications.

JET is a work in progress and continues to be expanded and updated regularly.

License

Jet Copyright ©1999-2014 Ralph Grishman

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0 . Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Downloading Jet

Release  Date         Jet distribution tar  Notes
1.6.7    8 Nov 2012   jet-121108.tar.gz
1.7.2    23 Jan 2014  jet-140123.tar.gz     expand guide; add dependency parser; add class files; add Windows script
1.7.5    29 Jun 2014  jet-140629.tar.gz     enhance onomasticon (name dictionary); add actions and guide entries for onomasticon and dependency parser
1.7.6    28 Sep 2014  jet-140928.tar.gz     add 'generic' feature for entities; improved onomasticon matcher
1.8.0    31 Dec 2014  jet-141231.tar.gz     add Brown word clusters and name tagger trained on 3x larger corpus, together producing more robust name annotation (see note below)
1.8.1    9 May 2015   jet-150509.tar.gz     clean up and rename properties files; improve monocase test; fix stemmer; add classes for ICE (AnchoredPath, AnchoredPathSet, DepPathRelationTagger)

[An earlier version of Jet is available here.]

If you find Jet useful for your work or incorporate any part of it into software you distribute, we earnestly request that you notify Prof. Ralph Grishman.

Installing Jet

You need Java 1.5 or later to run Jet. It runs from a terminal window under Linux, Apple Mac OS X, and Windows.

Download the latest Jet distribution tar file and extract all the files with tar -xzvf.

The expanded directory will include

In addition, the directory will contain the following files and directories for those who wish to recompile or modify Jet

If you plan on using the Tratz dependency parser, you will also need to download parseModel.gz and put it in the jet/data directory.

For the best name tagger coverage, download AceOntoMeneModel.gz, uncompress it, put it in the jet/acedata directory, and then run with the following properties:
NameTags.ME.fileName = ../acedata/AceOntoMeneModel
WordClusters.fileName = brownClusters10-2014.txt
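
As a rough illustration of how these two settings read as key/value pairs, the sketch below parses them with the standard java.util.Properties class. It assumes that Jet's properties files follow the ordinary Java properties syntax; the class name JetPropertiesSketch and its load method are hypothetical and are not part of Jet.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.Properties;

// Hypothetical helper for illustration only; not part of the Jet distribution.
public class JetPropertiesSketch {

    // The two settings quoted above, written as they would appear in a properties file.
    static final String SETTINGS =
        "NameTags.ME.fileName = ../acedata/AceOntoMeneModel\n" +
        "WordClusters.fileName = brownClusters10-2014.txt\n";

    // Parses properties-file text; whitespace around '=' is ignored by the parser.
    static Properties load(String text) {
        Properties props = new Properties();
        try {
            props.load(new StringReader(text));
        } catch (IOException e) {
            // Reading from an in-memory string should not fail; rethrow unchecked.
            throw new IllegalStateException(e);
        }
        return props;
    }

    public static void main(String[] args) {
        Properties props = load(SETTINGS);
        // Print each key with the value a program would see for it.
        for (String key : props.stringPropertyNames()) {
            System.out.println(key + " -> " + props.getProperty(key));
        }
    }
}
```

Note that the name tagger model is given as a relative path while the word cluster file is given as a bare file name; how Jet resolves each of them is not covered by this sketch.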

To use Jet,

The documentation for the current release (also included in the download) is here.