Overview of the TARSQI Toolkit Code

These pages give a high-level overview of the TARSQI Toolkit code. All code, barring a few scripts, lives in the code directory inside the Tarsqi distribution, and all paths given are relative to that directory. Chapters exist or are planned for the following components.

Top-level code
An overview of how documents are processed and how they are steered through all the components. Also includes a list of the tags added by the TARSQI Toolkit.
The Preprocessor
Descriptions of the input to and output from the tokenizer, tagger and chunker.
GUTime
Descriptions of the input to and output from GUTime, the component that extracts time expressions. Does not describe the internal workings of GUTime.
Evita
Description of how Evita, the event recognizer, fits into the pipeline and how it operates.
Slinket
A description of how subordinating links are recognized. Not yet written.
S2T
Not yet written.
Blinker
Not yet written.
Link Classifier
Not yet written.
SputLink
Not yet written.