Overview of the TARSQI Toolkit Code
These pages give a high-level overview of the TARSQI Toolkit code. All code,
barring a few scripts, lives in the code directory inside the Tarsqi
distribution, and all paths given are relative to that directory. Chapters on
the following components exist or are planned.
Toplevel code
An overview of how documents are processed and how they are steered through
all the components. Also includes a list of the tags added by the Tarsqi
Toolkit.
The Preprocessor
Descriptions of the input to and output from the tokenizer, tagger and
chunker.
GUTime
Descriptions of the input to and output from GUTime, the component that
extracts time expressions. It does not describe how GUTime works
internally.
Evita
Description of how Evita, the event recognizer, fits into the pipeline and
how it operates.
Slinket
A description of how subordinating links are recognized. Not yet written.
S2T
Not yet written.
Blinker
Not yet written.
Link Classifier
Not yet written.
SputLink
Not yet written.
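
As a rough picture of what "steering a document through all the components"
could look like, the following is a minimal sketch in Python. It is not the
toolkit's actual code: the Document class, the run_component and process
functions, and the assumption that components run in the order in which the
chapters are listed above are all hypothetical and serve illustration only.

```python
# Hypothetical sketch: none of these names come from the TARSQI Toolkit itself.

# Component names taken from the chapter list; the order is an assumption.
PIPELINE = [
    "Preprocessor",
    "GUTime",
    "Evita",
    "Slinket",
    "S2T",
    "Blinker",
    "Link Classifier",
    "SputLink",
]


class Document:
    """A stand-in for a document that accumulates results from components."""

    def __init__(self, text):
        self.text = text
        self.history = []  # names of the components that have processed it


def run_component(name, document):
    """Placeholder for a real component: only records that it was applied."""
    document.history.append(name)
    return document


def process(document, pipeline):
    """Pass the document through each component in turn."""
    for name in pipeline:
        document = run_component(name, document)
    return document


doc = process(Document("John left on Monday."), PIPELINE)
```

In this sketch each component receives the document as left by the previous
one, which is the usual reason for fixing the order of a pipeline.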