Lexical Probabilities File

This file can be generated from a tagged corpus using the script src/utilities/TRAIN provided in the FreeLing package. See the comments in the script file for the corpus format it expects.

The probabilities file has six sections: <UnknownTags>, <Theeta>, <Suffixes>, <SingleTagFreq>, <ClassTagFreq>, <FormTagFreq>. Each section is closed by its corresponding tag: </UnknownTags>, </Theeta>, </Suffixes>, </SingleTagFreq>, </ClassTagFreq>, </FormTagFreq>.
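
The section layout described above can be checked with a short script. The following is a minimal Python sketch, not part of FreeLing: the function name split_sections is hypothetical, and it assumes that each opening and closing tag sits alone on its own line. It only groups the lines found between each pair of tags and does not interpret the contents of any section.

```python
import re

# Section names as listed in this document.
SECTIONS = ("UnknownTags", "Theeta", "Suffixes",
            "SingleTagFreq", "ClassTagFreq", "FormTagFreq")

TAG_RE = re.compile(r"<(/?)(\w+)>")


def split_sections(text):
    """Group the lines of a probabilities file by section.

    Hypothetical helper. Assumes every tag is alone on its line.
    Returns a dict mapping section name to its list of content lines.
    """
    sections = {}
    current = None  # name of the section currently open, if any
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        match = TAG_RE.fullmatch(line)
        if match and match.group(2) in SECTIONS:
            is_closing = match.group(1) == "/"
            name = match.group(2)
            if is_closing:
                if name != current:
                    raise ValueError("unexpected closing tag: " + line)
                current = None
            else:
                if current is not None:
                    raise ValueError("section opened inside " + current)
                current = name
                sections[name] = []
        elif current is not None:
            # Content line: stored as-is, not interpreted.
            sections[current].append(line)
    if current is not None:
        raise ValueError("section never closed: " + current)
    return sections
```

Calling split_sections on the text of a probabilities file and comparing the returned keys against SECTIONS is one way to confirm that all six sections are present and properly closed.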

Lluís Padró 2010-09-02