Sample Configuration File

A sample configuration file follows. It is only a sample, and probably won't work if you use it as is. You can start using FreeLing with the default configuration files, which after installation are located in /usr/local/share/FreeLing/config. Note that the prefix /usr/local may differ if you specified an alternative location when installing FreeLing. If you installed from a binary .deb package, the files will be in /usr/share/FreeLing/config.

You can use those files as a starting point to customize one configuration file to suit your needs.

Note that file paths in the sample configuration file contain $FREELINGSHARE, which is supposed to be an environment variable. If this variable is not defined, the analyzer will abort, complaining about not finding the files.
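
The missing-variable problem can be caught before launching the analyzer. The following is a minimal sketch in Python and is not part of FreeLing: the helper name unresolved_variables is hypothetical, and the only assumption is that variables appear in the $NAME form used in the sample file.

```python
import os
import re

# Matches references such as $FREELINGSHARE (the form used in the sample file).
VAR_PATTERN = re.compile(r"\$([A-Za-z_][A-Za-z0-9_]*)")


def unresolved_variables(config_text, env=None):
    """Return the sorted names of $VARIABLES used in config_text but absent from env.

    Hypothetical helper for illustration; FreeLing itself does not provide it.
    """
    env = os.environ if env is None else env
    names = set()
    for line in config_text.splitlines():
        # Comment lines (e.g. commented-out alternative options) are skipped.
        if line.lstrip().startswith("#"):
            continue
        names.update(VAR_PATTERN.findall(line))
    return sorted(name for name in names if name not in env)


if __name__ == "__main__":
    sample = 'TokenizerFile="$FREELINGSHARE/es/tokenizer.dat"\n'
    missing = unresolved_variables(sample)
    if missing:
        print("Undefined variables:", ", ".join(missing))
    else:
        print("All variables are defined.")
```

An empty result means every variable referenced in an active (uncommented) option is defined in the environment. It does not check that the files themselves exist.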

If you use the analyze script, it will define the variable for you as /usr/local/share/FreeLing (or the right installation path), unless you have already defined it to point somewhere else.

You can also adjust your configuration files to use normal paths for the files (either relative or absolute) instead of using variables.
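
As an illustration of that adjustment, the sketch below rewrites configuration text so that it contains literal paths. It is an assumption-laden example, not a FreeLing tool: the function name use_literal_paths is hypothetical, and the directory passed in the usage example is just one of the installation paths mentioned above.

```python
def use_literal_paths(config_text, share_dir, variable="FREELINGSHARE"):
    """Replace every $variable reference in config_text with share_dir.

    Hypothetical helper for illustration. A trailing slash on share_dir is
    dropped so that "$FREELINGSHARE/es" does not become "...//es".
    Comment lines are rewritten too, which keeps commented-out options usable.
    """
    return config_text.replace("$" + variable, share_dir.rstrip("/"))


if __name__ == "__main__":
    sample = 'SplitterFile="$FREELINGSHARE/es/splitter.dat"\n'
    print(use_literal_paths(sample, "/usr/share/FreeLing"), end="")
```

A file rewritten this way no longer depends on the environment, at the cost of having to be edited again if the data files are moved.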

# ---- sample configuration file for Spanish analyzer

#### General options 
Lang=es

#### Trace options. Only effective if we have compiled with -DVERBOSE
TraceLevel=0
TraceModule=0x0000

## Options to control the applied modules. The input may be partially
## processed, or a full analysis may not be wanted. The specific
## formats are a choice of the main program using the library, as is
## the responsibility of calling only the required modules.
InputFormat=plain
OutputFormat=morfo

# consider each newline as a sentence end
AlwaysFlush=no

#### Tokenizer options
TokenizerFile="$FREELINGSHARE/es/tokenizer.dat"

#### Splitter options
SplitterFile="$FREELINGSHARE/es/splitter.dat"

#### Morfo options
AffixAnalysis=yes
MultiwordsDetection=yes
NumbersDetection=yes
PunctuationDetection=yes
DatesDetection=yes
QuantitiesDetection=yes
DictionarySearch=yes
ProbabilityAssignment=yes
DecimalPoint=","
ThousandPoint="."
LocutionsFile=$FREELINGSHARE/es/locucions.dat 
QuantitiesFile=$FREELINGSHARE/es/quantities.dat
AffixFile=$FREELINGSHARE/es/afixos.dat
ProbabilityFile=$FREELINGSHARE/es/probabilitats.dat
DictionaryFile=$FREELINGSHARE/es/maco.db
PunctuationFile=$FREELINGSHARE/common/punct.dat
ProbabilityThreshold=0.001
## NER options
NERecognition=basic
NPDataFile=$FREELINGSHARE/es/np.dat
## --- comment lines above and uncomment those below, if you want 
## --- a better NE recognizer (higher accuracy, lower speed)
#NERecognition=bio
#NPDataFile=$FREELINGSHARE/es/ner/ner.dat

## NEC options
NEClassification=no
NECFilePrefix=$FREELINGSHARE/es/nec/nec

## Sense annotation options (none,all,mfs)
SenseAnnotation=none
SenseFile=$FREELINGSHARE/es/senses16.db
DuplicateAnalysis=false
UKBRelations=$FREELINGSHARE/common/wn16-ukb.bin
UKBDictionary=$FREELINGSHARE/es/senses16.ukb
UKBEpsilon=0.03
UKBMaxIter=10

#### Tagger options
Tagger=hmm
#Tagger=relax
TaggerHMMFile=$FREELINGSHARE/es/tagger.dat
TaggerRelaxFile=$FREELINGSHARE/es/constr_gram.dat
TaggerRelaxMaxIter=500
TaggerRelaxScaleFactor=670.0
TaggerRelaxEpsilon=0.001
TaggerRetokenize=yes
TaggerForceSelect=retok

#### Parser options
GrammarFile=$FREELINGSHARE/es/grammar-dep.dat

#### Dependence Parser options
DepParser=txala
DepTxalaFile=$FREELINGSHARE/es/dep/dependences.dat
DepMaltFile=$FREELINGSHARE/es/malt/malt.dat

#### Coreference Solver options
CoreferenceResolution=no
CorefFile=$FREELINGSHARE/es/coref/coref.dat
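
To inspect a file like the one above from a script, the Option=value layout can be read with a few lines of Python. This is only a sketch based on what the sample shows (lines starting with # are comments, values may be wrapped in double quotes). How FreeLing itself parses the file is not described here, and the function name parse_config is hypothetical.

```python
def parse_config(text):
    """Parse 'Option=value' lines into a dict, skipping blank and '#' lines.

    Hypothetical helper for illustration; the rules are inferred from the
    sample file only.
    """
    options = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition("=")
        if not sep:
            continue  # not an option line; ignored in this sketch
        value = value.strip()
        # Drop one pair of surrounding double quotes, as in DecimalPoint=","
        if len(value) >= 2 and value[0] == '"' and value[-1] == '"':
            value = value[1:-1]
        options[key.strip()] = value
    return options


if __name__ == "__main__":
    sample = 'Lang=es\n#Tagger=relax\nTagger=hmm\nDecimalPoint=","\n'
    for name, value in parse_config(sample).items():
        print(name, "->", value)
```

Values are kept as strings, so yes/no options and numeric thresholds still need to be interpreted by the caller.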

Lluís Padró 2010-09-02