A sample configuration file follows. This is only a sample, and probably won't work if you use it as is. You can start using freeling with the default configuration files which -after installation- are located in /usr/local/share/FreeLing/config (note than prefix /usr/local may differ if you specified an alternative location when installing FreeLing. If you installed from a binary .deb package), it will be at /usr/share/FreeLing/config.
You can use those files as a starting point to customize one configuration file to suit your needs.
Note that file paths in the sample configuration file contain
$FREELINGSHARE
, which is supposed to be an environment
variable. If this variable is not defined, the analyzer will
abort, complaining about not finding the files.
If you use the analyze script, it will define the variable for you as /usr/local/share/Freeling (or the right installation path), unless you define it to point somewhere else.
You can also adjust your configuration files to use normal paths for the files (either relative or absolute) instead of using variables.
# ---- sample configuration file for Spanish analyzer #### General options Lang=es #### Trace options. Only effective if we have compiled with -DVERBOSE TraceLevel=0 TraceModule=0x0000 ## Options to control the applied modules. The input may be partially ## processed, or not a full analysis may me wanted. The specific ## formats are a choice of the main program using the library, as well ## as the responsability of calling only the required modules. InputFormat=plain OutputFormat=morfo # consider each newline as a sentence end AlwaysFlush=no #### Tokenizer options TokenizerFile="$FREELINGSHARE/es/tokenizer.dat" #### Splitter options SplitterFile="$FREELINGSHARE/es/splitter.dat" #### Morfo options AffixAnalysis=yes MultiwordsDetection=yes NumbersDetection=yes PunctuationDetection=yes DatesDetection=yes QuantitiesDetection=yes DictionarySearch=yes ProbabilityAssignment=yes DecimalPoint="," ThousandPoint="." LocutionsFile=$FREELINGSHARE/es/locucions.dat QuantitiesFile=$FREELINGSHARE/es/quantities.dat AffixFile=$FREELINGSHARE/es/afixos.dat ProbabilityFile=$FREELINGSHARE/es/probabilitats.dat DictionaryFile=$FREELINGSHARE/es/maco.db PunctuationFile=$FREELINGSHARE/common/punct.dat ProbabilityThreshold=0.001 #NER options NERecognition=basic NPDataFile=$FREELINGSHARE/es/np.dat ## --- comment lines above and uncomment those below, if you want ## --- a better NE recognizer (higer accuracy, lower speed) #NERecognition=bio #NPDataFile=$FREELINGSHARE/es/ner/ner.dat ## NEC options NEClassification=no NECFilePrefix=$FREELINGSHARE/es/nec/nec ## Sense annotation options (none,all,mfs) SenseAnnotation=none SenseFile=$FREELINGSHARE/es/senses16.db DuplicateAnalysis=false UKBRelations=$FREELINGSHARE/common/wn16-ukb.bin UKBDictionary=$FREELINGSHARE/es/senses16.ukb UKBEpsilon=0.03 UKBMaxIter=10 #### Tagger options Tagger=hmm #Tagger=relax TaggerHMMFile=$FREELINGSHARE/es/tagger.dat TaggerRelaxFile=$FREELINGSHARE/es/constr_gram.dat TaggerRelaxMaxIter=500 TaggerRelaxScaleFactor=670.0 TaggerRelaxEpsilon=0.001 TaggerRetokenize=yes TaggerForceSelect=retok #### Parser options GrammarFile=$FREELINGSHARE/es/grammar-dep.dat #### Dependence Parser options DepParser=txala DepTxalaFile=$FREELINGSHARE/es/dep/dependences.dat DepMaltFile=$FREELINGSHARE/es/malt/malt.dat #### Coreference Solver options CoreferenceResolution=no CorefFile=$FREELINGSHARE/es/coref/coref.dat
Lluís Padró 2010-09-02