Form Dictionary File

The form dictionary is either a plain text file, or a BerkeleyDB-indexed file. The dictionary module relies on the extension to decide which format to expect (.src for plain text, .db for indexed files)

The plain text dictionary file (.src) format is described below. This file can be directly passed to the constructor of the dictionary search module.

Indexed Berkeley-DB files (.db) may be created with the indexdict program provided with FreeLing, which is called with the command:

   indexdict indexed-dict-name.db  <source-dict.src
Where source-dict.src is a plain text dictionary, and indexed-dict-name.db is the resulting indexed file, which can be directly passed to the constructor of the dictionary search module.

See the (very simple) source code in src/main/utilities/indexdict.cc if you're interested on how it is indexed.



Subsections

Lluís Padró 2010-09-02