#--
# README - Important
# $Id: README,v 1.2 2003/04/08 17:09:54 elzubeir Exp $
#--

The datasets in this directory are all taken from the Buckwalter Morphological
Analyzer. These files are covered under the GNU General Public License
which is included in the file 'gpl.txt'.

The contents were converted to UTF-8 encoding (from its original
transliterated form) in order for the Duali spellchecker to make use of. 
Further, the dict2db.py Python script converts them to bsddb hash database
files. The accompanying script is also under the GPL.

You simply need to run it as follows:

$ ./dict2db.py

This will create the following files:

  - stems.db
  - prefixes.db
  - suffixes.db

Those files need to be stored somewhere accessible to the Duali spellchecker
which will make use of them (along with the other 3 files):

  - tableab
  - tableac
  - tablebc

The original files, 'stems', 'prefixes' and 'suffixes' can be removed, as they
will no longer be of any use for the Duali spellchecker. Refer to the
Duali INSTALL file for more information on where to place them.

Unfortunately, this cannot be directly bundled with the Duali spellchecker
due to the cancerous nature of the GPL. Duali itself is written under the
BSD license.
