This is a short htmlized version of the
readme file from the
ispelleo.tar.bz2 distribution tarball.
|Content:||Information about Esperanto dictionary for the Ispell speller|
|Created:||1997-08-30 by Sergio Pokrovskij <sergio.pokrovskij(ĉe)gmail.com>|
Copyright © 1997, 1998, 2003, 2006, 2008 by Sergio Pokrovskij
This dictionary package is available on the terms of GNU General Public License (Free Software Foundation, 675 Mass Ave, Cambridge, MA 02139, USA).
Here is an Esperanto dictionary, compiled by Sergio Pokrovskij for the version 3 of ispell.
You will first need to get ispell 3.0+; it is available from
here; make sure that in its
NO8BIT thing is commented out and
MASKBITS = 64. To see the
options ispell has been compiled with please run
$ ispell -vv
Several encodings used with Esperanto text are supported:
Unicode is available in UTF-8 encoding, which is preferred for the Unices.
^cirka^u-style: e^ho^san^go ^ciu^ja^ude. Presently this is used as the reference representation, mainly because it is unambiguous (cf names like Michaux); and
cxirkaux-style, which is also convenient for lexicographical ordering and thus is used in the dictionaries; besides, it uses letters only, and that makes it suitable for variable names in computer programs.
2. Go to the root directory
ispell-eo (where this
$ make first
make) in order to check your ispell program.
Examine the output, e.g. do you have the permissions to write the hash file(s) at the install phase?
4. If everything is OK, say
$ make eo
$ make esperanto
$ make all
You'll get a few warnings of from buildhash, like this one:
eo.aff line 218: Flag must be alphabetic
Just ignore them.
$ make install
After that you can call
$ ispell -d eo filename
You'll need emacs to produce the dictionary for myspell which works with Open Office. Customize the word provision as described in “Customized Build” (except the buildhash step); and then say
$ make OO
In order to enable selective construction of dictionaries, some
entries in the source dictionary
./src/vortoj.l3 are marked with
keywords indicating the special field they belong to:
||archaic words, like ĥina (= ĉina) or malkompreni (= miskompreni)|
||a rare botanic word|
||vocabulary of the Esperanto Movement (of Esperantujo)|
||countries and ethnography|
||some computer-science terminology according to the Komputada Leksikono|
||redundant words, which are used by some esperantists, though they are less precise and unnecessarily complicate the language; e.g. olda (maljuna or malnova), mava (= malbona)|
||given names and names of important personalities (e.g. Petro, Zamenhof ...)|
||the words specific to my idiolect|
||rare words which may coincide with a misspelling of a more frequent word; e.g. ajuna, komanditi, liona|
||variant which I do not use but which is frequent enough (e.g. kemio, tekniko opposed to ĥemio and teĥniko)|
$ grep '#mav' ./src/vortoj.l3 | lessin order to see if you feel like me about them; you can either remove all of them from the target dictionary, or remove the
#mavmark from those you do use and like; the default setting in the
short_list = komp,etn,Eujo,pers,mll eo_list = $(short_list),drv esperanto_list = $(short_list),arhx,mav,rarUnless included in the custom list (like
eo_list), a marked word is considered as a special one and is excluded from the build; the above custom lists thus specify “positive criteria”. But some words have several marks (e.g. a word may be “ethnic” and “obsolete” or dangerously close to a misspelling of a frequent word). Thus a “negative” filtration is available via the
$(sen)macro; you can specify it in
src/Makefile, or in the command-line:
$ make eo sen="arhx,rar"
In this way some otherwise eligible words from the
mll categories shall be discarded, if they are also obsolete or rare.
You can use ispell in a stand-alone mode; type
$ ispell -d eo filename
or you may prefer to customize your emacs; e.g. copy
./emacs/ispell-ini.el from this distribution into your site-lisp (or
somewhere else on your emacs load-path), and put this into your .emacs
In order to get a list of all misspelled or unknown words from a text in the Latin-3 encoding you could say (in Linux):
export LC_ALL=eo_XX.ISO-8859-3 ispell -d esperanto -T .l3 -l < FILENAME | sort -u
Some more comments are in
legu-min.l3 (in esperanto).
This probably is no longer required for emacs-21 or newer.
I enclose the
emacs/ispell.el file, which is a modified version of
ispell.el which comes with the Emacs-20 distribution. There are two