Andy Savchenkov said:
You will need to use the -k option to gw when you index to include these
other characters in the index. Normally gw only indexes the letters used
in English. You will need to do something like the following:
gw -d- -unindex
gw -d- -k"[\alpha\x80-\xff]{2,99}" -index
to create an index which includes all the characters with the 8-bit set.
If you only want the characters that are typically used to display then
you would change the \x80 to \xa0. I am unsure as to which of the characters
are used in Cyrillic, so I'm not sure if \xa0 would be sufficient.
John Turnbull
-------------
Thunderstone Technical Support