diacritical marks interaction with thesaurus

Post Reply
rjshelq
Posts: 75
Joined: Thu Nov 17, 2005 3:25 pm

diacritical marks interaction with thesaurus

Post by rjshelq »

Hi,

When a word appears in my custom thesaurus, the search fails to ignore diacritical marks.

For example, these words all refer to the same Persian word:

saki, sâkî , sākī, saqi, sâqî, sāqī

If I search for "saki" (and if saki is not in my custom thesaurus), then I do get search hits on saki, sâkî , sākī as expected, properly ignoring diacritical marks.

However, since I also want to get hits for the variation transliterated with a q (saqi) instead of a k (saki), I added saki,saqi to my custom thesaurus, expecting that a search for saki would then return hits for all variations such as saki, sâkî , sākī, saqi, sâqî, sāqī and any other diacritical marks.

Unfortunately, as soon as I added saki,saqi to the custom thesaurus, the searches stopped being insensitive to diacritical marks, and now if I search for "saki" I only get saki and saqi, but fail to get any hits on other variants with diacritical marks.

Is there a way for searches to maintain diacritical insensitivity for words which appear in my custom thesaurus?
User avatar
Kai
Site Admin
Posts: 1271
Joined: Tue Apr 25, 2000 1:27 pm

diacritical marks interaction with thesaurus

Post by Kai »

It is not currently possible, as textsearchmode (the diacritical-mark ignore/respect setting) is not yet implemented with the thesaurus, only straight single-term sets (no equivalences). This is planned to be fixed in a future release, but it is probably several months away.
josmani
Posts: 53
Joined: Tue Jun 03, 2003 3:38 am

diacritical marks interaction with thesaurus

Post by josmani »

I've just noticed we have the same issue on full Texis 6.01 with thesaurus.

Any workarounds you can suggest?
Post Reply