problems when indexing with full charset

Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Post by Thunderstone »



Dear Webinator!

When indexing with plain -index, all is OK.
When indexing with a fuller charset setting, gw crashes :(

Here is an example:
----------------------------------
D:\WEB\search\webinator>gw -dbinet -k"[\alpha\x80-\xff]{2,99}" -index
Indexing new pages
100 Unable to determine free space. Will proceed assuming there is enough.
000 Index D:\WEB\search\webinator\binet\xhtmlbod should exist, but does not
000 Got signal 11 - quitting now

D:\WEB\search\webinator>gw -dbinet -dropindex -noindex
Deleting all indices

D:\WEB\search\webinator>gw -dbinet -index
Indexing new pages
----------------------------------------

edmunds
www.search.lv


Post by Thunderstone »



INTERNET CLUB said:

It does not normally crash with the addition of the extra characters to
the index, and there are many people successfully using it.


This message is probably indicative of the problem. Can you let us know
which operating system and version you are using, how much free disk space
you have on the drive that you are using for the database, as well as which
filesystem you are using.



Post by Thunderstone »





This message is probably indicative of the problem. Can you let us know
which operating system and version you are using, how much free disk space
you have on the drive that you are using for the database, as well as which
filesystem you are using.


We use NT 4.0, Service Pack 2, IIS 3.0. The database is about 15 MB, free space on the NTFS disk is about 800 MB, and the machine has 128 MB RAM.


It looks like gw crashes if specific characters are present, possibly from KOI-8, windows-1251, or windows-1257.
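
To see which bytes the \x80-\xff part of the -k expression is meant to cover, here is a minimal Python sketch. It is only an approximation and not how gw itself works: it assumes \alpha means ASCII letters, uses Python's own regular expressions rather than gw's expression syntax, and the helper names index_words and high_byte_count are made up for illustration. It encodes sample words in the charsets mentioned above and reports which byte runs the approximated expression would pick up as index words.

```python
import re

# Rough Python approximation of the -k expression from the post.
# Assumption: \alpha is treated as ASCII letters only; the expression
# syntax that gw really uses may behave differently.
WORD = re.compile(rb"[A-Za-z\x80-\xff]{2,99}")


def index_words(text, encoding):
    """Encode text in a single-byte charset and return the byte runs
    that the approximated expression would treat as index words."""
    return WORD.findall(text.encode(encoding))


def high_byte_count(text, encoding):
    """Count the bytes in the 0x80-0xff range after encoding."""
    return sum(b >= 0x80 for b in text.encode(encoding))


if __name__ == "__main__":
    # Sample words: Russian in KOI8-R and windows-1251, Latvian in windows-1257.
    samples = [
        ("поиск", "koi8_r"),
        ("поиск", "cp1251"),
        ("meklēšana", "cp1257"),
    ]
    for text, enc in samples:
        print(enc, index_words(text, enc), high_byte_count(text, enc))
```

Without the \x80-\xff range, words in these charsets would be split at every non-ASCII letter, which is why the wider expression is needed for such pages.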

edmunds