version upgrade????

anixyz
Posts: 62
Joined: Fri Jul 20, 2001 11:43 am

version upgrade????

Post by anixyz »

we have the following version of the Webinator.

Webinator WWW Site Indexer Version 2.5 (Commercial)
Copyright(c) 1995,1996,1997 Thunderstone EPI Inc.
Release: 19980928

we are running into some problems like...,
1.Multiple search results due to mixed case HREFs.
2.Search doesn't bring up most relevant results first.
3.Document parts like selection boxes cause false hits on search engine.

Do we need to go for an upgrade of the product to solve these?, If so
how do we go about it.

Thanks
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

version upgrade????

Post by mark »

You probably don't need an upgrade for those issues yet.
1. DOS based webservers ignore case on urls thereby allowing sloppy web authoring. Use the -unique option to gw to prevent these duplicates.
2. The default search does a ranked (likep) search and does bring up the most relevant results first. See if you've modified the original script to work differently. Also you may adjust the rank control knobs on the advanced options page to set what you think the importance of each setting is.
3. Webinator 2 doesn't provide for direct exclusion of such items. You could remove them after the walk with something like:
texis -d YOURDB -s "update html set Body=sandr('>>YOUR BOX TEXT','',Body)"
gw -dYOURDB -index
Or use the scripted walker. See http://thunderstone.master.com/texis/ma ... 3ac3633114
The upcoming version 4 release will provide easier support for removing such things.