Page 1 of 2
searching for words with "/" in them
Posted: Fri Jul 17, 1998 2:32 pm
by Thunderstone
Hello again! The search engine is now working beautifully; however, we've
run into an interesting problem. This company has a lot of products with
the "/" in their names (e.g., Window/Washer). However, because of the REX
pattern matcher, I guess the search engine interprets "/" as the beginning
of a regular expression.
I don't think any of the users of this site will want to use the REX
feature. Is there any way of turning it off? Any simple workarounds for
this problem? I could instruct users to use the wildcard *
(Window*Washer) but it would be confusing, since so many documents on this
site do have "/" in them.
Thank you for your help!
Katherine Falk
searching for words with "/" in them
Posted: Fri Jul 17, 1998 2:52 pm
by Thunderstone
=cc=Katherine Falk said:
Searching for the words
window washer
should find those products, which should rank high since the words are
next to each other.
A / in the middle of a term does not begin a REX expression, so
"window/washer" will be treated as a single term. However the index
normally only indexes letters and digits, so you would need to tell it
to also index /. You could do this with:
gw -unindex
gw -k"\alnum{2,30}" -k"\alnum=[\alnum/]{1,29}" -index
This would ensure that searches for "window" or "washer" would still work,
as well as "window/washer".
searching for words with "/" in them
Posted: Tue Oct 02, 2001 1:17 pm
by sduggan
Hello,
When trying to run gw -unindex so I can index "/" for my searches I get the following error message:
" 000 Can't get address for host `napplication/pdf/pdf,pdftotx': 0 - unknown winso
ck error
Visited 0 pages total "
I used: napplication/pdf/pdf,pdftotx to index my pdf's, but seems to be causing a problem when trying to run "-unindex", so what do I have to do to get it to index the "/"
Thanks,
ShaWn
searching for words with "/" in them
Posted: Tue Oct 02, 2001 1:31 pm
by mark
What's the full command line you're using for -unindex? You shouldn't need any options with it except, maybe -d .
searching for words with "/" in them
Posted: Tue Oct 02, 2001 2:57 pm
by sduggan
I typed:
gw -unindex
http://webdev/
And got this for a message:
"No database specified. Use the default (D:/inetpub/millenium/webinator/db)?
(y/n) default is y : You may use "-d-" to skip this question in the future.
I then typed "y" and got:
Deleting search indices
Getting
http://10.100.64.58/robots.txt...Got it...Ok.
Adding todo:
http://webdev/
http://webdev/ is already in the database
Visited 0 pages total
I then typed:
gw -k"\alnum{2,30}" -k"\alnum=[\alnum/]{1,29}" -index
and got:
No database specified. Use the default (D:/inetpub/millenium/webinator/db)?
(y/n) default is y : You may use "-d-" to skip this question in the future.
Visited 0 pages total
I hope this helps.
searching for words with "/" in them
Posted: Tue Oct 02, 2001 4:02 pm
by mark
You shouldn't generally provide a url with -unindex, but it's fairly harmless. Everything you show here indicates that it worked. What's the problem?
searching for words with "/" in them
Posted: Wed Oct 03, 2001 9:13 am
by sduggan
After trying to reindex and getting the messages above, I try to do a search and got:
Warning: Query 'keon' would require linear search
this happened the first time, so I ran -wipe and then re-indexed the site, but then I still couldn't do searches with "/" them, which is what I am trying to accomplish.
If I did a -wipe again and started from scratch, what would be the syntax to type to append to this string:
gw -fasp -n"application/pdf,pdf,pdftotx" http://{sitename}
to reindex my site to be able to do searches with"/" in the search.
Thanks.
searching for words with "/" in them
Posted: Wed Oct 03, 2001 9:54 am
by mark
gw -fasp -n"application/pdf,pdf,pdftotx" -k"\alnum{2,30}" -k"\alnum=[\alnum/]{1,29}" http://{sitename}
searching for words with "/" in them
Posted: Wed Oct 03, 2001 12:13 pm
by sduggan
Mark,
I've done this twice and still any time I do a search, eg. "ace/server" it comes up "No documents matched your query. " but if I type in "ace server" I get documents with "ace/server" in them, but our customers need to be able to type in a search with \ in it.
Any help would be great
Thanks
Shawn
searching for words with "/" in them
Posted: Wed Oct 03, 2001 12:49 pm
by mark
Oops, this
-k"\alnum=[\alnum/]{1,29}"
should be
-k">>\alnum=[\alnum/]{1,29}"