Conditional crawling

valery
Posts: 26
Joined: Thu Mar 15, 2001 9:24 pm

Conditional crawling

Post by valery »

Hi,

I have a couple of questions about commercial Webinator I use:

1. Would it be possible to restrict crawling by keywords, e.g. index only pages where specified keyword appears?

2. During the search, I want only to search a subset of domains indexed. I know that I can do this with something like "Url matches 'www.mysite.com%'. Is
there more scalable method (a subset in my case will sometimes be composed of ~10,000 domains)? Is it possible to add a field 'subset_index' into html table?

Thanks,
Valery.
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Conditional crawling

Post by mark »

1. gw won't do that. You could either delete unwanted pages after the walk. Or you could use the scripted walker and customize it to only store the desired pages.

2. You would need a full Texis license for that level of control over the database.
valery
Posts: 26
Joined: Thu Mar 15, 2001 9:24 pm

Conditional crawling

Post by valery »

> Or you could use the
> scripted walker and customize it to only store the desired pages.
What is the 'scripted walker' and where can I get it?

> 2. You would need a full Texis license for that level of control over the
> database.
Could you give us a quote on that?

Thanks,
Valery.
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Conditional crawling

Post by mark »

The scripted walker has been mentioned numerous times recently on this board. Do a search for "scripted walker" to get the url.

Please use the "Contact Us" link on the left side menu to get Texis pricing info.