Re-indexing Question

Post Reply
User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Re-indexing Question

Post by Thunderstone »



Now that I have reindexed the site and found an elegant search at my
fingertips, the updated pages since the index are not getting reindexed when
I do a:

Webinator WWW Site Indexer Version 2.5 (Free)

gw rewalk myurl

or

gw -index -force myurl

doesn't appear to get my new pages in the database. i'm using the default
DB.

I just want to reindex a page or the entire site. preferably the former.

thanks in advance!
Chris Y







User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Re-indexing Question

Post by Thunderstone »



You left the leading - off of -rewalk. Also, -rewalk does not accept
urls or other options. It rewalks using previous urls and options.

Options are case sensitive. Use:
gw -Force myurl
You don't need to say -index . It's automatic.



User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Re-indexing Question

Post by Thunderstone »



I have built an extensive database of very specific parts of numerous
websites. All of my command lines have been either gw -a .... or gw -D1 ....
After each gw -D1 ... entry I used the following lines so as to not clutter
up my database with unwanted urls. gw -s "delete from todo" and
gw -s "delete from options where Name='URL'"

If I want to refresh my database once a week to make it current but ensure
that
I don't get all those secondary url's I was so careful to delete out after
each -D1 entry, what specific command line would I use?

Thanks.



User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Re-indexing Question

Post by Thunderstone »



So you want to just refetch what's there and nothing new.?

Use the -e"-1 hour" option to refetch any pages fetched more
than an hour ago and the -a option to prevent walking of new links.

If that's not what you want, you'll probably have to repeat the sequence
of commands on a clean database. You can find what you did in gw.log.



Post Reply