Hi,
We are using commercial webinator to index several thousands of websites in our link collection. After running gw on each website, we do
> gw -wipetodo -noindex
This is the snippet from gw.log:
_________________________________
2001/04/29 02:40:01 End (6417) Visited 8 pages total
2001/04/29 02:40:01 Begin (18305) /home/httpd/html/BIOZAK/webinator/bin/gw -d/home/httpd/html/BIOZAK/webinator/db -wipetodo -noindex
2001/04/29 02:52:30 End (18305) Visited 0 pages total
-----------------------------
This is gw -wipetodo -noindex taking 12 minutes 29 seconds on 700MHz Pentium-III with 792MB RAM. CPU utilization was virtually 0% so I am not really sure where it spends all this time - it shouldn't be doing any networking while it is wipetodo-ing, right?.
From the log file, you can see that the previous site had only 8 pages to contribute to database -> todo list for it can't be too large.
Any ideas?
Also, there is another peculiarity we noticed:
We run walker on batches of 200-500 sites at a time. At the starting of one of the batches it gave us
_______________________________
2001/04/29 02:25:59 Creating Unique index on Non-unique data
----------------------------
even though we never run gw without -unique on this database.
How could this happen?
Thanks for your support. Please tell us if you need any additional information.
Valery
We are using commercial webinator to index several thousands of websites in our link collection. After running gw on each website, we do
> gw -wipetodo -noindex
This is the snippet from gw.log:
_________________________________
2001/04/29 02:40:01 End (6417) Visited 8 pages total
2001/04/29 02:40:01 Begin (18305) /home/httpd/html/BIOZAK/webinator/bin/gw -d/home/httpd/html/BIOZAK/webinator/db -wipetodo -noindex
2001/04/29 02:52:30 End (18305) Visited 0 pages total
-----------------------------
This is gw -wipetodo -noindex taking 12 minutes 29 seconds on 700MHz Pentium-III with 792MB RAM. CPU utilization was virtually 0% so I am not really sure where it spends all this time - it shouldn't be doing any networking while it is wipetodo-ing, right?.
From the log file, you can see that the previous site had only 8 pages to contribute to database -> todo list for it can't be too large.
Any ideas?
Also, there is another peculiarity we noticed:
We run walker on batches of 200-500 sites at a time. At the starting of one of the batches it gave us
_______________________________
2001/04/29 02:25:59 Creating Unique index on Non-unique data
----------------------------
even though we never run gw without -unique on this database.
How could this happen?
Thanks for your support. Please tell us if you need any additional information.
Valery