Site Indexing Problem (dup), SCO Webinator

Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

Post by Thunderstone »



It seems my last message took a wrong turn somewhere. I'll repeat:
I'm hoping someone can help.
I'm having difficulty with using 'gw' to index my site.
Running SCO OpenServer Release 5 unix. Downloaded the latest webinator
for SCO (ftp.thunderstone.com site, /prod/webinator/sco/sco_webinator.tar.Z
compressed archive, 3739544 bytes, dated 24 Sep 96). Unpacked and installed
it without any problems into my Web server's (Apache V1.0.3) document root
/www/webinator/... . I made sure that /www/webinator/bin/gw and
/usr/local/etc/httpd/cgi-bin/webinator were both running as, and setuid/setgid
to, the uid,gid that the Web server runs as (nouser,nogroup).
When I try to access one of my sites (http://www.nurse.net/index.html),
I get an error from gw about "Interrupted system call ... Can't open
connection to ...: timeout"; see captured output below. I've tried several
variations and still get the same error. I have tried (1) the alternate
webinator for SCO ODT 3, (2) indexing a second/separate site of mine, (3)
making all the webinator files root owned/privileged, (4) giving everyone
read/write permissions over all the webinator files.
What the heck am I doing wrong?

I first create the nursenet database and log file (gw -d...
-l... -create), which goes fine. Then I try to run
"/www/webinator/bin/gw -d/www/webinator/db/nursenet.db
-l/www/webinator/db/nursenet.log -r -v4 http://www.nurse.net/index.html"

I get the following error message:
-----------------------------------------------------------------------
175 Table hosts not found in data dictionary
115 No such table: hosts
000 SQLExecute() failed with -1
Adding todo: http://www.nurse.net/
http://www.nurse.net/
0: TotLinks: 0, Links: 0/ 0, Good: 0, New: 0 Retrieving
0: TotLinks: 0, Links: 0/ 0, Good: 0, New: 0
000 connect: Interrupted system call
0: TotLinks: 0, Links: 0/ 0, Good: 0, New: 0
002 Can't open connection to www.nurse.net:80: timeout
0: TotLinks: 0, Links: 0/ 0, Good: 0, New: 0
Visited 1 pages
Visited 0 pages
Visited 1 pages total
Remember to run "/www/webinator/bin/gw -index" to update the index when
you finish a batch
---------------------------------------------------------------


If it is of any use, the following is my webinator directory listing:
--------------------------------------------------------------
webinator:
total 26
drwxrwxrwx 2 nouser nogroup 512 Dec 2 23:02 .master
-rw-rw-r-- 1 nouser nogroup 74 Feb 13 1996 0.gif
-rw-rw-r-- 1 nouser nogroup 71 Feb 13 1996 1.gif
drwxr-xr-x 2 nouser nogroup 512 Dec 2 23:02 bin
-rw-rw-r-- 1 nouser nogroup 142 Feb 13 1996 ctx.gif
drwxrwxrwx 3 nouser nogroup 512 Dec 2 23:05 db
-rw-rw-r-- 1 nouser nogroup 105 Sep 7 1995 defindex.html
-rw-rw-r-- 1 nouser nogroup 105 Dec 2 23:02 index.html
-rw-rw-r-- 1 nouser nogroup 138 Feb 13 1996 lnk.gif
-rw-rw-r-- 1 nouser nogroup 139 Feb 13 1996 mlt.gif
-rw-rw-r-- 1 nouser nogroup 102 Feb 13 1996 nolnk.gif
-rw-rw-r-- 1 nouser nogroup 378 Feb 13 1996 wstsbut.gif

webinator/.master:
total 4568
-rw-r--r-- 1 nouser nogroup 130 Jun 10 11:57 .htaccess
-rw------- 1 nouser nogroup 16040 Jun 10 11:57 SYSCOLUMNS.tbl
-rw------- 1 nouser nogroup 3974 Jun 10 11:57 SYSINDEX.tbl
-rw-rw-rw- 1 nouser nogroup 8 Jun 10 11:57 SYSLOCKS.SEQ
-rw------- 1 nouser nogroup 3314 Jun 10 11:57 SYSMETAINDEX.tbl
-rw-r----- 1 nouser nogroup 2259771 Apr 5 1996 SYSOBJECTS.tbl
-rw------- 1 nouser nogroup 4224 Jun 10 11:57 SYSPERMS.tbl
-rw------- 1 nouser nogroup 7736 Jun 10 11:57 SYSTABLES.tbl
-rw------- 1 nouser nogroup 3314 Jun 10 11:57 SYSTRIG.tbl
-rw------- 1 nouser nogroup 3438 Jun 10 11:57 SYSUSERS.tbl
-rw------- 1 nouser nogroup 3314 Jun 10 11:57 error.tbl
-rw-r--r-- 1 nouser nogroup 103 Jun 10 11:57 gw.log
-rw------- 1 nouser nogroup 3314 Jun 10 11:57 html.tbl
-rw------- 1 nouser nogroup 3314 Jun 10 11:57 refs.tbl
-rw------- 1 nouser nogroup 3314 Jun 10 11:57 todo.tbl
-rw------- 1 nouser nogroup 152 Jun 10 11:57 xhtmlurl.btr
-rw------- 1 nouser nogroup 152 Jun 10 11:57 xtodourl.btr

webinator/bin:
total 263
Post by Thunderstone »



..
..

The timeout is not directly related to Webinator, but to the response speed
of the site you are trying to index. The connection between the machine
you are running gw on and www.nurse.net is apparently slow. You can increase
the time gw will wait for a response with the -t option (e.g. -t60).
See http://www.thunderstone.com/gwman/node22.html .
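
To make the suggestion concrete, here is a minimal Python sketch that assembles the command line from the original post with the timeout option added. The paths, flags, and URL are the ones quoted in this thread. The helper name gw_fetch_args and the position of -t among the other options are assumptions, so check the manual page linked above before relying on it.

```python
import shlex

GW = "/www/webinator/bin/gw"  # path taken from the original post


def gw_fetch_args(db, log, url, timeout_secs=60, verbosity=4):
    """Build the gw argument list from the post, plus the -t timeout option.

    Hypothetical helper: it only formats arguments and does not run gw.
    """
    return [
        GW,
        f"-d{db}",             # database, written as -d<path> in the post
        f"-l{log}",            # log file, written as -l<path> in the post
        "-r",
        f"-v{verbosity}",
        f"-t{timeout_secs}",   # wait longer for a response, e.g. -t60
        url,
    ]


args = gw_fetch_args(
    "/www/webinator/db/nursenet.db",
    "/www/webinator/db/nursenet.log",
    "http://www.nurse.net/index.html",
)
print(shlex.join(args))
```

The printed line can be pasted into a shell on the indexing machine. If 60 seconds is still not enough, raise timeout_secs and try again.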

Also make sure the machine you are running gw on can reach the machine
you are trying to index. From the webinator/bin directory, try:
./geturl http://www.nurse.net/index.html
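
If geturl also hangs, it can help to see how the connection attempt ends. The following is a rough Python sketch, not part of Webinator, that tries a plain TCP connection to port 80 and reports whether it connected, timed out, or was refused. The function name check_tcp, its return strings, and the 10-second default are made up for illustration.

```python
import socket


def check_tcp(host, port=80, timeout_secs=10.0):
    """Try to open a TCP connection and report how the attempt ended."""
    try:
        # create_connection resolves the host name and connects,
        # giving up after timeout_secs seconds.
        with socket.create_connection((host, port), timeout=timeout_secs):
            return "connected"
    except socket.timeout:
        return "timeout"          # no answer within timeout_secs
    except ConnectionRefusedError:
        return "refused"          # host reachable, nothing listening on port
    except socket.gaierror:
        return "dns-failure"      # host name could not be resolved
    except OSError as exc:
        return f"error: {exc}"    # any other network-level failure
```

For example, if check_tcp("www.nurse.net") run on the indexing machine returns "timeout", that would point at the network path between the two machines (routing, a firewall, or a slow link) rather than at gw itself.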