indexing subdirectories only

User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

indexing subdirectories only

Post by Thunderstone »



I want to set up a search engine separate from my main one that will only
search in one folder and down, without going back up. I looked at the
documentation on your web site, but I must be doing something incorrectly.
The following is what I get when I attempt to index only the "it" directory
and it's subdirectories.

tars% gw -d/www/html-docs/webinator/it -jhttp://www.rollins.edu/it/
Visited 0 pages total


Thank you,

Nancy



User avatar
Thunderstone
Site Admin
Posts: 2504
Joined: Wed Jun 07, 2000 6:20 pm

indexing subdirectories only

Post by Thunderstone »

kgrayburn
Posts: 7
Joined: Wed Jun 06, 2001 8:13 am

indexing subdirectories only

Post by kgrayburn »

User avatar
John
Site Admin
Posts: 2623
Joined: Mon Apr 24, 2000 3:18 pm
Location: Cleveland, OH

indexing subdirectories only

Post by John »

That is the correct syntax, as long as you have some links from /that/ to the /it/ directory. Otherwise you may want to use a url list file, or crawl each directory separately into the same database.
John Turnbull
Thunderstone Software
marsh
Posts: 6
Joined: Mon Oct 01, 2001 6:21 pm

indexing subdirectories only

Post by marsh »

I am trying to index a one subdirectory along with all of its contents which include other subdirectories.

I can index the subdirectiory by using the command

gw -d/whatever -jhttp://www.server.com/subdir/

When I do this the subdirectory "subdir" is the only dierctory indexed and the files located in it get indexed also but the other subdirectories within it do not get indexed. How do I get the subdirectiories within "subdir" to be indexed as well??

Marshall
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

indexing subdirectories only

Post by mark »

I assume you really meant:
gw -d/whatever -jhttp://www.server.com/subdir/ http://www.server.com/subdir/

The subdirectories have to be linked into the pages it walked. If there are extensions other than .htm .html or .txt you will also need to specify them with the -f option. Also make sure they are not excluded by a /robots.txt file on your webserver.
marsh
Posts: 6
Joined: Mon Oct 01, 2001 6:21 pm

indexing subdirectories only

Post by marsh »

How do I set up and use a "url list" file to walk multiple subdirectories?

Marshall
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

indexing subdirectories only

Post by mark »

List all of the subdirectories and also provide a -j option for each.