crawling filesystem

Post Reply
mrouch
Posts: 3
Joined: Thu Feb 12, 2009 4:43 pm

crawling filesystem

Post by mrouch »

i'm trying to crawl a windows share, testing with one URL. in attempting to pick up just one url, which resolves from the browser, i get

Document not found: file:// document from file N:\BrandManagement\business.pdf: The system cannot find the path specified

for the url file:///N:/BrandManagement/business.pdf

i have checked File in Protocols

suggestions? thanks.
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

crawling filesystem

Post by mark »

Your browser is using your local machine's mapping for the "N:" drive. The appliance doesn't know that mapping.

Go to Maintenance->Network filesystems and shares
to tell the appliance about your network file system(s)
and to get the proper base url.
mrouch
Posts: 3
Joined: Thu Feb 12, 2009 4:43 pm

crawling filesystem

Post by mrouch »

i'm running Webinator 5.1.74-Windows-w/plugin and don't see Maintenance on the Admin menu. am i missing something?
User avatar
John
Site Admin
Posts: 2622
Joined: Mon Apr 24, 2000 3:18 pm
Location: Cleveland, OH
Contact:

crawling filesystem

Post by John »

You will still need to use the UNC path to the file to get it to crawl, e.g. file:\\SERVER\SHARE\BrandManagement ...

There may be permission issues depending on how the crawl is started. You may start the crawl from the command prompt, in which case it will run with your credentials, e.g.

texis profile=Profile dowalk/dispatch.txt
John Turnbull
Thunderstone Software
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

crawling filesystem

Post by mark »

file:///N:/BrandManagement/business.pdf is ok syntactically but the web server process may not have access to network resoruces like shared drives. Try a local drive to see if that's the issue. Running from the command prompt as John suggested should get around the webserver perm restrictions.
mrouch
Posts: 3
Joined: Thu Feb 12, 2009 4:43 pm

crawling filesystem

Post by mrouch »

perfect. that was it, the command prompt worked. thanks for the help.
Post Reply