Directory contents showing up in results
Posted: Thu Feb 03, 2005 6:32 pm
by MattS
On our results page, a search for "HR" returns links to the actual directories with "HR" in the URL, for example:
http://intrardns/hrdocs/8/HRReports/
Clicking one shows the list of files in the directory.
Now we want the HR reports that are in that folder to show up, but why is a link to the actual directory showing up?
Any ideas on what I need to tweak so I can stop this happening?
Thanks,
Matt
Posted: Thu Feb 03, 2005 7:15 pm
by John
If the directory pages contain some unique content that marks them as directory listings, you can use Exclude by Field to keep those pages out of the index while still following their links.
Posted: Thu Feb 03, 2005 7:20 pm
by MattS
Thanks for the quick response!
As a newbie, first things first: what/where is a "directory page"?
Secondly, when you say "exclude", do you mean in the actual "walk" interface?
Posted: Thu Feb 03, 2005 7:56 pm
by mark
A "directory page" is what you're asking to stop showing up.
Yes. In the walk interface, under "All walk settings", look for "Exclude by Field". Click the ? next to it for a description of how to use it.
Posted: Thu Feb 03, 2005 8:08 pm
by MattS
Thanks, I thought John meant there was an actual "file" somewhere.
We are running the full licence version of Webinator 4.4.16-Windows-w/plugin and I can only see a setting called "Exclusions". We don't have "Exclude by Field".
In Exclusions we have the cgi folder and the docman folder, but that's because we don't want their contents to be displayed, walked or indexed.
We want the pages in the HR folder to be indexed and to show up in results; we just don't want to see the directory listing itself.
Any tips?
Posted: Fri Feb 04, 2005 12:34 pm
by mark
We were discussing a version 5 feature. You can do something similar in version 4 by editing dowalk. Use <rex> to identify something distinctive about the pages you don't want indexed. When you spot it, set:
<$metarobots_index="N">
<$metarobots_follow="Y">
You'd probably want to do that right before or after the call to <collectmeta>.
Or contact sales to discuss an upgrade to version 5.
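For anyone trying the dowalk route, the check described above boils down to "if the page looks like a directory listing, don't index it but do follow its links". Below is a rough sketch of that logic in Python rather than Vortex, since the variable that holds the page text inside dowalk isn't shown in this thread. The marker strings are assumptions based on what Apache ("Index of /...") and IIS ("[To Parent Directory]") typically emit for directory listings, and the function name is made up for illustration; check the source of your own listing pages before relying on them.

```python
import re

# Assumed markers (not from this thread): Apache autoindex titles its pages
# "Index of /path", and IIS directory browsing emits a "[To Parent Directory]"
# link. Replace these with whatever is distinctive on your own listing pages.
DIRECTORY_LISTING = re.compile(
    r"<title>\s*Index of /|\[To Parent Directory\]",
    re.IGNORECASE,
)


def robots_flags(html: str) -> dict:
    """Hypothetical helper mirroring $metarobots_index / $metarobots_follow."""
    if DIRECTORY_LISTING.search(html):
        # Directory listing: keep it out of the index, but still walk its links
        # so the reports inside the folder get indexed.
        return {"index": "N", "follow": "Y"}
    # Ordinary page: index it and follow its links as usual.
    return {"index": "Y", "follow": "Y"}


if __name__ == "__main__":
    listing = '<A HREF="/hrdocs/8/">[To Parent Directory]</A>'
    print(robots_flags(listing))
```

In dowalk itself the equivalent would be a <rex> test on the fetched page followed by the two assignments above, leaving the variables alone for ordinary pages.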
Posted: Mon Feb 07, 2005 5:22 pm
by MattS
Thanks, I'll try and have a tinker today.
I've realised that one of the things that has caused me a bit of grief is that the guys who set this up upgraded without uninstalling the old version or cleaning up after themselves.
From a bit of reading on the site here, it seems that having multiple versions installed might be at the heart of some of the other problems I've had.
Cheers,
Matt