don't crawl untitled pages

KMandalia
Posts: 301
Joined: Fri Jul 09, 2004 3:50 pm

Post by KMandalia »

I am happy to throw out pages that don't have any title. How do I do that in dowalk?

The thing here is that I only want to throw away webpages but keep PDF and Doc files.

I guess I have to also delete all the untitled pages from the database and re-index it for this to take effect. Am I right?
John
Site Admin
Posts: 2622
Joined: Mon Apr 24, 2000 3:18 pm
Location: Cleveland, OH

Post by John »

The simplest method currently would be to add the following after the <metafromfield> call:

<if $title eq ''><$exfield_index = "N"></if>

This tells it not to index the page, just as if the page had matched an "Exclude by Field" setting. To get rid of the untitled pages already in the database, you would need to either delete them or rewalk the database.
John Turnbull
Thunderstone Software