wish to crawl pages for links but not include them in the index

Post Reply
jgdoke
Posts: 167
Joined: Wed Jul 14, 2004 10:52 am

wish to crawl pages for links but not include them in the index

Post by jgdoke »

I need to crawl a site and have some pages which have the links to the real content be crawled for the links but not be included in the index.

For instance this page has the links I want in the index:
http://literature.rockwellautomation.co ... %20English

But I dont want that page in the index.
Hope that makes sense.
John
User avatar
mark
Site Admin
Posts: 5514
Joined: Tue Apr 25, 2000 6:56 pm

wish to crawl pages for links but not include them in the index

Post by mark »

Use "Exclude by field". Create a metamorph query to match only those pages you want to affect. Use Exclude "Pages only" to exclude the page content but keep the links.
jgdoke
Posts: 167
Joined: Wed Jul 14, 2004 10:52 am

wish to crawl pages for links but not include them in the index

Post by jgdoke »

I have searched for documentation on a metamorph query but did not find out how I would exclude pages with *browse* in the url. Can you help with that syntax?
User avatar
mark
Site Admin
Posts: 5514
Joined: Tue Apr 25, 2000 6:56 pm

wish to crawl pages for links but not include them in the index

Post by mark »

Do you want to match the word or a substring (joebrowser)?
If the word enter
browse
If the substring enter
/browse
jgdoke
Posts: 167
Joined: Wed Jul 14, 2004 10:52 am

wish to crawl pages for links but not include them in the index

Post by jgdoke »

User avatar
mark
Site Admin
Posts: 5514
Joined: Tue Apr 25, 2000 6:56 pm

wish to crawl pages for links but not include them in the index

Post by mark »

Yes, and anything similar of course.
Post Reply