michel.weber
Posts: 256 Joined: Sat Oct 08, 2005 12:40 pm
Post
by michel.weber » Wed Jul 26, 2006 10:20 am
Hi
Let's say that in the 'Keep tags' list i pur a pair of tags like <div id="thunderstone"> and </div id="thunderstone">.
What happens to a HTML file which does not contain these tags?
What happens to documents (PDF, WORD, ...) which obviously won(t contain these tags either?
John
Site Admin
Posts: 2623 Joined: Mon Apr 24, 2000 3:18 pm
Location: Cleveland, OH
Post
by John » Wed Jul 26, 2006 10:46 am
If the keep tags are not found then the entire document is kept.
John Turnbull
Thunderstone Software
michel.weber
Posts: 256 Joined: Sat Oct 08, 2005 12:40 pm
Post
by michel.weber » Wed Jul 26, 2006 10:48 am
Great. Thats what i hoped for.
josh104
Posts: 24 Joined: Mon Oct 09, 2006 5:39 pm
Post
by josh104 » Wed Oct 11, 2006 3:36 pm
Does the search appliance follow links that exist outside of these tags? (This question assumes that the tags do exist within the page)
John
Site Admin
Posts: 2623 Joined: Mon Apr 24, 2000 3:18 pm
Location: Cleveland, OH
Post
by John » Wed Oct 11, 2006 3:43 pm
Yes. The links are extracted before the keep and ignore tags are processed.
John Turnbull
Thunderstone Software