search within a book feature

Post by **mark** » Tue Nov 16, 2004 4:57 pm

p.s.
In search also look where it checks for .pdf extension.

Post by **mark** » Tue Nov 16, 2004 11:08 pm

Something that hasn't been mentioned before, all walk setting "Plugin Split" may be useful for splitting pdf text into individual pages that then refer back to the full document.

KMandalia · Post by **KMandalia** » Thu Nov 18, 2004 12:01 pm

very nice.

But I have decided to go with the option of replacing the 'PDF Document (84k)' that webinator inserts when it can't find the title in the PDF meta data with the file name of my PDF document.

So, I will need to first identify whether the url of the walked document is ours or not and if it is ours, I shall check for no title and replace the 'PDF Docuemnt...' with the file name.

Now, would it be better to implement the above logic in dowalk or should it be in search (ideally, if this all can be done in dowalk then my search would not slow down)?

Post by **John** » Thu Nov 18, 2004 12:21 pm

It is generally best to do that in dowalk, as you only need to do it once when the document is indexed, not for every search.

Post by **mark** » Tue Nov 30, 2004 12:18 pm

Oops. This
<rex "[^/]+\F\.[^.]+>>=" $u>
should be
<rex "[^/]+\F\.=[^.]+>>=" $u>