IGNORE TAGS and HTTP header

tmarino
Posts: 94
Joined: Thu Apr 25, 2002 1:04 pm

Post by tmarino »

We are using Webinator 4.4.8-Unix-w/plugin and have implemented the IGNORE TAGS option with <!--beginnotexis--> and <!--endnotexis-->. Whenever we run the WALK with those tags anywhere in our pages, Webinator fails to get the Last Modified Date from the HTTP header. Any suggestions?
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Post by mark »

In the "storepage" function of dowalk there is a block of code like the one below. Delete it from there and paste it at the beginning of the "collectmeta" function.

<getmodheader><!-- get last-modified date of document -->
<if $ret eq ""><!-- webserver did not provide modified date -->
   <$when="now"><!-- use visited date -->
<else>
   <!-- expected format: Mon, 25 Oct 1999 19:50:28 GMT -->
   <timport "recexpr .+
datefmt x, dd mmm yyyy HH:MM:SS xxx
field when date 1
" $ret></timport><!-- turn it into a texis style date -->
   <if $loop eq 0><!-- format didn't match -->
      <$when="now"><!-- use visited date -->
   <else>
      <$when=($when-$TZ)><!-- convert from GMT to localtime -->
   </if>
</if>
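
For readers who do not know Vortex, the fallback logic in the block above can be sketched in Python. This is only an illustration of the same idea (parse the Last-Modified header, and fall back to the visit time when the header is missing or does not match the expected format). It is not part of dowalk, and the function name last_modified_or_now is an assumption made up for this sketch.

```python
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime


def last_modified_or_now(header_value, now=None):
    """Return the Last-Modified time as an aware UTC datetime.

    Falls back to `now` (the visit time) when the header is empty
    or cannot be parsed, mirroring the two fallbacks in the script.
    """
    if now is None:
        now = datetime.now(timezone.utc)
    if not header_value:
        # Web server did not provide a modified date.
        return now
    try:
        parsed = parsedate_to_datetime(header_value)
    except (TypeError, ValueError):
        # Format did not match; older Python versions raise TypeError here.
        return now
    if parsed.tzinfo is None:
        # Treat a date without zone information as GMT.
        parsed = parsed.replace(tzinfo=timezone.utc)
    return parsed.astimezone(timezone.utc)


if __name__ == "__main__":
    print(last_modified_or_now("Mon, 25 Oct 1999 19:50:28 GMT"))
    print(last_modified_or_now(""))
```

Unlike the Vortex code, this sketch keeps the result in GMT instead of subtracting a local offset. Converting to local time would be a separate step.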
tmarino

Post by tmarino »

OK, I did this and ran the WALK. It now returns only today's date as the LAST MODIFIED DATE for all the documents. I edited the DOWALK file at usr/local/morph3/webinator. Is that the right one?
mark

Post by mark »

No, that's the as-distributed backup copy. The "live" one is in INSTALLDIR/texis/scripts/webinator or, in older installations, in DOCUMENTROOT/webinator.
tmarino

Post by tmarino »

Thanks, I edited the right DOWALK. It now returns 12/31/69 as the LAST MODIFIED DATE for each document. Am I getting closer?
mark

Post by mark »

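I forgot one item. In storepage there's
<local when Hash reason>
Remove the "when" from that line, so that storepage stops declaring its own local $when and uses the one set in collectmeta. That should do it.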
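
For context on the 12/31/69 symptom: that date is what a time value of zero (the Unix epoch, 1970-01-01 00:00:00 GMT) looks like when displayed in a timezone west of GMT, which is consistent with $when being empty when the page was stored. A minimal Python sketch of that arithmetic follows. The helper render_date and the -5 hour offset are assumptions for illustration only. The thread does not state the server's timezone, and nothing here comes from dowalk.

```python
from datetime import datetime, timedelta, timezone


def render_date(epoch_seconds, utc_offset_hours):
    """Format a Unix timestamp as MM/DD/YY in a fixed-offset timezone."""
    tz = timezone(timedelta(hours=utc_offset_hours))
    return datetime.fromtimestamp(epoch_seconds, tz).strftime("%m/%d/%y")


if __name__ == "__main__":
    print(render_date(0, -5))  # 12/31/69 (five hours before midnight GMT)
    print(render_date(0, 0))   # 01/01/70
```

So a last-modified date of 12/31/69 usually points to a date variable that was never filled in, not to a header that was parsed incorrectly.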
tmarino

Post by tmarino »

This works when I run the WALK manually. When crontab kicks off the job, the date reverts to 12/31/69. Is crontab using a different DOWALK script?
mark

Post by mark »

It shouldn't be if you're using Webinator's scheduler. Run
texis -LS
from the command line to get details about the scheduled walks.

If you're using crontab, check your crontab entry to make sure it's correct.