I understand that the walker relies on the page's last-modified attribute (either in the request header or META tag) to determine when to reindex a page.
What if there is no meaningful value that could be attached to these attributes and I simply want to use a META EXPIRES tag to ensure that a page gets reindexed every hour, day or week?
If there is no last modified tag, it will look at when it last saw a change in the page contents. You can specify a URL that will be updated more frequently.
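For anyone wondering what "when it last saw a change in the page contents" could look like in practice, here is a rough sketch. It is not the walker's actual code; the `record_fetch` function, the `state` dictionary and the use of a SHA-256 digest are all assumptions made up for illustration.

```python
import hashlib


def record_fetch(state, url, body, now):
    """Hypothetical helper: remember when a page's contents last changed.

    state -- dict mapping URL to {"digest": ..., "last_changed": ...}
    body  -- page contents as bytes
    now   -- fetch time as a number (e.g. epoch seconds)
    Returns True when the contents differ from the previous fetch.
    """
    digest = hashlib.sha256(body).hexdigest()
    entry = state.get(url)
    if entry is None or entry["digest"] != digest:
        # First visit, or the contents changed: reset the change time.
        state[url] = {"digest": digest, "last_changed": now}
        return True
    # Same contents as before: keep the earlier change time.
    return False
```

A crawler working this way could revisit a page sooner when `last_changed` is recent and less often when the page has been stable for a long time.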
Does anyone know if I can force a page to update even if the LAST-MODIFIED META tag has not been updated? I am using the last modified tag to assign a different timestamp for an application. This allows me to sort my results by the date the event closes. This means that the page could be updated but the LAST-MODIFIED date will not actually change. I would want the walk to force an update in that case.
The last modified meta tag will be used to calculate the refresh frequency. However, if the web server still returns the document when it is conditionally fetched, the updated contents will be stored when the page is refreshed.
You can set a shorter Max Refresh Time to refresh pages more frequently.
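To illustrate the conditional fetch mentioned above, here is a minimal sketch of the generic HTTP mechanism (an If-Modified-Since request answered with either 200 or 304). It is not how the appliance is implemented; `conditional_headers` and `apply_response` are hypothetical names, and no real network request is made.

```python
from email.utils import formatdate


def conditional_headers(last_fetch_epoch):
    """Build the request header for a conditional fetch (hypothetical helper)."""
    # HTTP dates are expressed in GMT, e.g. "Thu, 01 Jan 1970 00:00:00 GMT".
    return {"If-Modified-Since": formatdate(last_fetch_epoch, usegmt=True)}


def apply_response(stored_body, status, body):
    """Decide what to keep in the index after a conditional fetch."""
    if status == 304:
        # Server says "Not Modified": keep the copy already stored.
        return stored_body
    if status == 200:
        # Server returned the document anyway: store the new contents,
        # whatever the page's last modified META tag says.
        return body
    raise ValueError("unexpected HTTP status: %d" % status)
```

So if the web server ignores the condition and answers 200 with the full document, the stored copy is replaced on refresh even though the META date never moved.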
It was mentioned that support for the META EXPIRES tag would be added to the search appliance (that was last May). Has that happened, and if so, is there any documentation? Specifically, is it guaranteed that the search appliance will revisit an indexed page during refresh walks if that tag is set accordingly?