I am indexing ASP pages that pull data out of a database. In the database, data may change and the new/changed data appears on the ASP page. However, when I walk these pages doing a 'Rewalk Type = Refresh', the new/changed data does not get updated in the index. I'm guessing the changes aren't reflected in the index b/c the URL didn't change and Thunderstone treats it as "not updated". How can get these data changes into the index with a 'Rewalk Type = Refresh'? I know that a 'Rewalk Type = New' would be effective but there is too much data to index all the database content on a daily schedule.
URL changing is not really related. New urls will be added to the database. Unchanged urls will be updated if their content changes.
Refresh will fetch each page that's due for refresh (check your default/min/max refresh times under all walk settings) using "ifmodsince". If the webserver respects ifmodsince and the page hasn't changed since the last visit nothing will be downloaded or updated. If there's no ifmodsince or the webserver says it's modified the page will be downloaded and checksummed. If the checksum is different than the last download the new page will be stored.
You can use list/edit urls to see if your pages are due for update.