I want to process certain non-HTML pages through an offsite processor. For instance, if GW found a file with the extension "shtm" at:
http://www.mysite.com/page.shtm
... would there be a way to dynamically have GW retrieve the page at:
http://www.othersite.com/process?url=ht ... /page.shtm
Then, when printing search results, I would use SANDR to replace each processor URL with the original "shtm" URL (using a regular expression that captures the string after "url="). The user would then see only the original URL, though the content in the search index would come through the offsite processor.
Can this be done?
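
To make the idea concrete, here is a rough sketch of the two mappings I have in mind. It is plain Python, not GW or SANDR syntax, and the function names are made up for illustration. It also assumes the processor accepts a percent-encoded value after "url=", which I have not confirmed.

```python
import re
from urllib.parse import quote, unquote

# Processor endpoint from the example above; everything after "url=" is the original address.
PROCESSOR = "http://www.othersite.com/process?url="

def to_processor_url(original_url):
    """Hypothetical helper: route .shtm pages through the offsite processor."""
    if original_url.lower().endswith(".shtm"):
        # Assumption: the processor expects the original URL percent-encoded.
        return PROCESSOR + quote(original_url, safe="")
    return original_url  # other pages are fetched directly

# Capture whatever follows "url=" in a processor URL.
WRAPPED = re.compile(r"^http://www\.othersite\.com/process\?url=(.+)$")

def to_display_url(indexed_url):
    """Hypothetical helper: recover the original URL for the results page."""
    match = WRAPPED.match(indexed_url)
    return unquote(match.group(1)) if match else indexed_url
```

The first function is what I would want to happen at crawl time, and the second is the search-and-replace step at display time.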