No more same Url

verzetti
Posts: 9
Joined: Wed Jan 03, 2001 11:05 am

Post by verzetti »

Dear staff,
We bought the full Texis some months ago.
I have a problem with the search script.
If I search for "webinator rules" on my site, the results will include, for example:
http://webinator.com/webinator.html and http://webinator.com/webinator2.html, both containing the words "webinator" and "rules".
So this is my problem: how can I display just ONE page per domain?
The main difficulty is that I need to find the fastest way to
do this, because our URL database has millions of URLs.
I think the best way is to do as you suggested in a previous email:
"...Or maintain a list of displayed hosts and only display answers for those not already in the list."
How do I do this? Is there a demo, or can you give us
the code or any ideas?
Thanks and regards,

Augusto Verzetti - Noago Srl

bart
Posts: 251
Joined: Wed Apr 26, 2000 12:42 am

Post by bart »

See the <rex> and <xtree> functions.

I'd do it by using <rex> to extract the domain of each URL about to be displayed, then checking it against an xtree to see if it was already displayed. If not, show it and add it to the xtree.
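
The idea can be sketched roughly as follows. This is Python, not Vortex, so it only illustrates the logic: a regular expression stands in for <rex>, and a plain set stands in for the xtree. The names `extract_host` and `first_hit_per_host` and the sample URLs are made up for the example, and the host pattern is deliberately simple (it ignores things like user:password@ prefixes).

```python
import re

# Captures the host part of scheme://host[:port]/path (simplified on purpose).
HOST_RE = re.compile(r"^[a-z][a-z0-9+.-]*://([^/:?#]+)", re.IGNORECASE)


def extract_host(url):
    """Return the lower-cased host of a URL, or None if it does not match."""
    m = HOST_RE.match(url)
    return m.group(1).lower() if m else None


def first_hit_per_host(urls):
    """Yield only the first URL seen for each host, preserving result order."""
    seen = set()  # plays the role of the xtree: hosts already displayed
    for url in urls:
        host = extract_host(url)
        if host is None or host in seen:
            continue  # unparseable, or host already shown: skip this hit
        seen.add(host)
        yield url


# Hypothetical search results, in ranked order.
hits = [
    "http://webinator.com/webinator.html",
    "http://webinator.com/webinator2.html",
    "http://example.com/page.html",
]

for url in first_hit_per_host(hits):
    print(url)
```

Because the check happens as each hit is about to be displayed, only the hosts shown so far need to be remembered, not the millions of URLs in the database. Each lookup in the set is roughly constant time.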