Question on 'URL File'

Post Reply
kai.tallafus
Posts: 3
Joined: Tue Dec 10, 2002 6:03 am

Question on 'URL File'

Post by kai.tallafus »

Hi!

Wie have a very large list of urls of dynamic pages (press releases, initially approx. 4000-5000 URL).

I´ve got this URL File indexed properly in the first run. The problem is that due to the large amount of urls we don´t want to index the whole list again every day. The solution would be (after the initial walk) to write only the new press release urls into the text file (approx. 5 urls per day) and do a rewalk type "refresh", so that the index of the first walk is kept and the new urls are added.
But this doesn´t work :(

Is there a way to do this? Please help!! It´s very urgent... :(

Best Regards
Kai Tallafus
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Question on 'URL File'

Post by mark »

If you're specifying all of the pages that need to be added you should be using page file instead of url file.
Then just add the desired urls to the file and run dowalk by hand (see http://www.thunderstone.com/texis/site/ ... ing+dowalk ) and use the "singles" entry point.
kai.tallafus
Posts: 3
Joined: Tue Dec 10, 2002 6:03 am

Question on 'URL File'

Post by kai.tallafus »

Thank you for the answer!

Just to make things more clear for me:

- I should switch to "Page File" instead of "URL File"
- add ALL (the 4000-5000) urls to the Page File and then run the Initial Walk (Rewalk type "New" in webinterface)
- for every new url(s): clear the page file, add the new url(s) and run the dowalk by hand with the "singles" entry point

The command line would look similar to this:
C:\Inetpub\scripts>texis.exe -r profile="egov_pm_l1" "F:\Program Files\Thunderstone Software\Webinator\texis\scripts/Webinator/dowalk\singles.txt"

Is that what you want me to do?

Btw. what is the "-r" switch in the commandline?

Thanks again for your help!
Kai Tallafus
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Question on 'URL File'

Post by mark »

kai.tallafus
Posts: 3
Joined: Tue Dec 10, 2002 6:03 am

Question on 'URL File'

Post by kai.tallafus »

Edited this message:
It worked fine for me now. Just have to test a bit more...

Thanks for your help!!!

Greetz
Kai
Post Reply