Using Cookies

michel.weber
Posts: 256
Joined: Sat Oct 08, 2005 12:40 pm

Using Cookies

Post by michel.weber »

Hi

I'm trying to walk a site that uses cookies.

I tried to use the 'Cookie Source Path' option, but i can't figureout how this works.

First i put in the directory on my PC wherer cookies are stored, but then i get the following error :

Cannot read cookie file/dir D:\WINXP\Profiles\mike\Application Data\Mozilla\Firefox\Profiles\majuceba.default: No such file or directory

Looking at the help it seems like i'm supposed to run a browser from the appliance itself, which strikes me as strange ...

Any ideas what i'm doing wrong?
User avatar
John
Site Admin
Posts: 2623
Joined: Mon Apr 24, 2000 3:18 pm
Location: Cleveland, OH

Using Cookies

Post by John »

There are several ways to walk a site that uses cookies, and the Search Appliance should work with cookies that are set by the site on visited pages. If a login or similar cookie needs to be set the preferred method is to use the Primer URL option, which specifies a URL to visit before crawling any other pages to get a cookie set. It can include login parameters if needed, and supports http-post URLs if a POST instead of GET is required.
John Turnbull
Thunderstone Software
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Using Cookies

Post by mark »

The primer is by far the preferred solution. But if that method just can't work for some reason you would have to share your folder with the cookie file or put the cookie file on a network share. Then mount that share to the appliance using Maintenance->Network filesystems and shares. The path to the cookie file would then be of the form /mnt/local/HOST/SHARE/PATH_TO_COOKIE_FILE . Make sure that share is always working and the file always present when a walk runs.
michel.weber
Posts: 256
Joined: Sat Oct 08, 2005 12:40 pm

Using Cookies

Post by michel.weber »

The primer url does not work because when i connect to the site and i don't have the cookie, the site sends an e-mail with an url that will generate the cookie. Unfortunately this url expires after 48 hours.

So do i fill in the primer url with the cookie url do a 'new' walk, clean out the primer url and do 'refresh' walks after that?
User avatar
John
Site Admin
Posts: 2623
Joined: Mon Apr 24, 2000 3:18 pm
Location: Cleveland, OH

Using Cookies

Post by John »

That should work if the cookie keeps being "refreshed" so if the appliance keeps using it it gets another 48 hours.
John Turnbull
Thunderstone Software
michel.weber
Posts: 256
Joined: Sat Oct 08, 2005 12:40 pm

Using Cookies

Post by michel.weber »

I don't know what you mean by the cookie being "refreshed", but it does not work.
Normally the generated cookie has a life time of 10 years, but i need a way to get the appliance to remember it. The link to set up the cookie unfortunately expires after 48 hours.

I'll have a try at the second method, although it does seem to be a bit unsecure to put a cookie on a network share.
User avatar
John
Site Admin
Posts: 2623
Joined: Mon Apr 24, 2000 3:18 pm
Location: Cleveland, OH

Using Cookies

Post by John »

We will add the ability to upload the cookie file to the appliance in a future version, but for now the network share is the appropriate method.
John Turnbull
Thunderstone Software
michel.weber
Posts: 256
Joined: Sat Oct 08, 2005 12:40 pm

Using Cookies

Post by michel.weber »

OK

That works