Performance problems

michel.weber
Posts: 256
Joined: Sat Oct 08, 2005 12:40 pm

Performance problems

Post by michel.weber »

Hi when a walk is in progress, search perfmormance drops dramatically ...
- No walk in progress : Search time = immediate
- 1 walk in progress : Search time > 10 sec.
- 3 walks in progress : Search time = time out
Enabling fast result counts does not help a lot.
Process size is set to medium
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Performance problems

Post by mark »

How many servers are these walks indexing?
Is the profile being queried the same one being walked?
Is it refresh or new walk?
What if you set crawl delay to 1?
michel.weber
Posts: 256
Joined: Sat Oct 08, 2005 12:40 pm

Performance problems

Post by michel.weber »

>> How many servers are these walks indexing?
The big one was indexing 6 servers the other 2 one each. Each one has 5 threads.
I decreased the big one to 2 servers and i still get time-outs (although less frequently) as soon as a second walk is running simultaneously.
>> What if you set crawl delay to 1?
Changing crawl delay did not change anything.
>> Is it refresh or new walk?
All the walks were 'refresh'.
>> Is the profile being queried the same one being walked?
Yes and no. The big one was most affected (having time-outs. The other have their response time going up from 1-2 secs to 10-20.
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Performance problems

Post by mark »

Also reduce threads to 2 or 1. Try a process size of small to see if it helps without causing too many walk restarts.

Does the big one take a long time to do the whole job? And are a lot of pages changed on each cycle? If so it may be beneficial to pause-and-live it in the middle then resume it.
dietric
Posts: 100
Joined: Fri May 20, 2005 10:57 am

Performance problems

Post by dietric »

The only thing that helped me significantly reduce the timeout was to reduce the "Maximum Load Average". However, this does not completely eradicate the problem...
michel.weber
Posts: 256
Joined: Sat Oct 08, 2005 12:40 pm

Performance problems

Post by michel.weber »

I set thrads, servers down to one and process size to small without much result.

Here are the numbers from th walk status page :
9,020 pages in todo
240,652 pages scheduled to be refreshed
85 pages visited in the last hour (42 success/43 failed)
249,999 pages in index
The refresh walk now lasts about 2-5 minutes because of the license limit.

NB: I didn't change the 'load average' setting, because the help does not specify it's range.

To do some further testing i did the following setup
Profile A 250 000 pages, having the performance problem
Thread : 1 Server : 1 delay : 1, size : small
Profile B 120 000 pages, not having the problem
Thread : 5 Server : 1 delay : 0, size : medium

1 : Do a search returning several 1000 documents on both profiles (repeat with 3 different searches)
A result takes 5-10 seconds
B result is instantaneous
2 : Launch a refresh walk for both profiles
3 : Do the same searches again
A result takes 15-20 seconds
B result is still instantaneous
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Performance problems

Post by mark »

That's new. I thought the slowness was across the board when any crawl was running. It appears the problem is just the one profile whether it's crawling or not, so most of my suggestions are moot.

What non-default search settings are you using for profile A?
User avatar
John
Site Admin
Posts: 2623
Joined: Mon Apr 24, 2000 3:18 pm
Location: Cleveland, OH

Performance problems

Post by John »

On the Walk Status are there messages similar to:

Creating search index on fetched pages...Done.
Creating spell-checker dictionaries...Done.
Verifying usability of new walk.
John Turnbull
Thunderstone Software
michel.weber
Posts: 256
Joined: Sat Oct 08, 2005 12:40 pm

Performance problems

Post by michel.weber »

Hi

Mark :
Search setting differences between A and B an DEFAULT :

A B DEFAULT
Query Logging Yes Yes no
Search Timeout 60 NULL NULL
Abstract Length 180 180 NULL
Results per Page 25 25 10
Display no no null
Thunderstone logo
Show yes yes null
Advanced Search
Top HTML Custom Custom Default
Bottom HTML Custom Custom Default
Synonyms Phrase& Phrase& Phrase
allow synonym allow synonym recognition
only
Fast Yes Yes No
Result Counts
Word Forms Plural& Plural& Exact
possesifs possesifs match

John :

more or less ... i have
Updating search index ...WARNING: You are within 10% of your page limit, 249999.
Creating spell-checker dictionaries...Done.
Verifying usability of new walk.
Could it be that the index is not updated naymore due to licence limits?
michel.weber
Posts: 256
Joined: Sat Oct 08, 2005 12:40 pm

Performance problems

Post by michel.weber »

Oops sorry for the layout, i tried to make a readable table, but all multiple spaces got eaten.