Page 1 of 1

Suffix Processing

Posted: Wed Jan 15, 2003 3:30 pm
by kzinda
I am searching for concept variants of "sterilization" in my text. While it would appear to me from the morphene stripping routine described in the manaual that after stripping, the word form being matched in the text is "steriliz," Texis does not find the form "sterilized." Conversely, "sterilized" does not find "sterilization." Why is that? And yes, suffix processing is turned on.

Suffix Processing

Posted: Wed Jan 15, 2003 3:52 pm
by kzinda
minwordlen is set at default, which I beleive is 5.

Suffix Processing

Posted: Wed Jan 15, 2003 4:08 pm
by mark
With default suffixes sterilized strips to steril and sterilization strips to steriliz. You could add ization to the suffix list to correspond to to ized.

Suffix Processing

Posted: Thu Jan 16, 2003 11:45 am
by kzinda
I still don't understand why if "sterilzation" is stripped to "Steriliz", why it doesn't recognize "sterilized". Doesn't Texis look for the pattern match (steriliz) and then check for appropriate suffixes?

Suffix Processing

Posted: Thu Jan 16, 2003 11:56 am
by mark
When it finds sterilized it strips it down, then compares to what it's looking for. Basically, it compares roots after all suffix removal.

Suffix Processing

Posted: Thu Jan 16, 2003 12:05 pm
by kzinda
SO what your saying is that when it finds sterilzed in the text, it strips the the word (steril)and compares it against the stripped word in the query (steriliz)?

Suffix Processing

Posted: Thu Jan 16, 2003 12:13 pm
by mark
yes