Suffix Processing

Post Reply
kzinda
Posts: 62
Joined: Fri Nov 30, 2001 6:18 am

Suffix Processing

Post by kzinda »

When searching our patent database for
"inhaler dispense w/2",

the engine doesn't find

"dispensed for oral inhalation".

The engine requires an asterisk in the search string:
"inhaler dispens* w/2"

Why doesn't the suffix processing routine identify "dispensed" from "dispense"?
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Suffix Processing

Post by mark »

What's your minwordlen setting? By default it's 255 effectively disabling suffix processing.

And unless you've "set withinmode='word'" w/2 means within 2 characters.
kzinda
Posts: 62
Joined: Fri Nov 30, 2001 6:18 am

Suffix Processing

Post by kzinda »

minwordlen = 5, withinword=word

Also note that the engine finds "inhalation" from "inhaler" indicating suffix processing is on. The fact that it finds the phrase with the asterisk indicates word proximity.
User avatar
mark
Site Admin
Posts: 5519
Joined: Tue Apr 25, 2000 6:56 pm

Suffix Processing

Post by mark »

dispense is one of those odd words all forms of which don't strip to the same root using the default suffixes.

dispense doesn't strip down, but dispen[s][ed] does.
Post Reply