Zeus
Posts: 31 Joined: Thu Jul 29, 2004 5:12 pm
by Zeus » Mon Nov 22, 2004 8:25 pm
Hi,
I am searching for terms of the form
AN-
(AN followed by a literal hyphen).
If I search,
tsql "set alpostproc='on';set hyphenphrase='off';set allinear='on';set wordc='[\alnum\X27]';set langc='[\alnum\X27 \-\.]';set defsuffrm=1;set minwordlen=5;set suffixproc='On';select DOCID,DESCRIPTION from tblnew where DESCRIPTION like 'AN-'"
I get hits that contain just AN, without the hyphen.
I also tried putting double quotes around the term.
Any help is appreciated.
Thanks!!
mark
Site Admin
Posts: 5519 Joined: Tue Apr 25, 2000 6:56 pm
by mark » Tue Nov 23, 2004 11:47 am
This looks the same as your jerry-smith issue. I can't replicate this either. Could you create a complete sequence of statements that includes table and index creation and insertion and replicates the problem?
Zeus
Posts: 31 Joined: Thu Jul 29, 2004 5:12 pm
by Zeus » Tue Nov 23, 2004 1:34 pm
Here are the steps from our test. (Please look at steps 10 and 11, the actual problem queries.)
1. tsql "create table hyphentest(DOCID varchar(10),DOCTEXT blob)"
2. tsql "insert into hyphentest values('1','jerry smith')"
3. tsql "insert into hyphentest values('2','jerry-smith')"
4. tsql "insert into hyphentest values('3','AN-')"
5. tsql "insert into hyphentest values('4','AN-DALY')"
6. tsql "insert into hyphentest values('5','AN DALY')"
7. tsql "insert into hyphentest values('6','AN - DALY')"
8. tsql "drop index idxdoctext"
9. tsql -q "set keepnoise='on';set delexp=0;set addexp='\punct{1,5}';set addexp='\alnum{1,99}';set addexp='>>\alpha{1,50},=\alpha{1,50}';create metamorph inverted index idxdoctext on hyphentest(DOCTEXT);"
10. tsql "set alpostproc='on';set hyphenphrase='off';set allinear='on';set wordc='[\alnum\X27]';set langc='[\alnum\X27 \-\.]';set defsuffrm=1;set minwordlen=5;set suffixproc='On';select DOCID,DOCTEXT from hyphentest where DOCTEXT like 'jerry-smith'"
Texis Version 04.04.1067366033(20031028) Copyright (c) 1988-2003 Thunderstone EPI
DOCID DOCTEXT
------------+------------+
(No results)
11. tsql "set alpostproc='on';set hyphenphrase='off';set allinear='on';set wordc='[\alnum\X27]';set langc='[\alnum\X27 \-\.]';set defsuffrm=1;set minwordlen=5;set suffixproc='On';select DOCID,DOCTEXT from hyphentest where DOCTEXT like 'AN-'"
Texis Version 04.04.1067366033(20031028) Copyright (c) 1988-2003 Thunderstone EPI
DOCID DOCTEXT
------------+------------+
3 AN-
6 AN - DALY
(This should return only AN-, not AN - DALY.)
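To make the expected results for steps 10 and 11 explicit, here is a small Python sketch (not Texis) that applies a deliberately simple rule to the six test rows: a row matches only if one of its whitespace-separated tokens equals the query exactly. The `rows` dictionary and the `expected_hits` helper are made up for illustration. Whole-token matching is only one reading of the expectation stated above, and it says nothing about how the Metamorph index works internally.

```python
# Test rows copied from the insert statements in steps 2-7.
rows = {
    "1": "jerry smith",
    "2": "jerry-smith",
    "3": "AN-",
    "4": "AN-DALY",
    "5": "AN DALY",
    "6": "AN - DALY",
}


def expected_hits(query, rows):
    """Return DOCIDs whose text has a whitespace-separated token equal to query.

    Hypothetical helper: this is an assumed matching rule for illustration,
    not the behavior of Texis or of the LIKE operator.
    """
    return [docid for docid, text in rows.items() if query in text.split()]


print(expected_hits("jerry-smith", rows))  # query from step 10
print(expected_hits("AN-", rows))          # query from step 11
```

Under this rule, step 10 would return DOCID 2 and step 11 would return only DOCID 3. Row 6 is excluded because "AN" and "-" are separate tokens there.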