We're using Webinator Commercial, version 2.56, release 20000620 on a web server that runs Irix 6.5.
It used to index pdf documents, but for months it hasn't been including them. It gives no errors; it
just ignores them. No pdfs appear in the list of files it displays as it indexes. Here's the command
I use to create the index:
gw -ddb2 -z300000 -Iindex.htm -fshtml -meta=keywords,description
-n"application/pdf,pdf,/www/pub/webinator/bin/anytotx" -fpdf http://www.dot.ca.gov
When I run anytotx on the command line it does just what it should - it writes the
text of the pdf document to stdout.
I set up a test index to help figure out what's wrong. You can see it at
http://www.dot.ca.gov/cgi-bin/texis/web ... h/?db=test
The words fastrak, collection, or toll should produce results, but every search produces a page
with "no documents matched the query" and these error messages in the page source:
<!-- 002 /webinator/search:473: Can't open /www/pub/webinator/test/SYSOBJECTS.tbl: No such file or directory in the function open equivs -->
<!-- 100 /webinator/search:473: Openmmapi Failed -->
<!-- 100 /webinator/search:473: Metamorph open failed. -->
The SYSOBJECTS.tbl file is there with 444 permissions, owned by webinator. I suppose this is a separate problem. I don't see this with our main index.
Here's the command I used to create the test index:
gw -dtest -n"application/pdf,pdf,/www/pub/webinator/bin/anytotx" -fpdf
-z500000 http://www.dot.ca.gov/fastrak/pdftest.htm
There are only 3 pages in this index - 1 pdf file and 2 htm files. It appears to index all 3 without complaint.
If I try to force it to add a pdf this happens:
gw -dtest -g http://www.dot.ca.gov/fastrak/FasTrakApp.pdf
http://www.dot.ca.gov/fastrak/FasTrakApp.pdf: Disallowed extension
Visited 0 pages total
I see this same message if I try to add a pdf to our main index.
What can I do to get webinator to cheerfully include pdfs? Is there something special about the default index?
I believe our troubles started when I began creating a second index, db2, renaming it "db" and replacing the
db directory with it.