Page 1 of 1
powerpoint crawling
Posted: Fri Feb 13, 2004 3:02 pm
by surfer71
Hi,
Webinator version: Webinator 4.4.9-Unix-w/plugin
I am able to successfully crawl ppt files but the results show lot of garbage characters. Is there any way to fix that?
Thanks in advance.
powerpoint crawling
Posted: Fri Feb 13, 2004 3:57 pm
by mark
The plugin is geared more towards making the text searchable than looking pretty. It's normal to get some extra junk with some formats sometimes. Is the expected text there?
powerpoint crawling
Posted: Fri Feb 13, 2004 4:07 pm
by surfer71
Thanks for the quick reply.
The expected text is there inside the presentation but the results shows like this:
No Slide Title
ñõ7ýü4^ý¡sÜyüÍWç= ] =ì æ¤Äy) õ ý ÇN)M'J`oêVÚ Øç> *¾¾$Õø L Ô^ióEºE'ðr åKç¹뵾ª ¾D hÎ vµ-Äz`Ϟð\gþ{ÛÇ'êW4 ¯¨¿ô1Tr ¬?î·ò5 ] Q÷ ¡ ¯ø?ôbÒ{ ZJvHQ@ ØÖü^ èO¨¡4 ...
URL:
http://espnfield/...sales/pages/asm/ESP ... r02_03.ppt
powerpoint crawling
Posted: Fri Feb 13, 2004 4:51 pm
by mark
I meant, is the expected text there in the database? Click "Match info" to see all the text that was extracted.
Can you give the actual url for an example ppt?
powerpoint crawling
Posted: Tue Feb 17, 2004 8:26 am
by surfer71
Yes, I can see the text underlined when I click on "preview document matches".
I can't give you the url as its on our intranet site but I can send it to you via email.
powerpoint crawling
Posted: Tue Feb 17, 2004 10:06 am
by mark
You can open a tech support ticket using the "Tech Support" link on the left.