Hi,
anytotx extracts the text for most of the PDF docs, but for some it doesn't. When I did a,
anytotx -fpdf <pdfdoc.pdf >pdfdoc.txt
it gave me an error, 000 Can't get text from PDF document.
what could be wrong? does it have something to do with the way pdf docs are created? anytotx --identify gives,
release: 20010418
thunderstone: 1
formats: pdf html msw swf auto
acrobat: 30
metaok: 1
features: meta links images
thanx,
anytotx extracts the text for most of the PDF docs, but for some it doesn't. When I did a,
anytotx -fpdf <pdfdoc.pdf >pdfdoc.txt
it gave me an error, 000 Can't get text from PDF document.
what could be wrong? does it have something to do with the way pdf docs are created? anytotx --identify gives,
release: 20010418
thunderstone: 1
formats: pdf html msw swf auto
acrobat: 30
metaok: 1
features: meta links images
thanx,