If I look at your unsuccessful document, it seems to be images of a scanned
document, because some pages seem to be a little crooked or misaligned.
That kind of document needs OCR (Optical Character Recognition) before the
text can be retrieved from it.
DSpace isn't an OCR tool, because
Thanks Stephanie,
Just to be clear, does the MediaFilter grab the text file that is
attached to the PDF, if the PDF has one?
So, if the PDF is simply an image, then MediaFilter will create a blank
page.
Is this correct?
Shawna
Shawna Sadler
Coordinator, Digital Initiatives
Libraries