Right. I think OCR from Acrobat can be run from the command line.
Cheers
Al

On 14/06/2016 6:53 am, "Tilman Hausherr" wrote:

> On 13.06.2016 at 20:49, Al Grant wrote:
>
>> Morning
>>
>> Is it possible to extract text from a scanned PDF using PDFBox?
>
> If the PDF has invisible OCRed text, yes. If not, then you'd need to OCR
> it. TIKA is working on something.
>
> Tilman

