Re: [tesseract-ocr] "Empty Page" and incomplete text recognition

2015-10-28 Thread Daniel Kraft
Hi! On 2015-10-28 18:15, Tom Morris wrote: > In addition to the skew, which I didn't notice until Alistair mentioned > it, closer examination also reveals that the images are warped, almost > as if the text was displayed on the face of a curved CRT from the olden > days. You might try de-warping

Re: [tesseract-ocr] "Empty Page" and incomplete text recognition

2015-10-27 Thread Daniel Kraft
Hi! On 2015-10-27 16:10, Allistair wrote: > Firstly I do not get Empty Page with Tesseract 3 on Mac. It reads a > couple of lines then gives up. Yes, that's true -- this particular example gives a few lines (actually the *later* ones, not the first and then giving up). But with a slightly differ

Re: [tesseract-ocr] "Empty Page" and incomplete text recognition

2015-10-27 Thread Daniel Kraft
Hi! On 2015-10-27 08:57, Allistair C wrote: > I think your whole document needs enough surrounding margin - I found > the empty page issue when my text was too close to the page edges. In > your first image you have this but not your second. Yes, that's also what I read. Note, however, that the