[Simh] scanning while hanging below [was:best way to scan 172 column fanfold 80s printout?]

Carey Tyler Schug Mon, 12 Feb 2018 05:09:06 -0800

Hanging below may be best idea on this sub-thread. Hanging below onecould BACK LIGHT the paper, which would make even lighting much moreeasy. A milky white piece of glass with several lights behind it, whereif they were in front they would block the camera. It would requirecompensation for the green bar fanfold, but maybe greenish lights wouldhelp that?


On 02/11/2018 02:10 PM, Timothe Litt wrote:

On 11-Feb-18 14:29, Davis Johnson wrote:
I think what you need is a wide carriage printer with the typicalfeed up through a slot in the bottom, and a camera.
The only working function needed from the printer is form feed.Photograph the page that is hanging below the printer, form feed andrepeat.
Anybody here ought to be able to handle the programming to automatethis process.
You would need to manually photograph the first page.

The camera would need good depth of field.
It's not that simple. You need to deal with at least 2 commonvertical pitches (6 & 8 LPI), and a number of page lengths (andwidths). These need to be setup per job; not all printers support allthese. Plus, misalignment (as Al noted, crossing the perforations atthe bottom of a page is quite common). The OP mentioned that hislistings have a hard crease; this will cause (at least) feed andstacking problems. Form feed causes a high-speed slew; this becomesless reliable as the distance moved increases. You're proposing anentire page at a time - which means that the paper will jump off thetractors frequently.[1] Old paper is fragile. Over hundreds of pages,dimensions may not be stable; it was not uncommon to have to re-adjustTOF after a while. There's a fair bit of error detection and recoveryto work out.
Lighting is an issue, as is compensating for keystoning and othermisalignments. Most cameras don't have a standard remote triggerinterface - one of the pointers I provided loads modified firmwareinto cameras from one manufacturer to make this work. If you look atdigital camera reviews, you'll see that the lenses have varyingdegrees of artifacts, especially at the edges. So you need to findand zoom to an area that's relatively "flat" & doesn't need a lot ofcorrection. While depth of field will help, it also will result inapparent font size changes as paper sways forward and back. If youstop that, you simplify the OCR - and don't need as much depth of field.
There are many backgrounds that need to be subtracted for OCR towork. (Printer paper was notorious for institutional logos, as wellas bars and other aids to human readers.) Then there are the otherissues mentioned in my earlier note.
It seems simple, but it is a P.roject. That's a capital P. With a lotof roject to work out.
It's worthwhile, but it's not simple. It's a pretty interestinghardware (and software) project. I don't mean to discourage anyonewho wants to work on it - but you need to go in with eyes open, oryou'll end up very, very frustrated.
Thunderscan tried to scan line by line & retrieve grayscale; thechallenges were piecing together the adjacent lines with pixelresolution. The focal distance was constant because the camera wason a carriage. The idea here is to capture a page per frame. So theregistration problems are quite different. One could try thethunderscan approach; it would trade one set of problems xxx"challenges and opportunities" for another.
[1] In my experience, with many brands and models of tractor feedprinters over many years. Paper handling is really difficult to getright.
http://mailman.trailing-edge.com/mailman/listinfo/simh


_______________________________________________
Simh mailing list
[email protected]
http://mailman.trailing-edge.com/mailman/listinfo/simh

[Simh] scanning while hanging below [was:best way to scan 172 column fanfold 80s printout?]

Reply via email to