Re: Scanning docs for bitsavers

Grant Taylor via cctalk Mon, 02 Dec 2019 18:09:05 -0800

On 12/2/19 5:34 PM, Guy Dunphy via cctalk wrote:

Interesting comments Guy.

I'm completely naive when it comes to scanning things for preservation.Your comments do pass my naive understanding.

But PDF literally cannot be used as a wrapper for the results,since it doesn't incorporate the required image compression formats.This is why I use things like html structuring, wrapped as either a zipfile or RARbook format. Because there is no other option at present.There will be eventually. Just not yet. PDF has to be either greatlyextended, or replaced.

I *HATE* doing anything with PDFs other than reading them. My opinionis that PDF is where information goes to die. Creating the PDF was thelast time that anything other than a human could use the information asa unit. Now, in the future, it's all chopped up lines of text that maybe in a nonsensical order. I believe it will take humans (or somethingyet to be created with human like ability) to make sense of the contentand recreate it in a new form for further consumption.

Have you done any looking at ePub? My understanding is that they are azip of a directory structure of HTML and associated files. That soundsquite similar to what you're describing.

And that's why I get upset when people physically destroy rare olddocuments during or after scanning them currently. It happens sofrequently, that by the time we have a technically adequate documentcoding scheme, a lot of old documents won't have any survivingpaper copies. They'll be gone forever, with only really crap qualityscans surviving.


Fair enough.



--
Grant. . . .
unix || die

Re: Scanning docs for bitsavers

Reply via email to