Dave,

Nice one mate - let me know how you get on with it.

How do you deal with things like statements that roll over onto multiple pages?

Or, put another way, do you scan each individual page, save it, and move on to the next one without attempting to link the two images together, either by filename or some other method, or are you relying on the text document to provide the linkage?

E

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]] On Behalf Of Dave Walker
Sent: 16 September 2007 15:56
To: British Ubuntu Talk
Subject: Re: [ubuntu-uk] Document Storage

On Sun, 2007-09-16 at 12:10 +0100, Ian Pascoe wrote:
> Morning Folks
>
> In the list's opinion which is the best way to store documents?
>
> In particular, as my own filing system is, well, non-existent, I was thinking
> about scanning all necessary documents and then storing them either to HD or
> CD / DVD.
>
> I've been trying to work out in my own mind what would be the better way to
> store these scanned documents that will maintain the clarity and be of
> minimal size.
<SNIP>

Hi Ian,

I have been looking into the same possibility over the last few months. Personally I would recommend PDF, as it is easily accessible and gives a good file size at ~300dpi.

One method of retrieval I did consider was HTTP-based searching with the scanned image OCR'd. This would give $filename.pdf and $filename.txt, the latter containing the 'best effort' of OCR. The web interface could search the txt files for keywords and link to the corresponding PDFs.

Currently I have automated the scan process, producing a PDF and an OCR'd txt file. If I find time this week, I will try to work on it further - and maybe create a project on Launchpad hosting the source.

Kind Regards

Dave Walker

--
ubuntu-uk@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-uk
https://wiki.kubuntu.org/UKTeam/
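
A rough sketch of the keyword lookup Dave describes in the quoted message, assuming every scan is saved as a pair ($filename.pdf plus $filename.txt) in a single folder. The function name find_documents and the folder layout are made up for illustration; this is not Dave's actual code, and a web interface would still need to be put in front of it.

```python
from pathlib import Path


def find_documents(folder, keyword):
    """Return the PDFs whose matching OCR .txt file mentions the keyword.

    Assumes (hypothetically) that name.pdf and name.txt sit side by side
    in the same folder. The match is a plain case-insensitive substring test.
    """
    needle = keyword.lower()
    matches = []
    for txt in sorted(Path(folder).glob("*.txt")):
        # OCR output is 'best effort', so tolerate odd bytes when reading.
        text = txt.read_text(encoding="utf-8", errors="ignore")
        if needle in text.lower():
            pdf = txt.with_suffix(".pdf")
            # Only report a hit when the paired PDF is really there.
            if pdf.exists():
                matches.append(pdf)
    return matches
```

Calling find_documents("scans", "statement") would list every PDF in a folder called scans whose OCR text contains "statement". Because the link between image and text is just the shared filename, a multi-page statement only shows up as one hit if all of its pages were scanned into a single PDF.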