On Tue, May 18, 2021 at 6:17 PM TomasK <[email protected]> wrote: > > I have some PDFs (contracts) from docusign and/or similar cloud service > - I can read and print them, but I cannot copy or search their content. > > The zealots have encoded every paragraph/page with some hash and > included custom fonts to make the document look and print normal. > > Does anyone know of some linuxy way to get rid of this BS and convert > the PDFs to normal unicode? > > Of course, I can print + scan + OCR the darn thing, but that feels like > heavy handed solution to this idiotic problem. Obviously I am looking > for faster solution, so that does not include me finding the hash and > coding the hell out of it by Christmas 202x. >
I don't know the answer to your question, but you might be able to skip the print part by uploading it to google drive and using the OCR there. Not a great solution, but a possible solution. Bill _______________________________________________ PLUG: https://pdxlinux.org PLUG mailing list [email protected] http://lists.pdxlinux.org/mailman/listinfo/plug
