On Tue, 18 May 2021 19:17:44 -0400
TomasK <[email protected]> dijo:

>I have some PDFs (contracts) from docusign and/or similar cloud service
>- I can read and print them, but I cannot copy or search their content.
>
>The zealots have encoded every paragraph/page with some hash and
>included custom fonts to make the document look and print normal.
>
>Does anyone know of some linuxy way to get rid of this BS and convert
>the PDFs to normal unicode?

I don't know if this will work, but it's the first thing I would try:
Import the PDF to LibreOffice Writer, or Scribus, then re-export from
them. Both of those programs have PDF import that sometimes will retain
the original text, not separated into groups of a few words, as text is
usually handled in a PDF.

Another option would be to open in a PDF viewer, then export under
various options.

Also Ghostscript can view a PDF file, and can then export as pure
Postscript. Not sure what that might accomplish, but give it a try.
_______________________________________________
PLUG: https://pdxlinux.org
PLUG mailing list
[email protected]
http://lists.pdxlinux.org/mailman/listinfo/plug

Reply via email to