Hi there,
Those PDFs are sent to us by a customer. They are hotel bookings made by a
tour operator. We'll contact them to ask for their cooperation; it is the
only thing we can do.
Thank you, Leonard
_____
From: Leonard Rosenthol-3 [via iText - General]
[mailto:[email protected]]
Sent: Friday, March 11, 2011 14:43
To: Pakhu
Subject: Re: Unreadable Pdf with PdfTextExtractor
These were printed from some application to a slightly older version of the
PostScript driver on Windows. Since PostScript is meant for printing, there
is no requirement that the font encoding information be maintained. An old
version of Ghostscript was then used to convert that PS into PDF.
Definitely NOT a good combination.
Is there any way that you can control this production process?
Leonard
From: Pakhu [mailto:[hidden email]]
Sent: Friday, March 11, 2011 8:29 AM
To: [hidden email]
Subject: Re: [iText-questions] Unreadable Pdf with PdfTextExtractor
Yes sure:
<</Producer(GNU Ghostscript 7.05)
/Title(Provider Communication)
/Creator(PScript5.dll Version 5.2.2)
_____
From: Leonard Rosenthol-3 [via iText - General] [mailto:[hidden email]]
Sent: Friday, March 11, 2011 14:23
To: Pakhu
Subject: Re: Unreadable Pdf with PdfTextExtractor
Can you tell us what software was used to produce these PDFs??
-----Original Message-----
From: Pakhu [mailto:[hidden email]]
Sent: Friday, March 11, 2011 6:43 AM
To: [hidden email]
Subject: Re: [iText-questions] Unreadable Pdf with PdfTextExtractor
Thanks to both of you.
You are right: when copying to text there is nothing but random characters,
because the font (namely the Differences array) is wrong.
But I have discovered why it is wrong: the entry g3 in the array, for
instance, means ASCII code 29+3=32, which is a space. All entries follow the
same pattern gnn (the letter g followed by an integer), and the ASCII code
is always 29+nn.
Therefore I wrote a little program that edits the PDF, gets the Differences
array, computes the right character for each entry, and rebuilds the array.
Now I can read the PDF once it has been rebuilt in this fashion.
I know I should not spend so much time correcting somebody else's mistakes,
but I receive plenty of PDFs like this...
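
For anyone wanting to try the same mapping, here is a minimal sketch in plain Python (not the program mentioned above, and not using iText). It assumes the Differences array has already been read out of the PDF as a flat list of integers (starting codes) and glyph-name strings, and that the 29+nn rule holds for the font in question. The names glyph_to_char and decode_differences are made up for this illustration.

```python
import re

# Offset reported in this thread: glyph "gNN" stands for ASCII code 29 + NN.
# This is an observation about these particular files, not a general PDF rule.
GLYPH_OFFSET = 29
_GLYPH_NAME = re.compile(r"^g(\d+)$")


def glyph_to_char(name, offset=GLYPH_OFFSET):
    """Return the character for a 'gNN' glyph name, or None if it does not match."""
    match = _GLYPH_NAME.match(name)
    if match is None:
        return None
    return chr(offset + int(match.group(1)))


def decode_differences(differences, offset=GLYPH_OFFSET):
    """Build a {character code: character} map from a flattened Differences array.

    `differences` mixes integers and glyph names in the order used by PDF:
    an integer sets the next character code, and each following name is
    assigned to consecutive codes.
    """
    mapping = {}
    code = None
    for item in differences:
        if isinstance(item, int):
            code = item
            continue
        if code is None:
            raise ValueError("glyph name appears before any starting code")
        char = glyph_to_char(item, offset)
        if char is not None:
            mapping[code] = char
        code += 1  # names that do not match still occupy a code
    return mapping


if __name__ == "__main__":
    sample = [1, "g3", "g36", 10, "g37"]  # made-up sample data
    print(decode_differences(sample))
```

The sketch returns characters rather than writing a corrected array back into the file; rebuilding the array, as described above, would additionally require turning each character into its standard glyph name (for example "space" for code 32).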
--
View this message in context:
http://itext-general.2136553.n4.nabble.com/Unreadable-Pdf-with-PdfTextExtractor-tp3345219p3347943.html
Sent from the iText - General mailing list archive at Nabble.com.
----------------------------------------------------------------------------
--
Colocation vs. Managed Hosting
A question and answer guide to determining the best fit
for your organization - today and in the future.
http://p.sf.net/sfu/internap-sfd2d
_______________________________________________
iText-questions mailing list
[hidden email]
https://lists.sourceforge.net/lists/listinfo/itext-questions
iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a
reference to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples:
http://itextpdf.com/themes/keywords.php
--
View this message in context:
http://itext-general.2136553.n4.nabble.com/Unreadable-Pdf-with-PdfTextExtractor-tp3345219p3348386.html
Sent from the iText - General mailing list archive at Nabble.com.