Please unsubscribe [email protected]

I briefly had the ambition to teach myself how to maintain bookmarks and XML metadata for sheet music libraries but gave up that idea because of the complexity of PDF files.

-----Original Message----- From: Tilman Hausherr
Sent: Saturday, January 10, 2015 11:24 PM
To: [email protected]
Subject: Re: Content of pdf moved around

Hi,

The PDF didn't go through (never does), but you can try to use
PDFTextStripper.setSortByPosition().

Tilman|*
*|
Am 10.01.2015 um 14:04 schrieb Renaud Billen:
Hello,

I have a little issue with the extraction of the text of some pdfs, where some words are switching order with others..

With the pdf attached to this mail, if I use "save as text » from adobe reader, I get :

Référence: LIX-673LIX-6737


Nom: The test company


Type:
Ouverture: 24/04/2007

Titulaire: BD
Resp.: LIX
Co-Resp.: BB
Client




But with pdfbox I get :

Référence: LIX-6737
Nom: The test company
Titulaire: BD
Resp.: LIX
Co-Resp.: BB
Type:
Ouverture: 24/04/2007
Client


Could you tell me if something can be done to solve this problem?

Thanks,
Renaud



Reply via email to