[jira] [Resolved] (PDFBOX-2791) Provide access to Type 1 font data

2015-05-08 Thread John Hewson (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

John Hewson resolved PDFBOX-2791.
-
   Resolution: Fixed
Fix Version/s: 2.0.0

> Provide access to Type 1 font data
> --
>
> Key: PDFBOX-2791
> URL: https://issues.apache.org/jira/browse/PDFBOX-2791
> Project: PDFBox
>  Issue Type: Improvement
>  Components: FontBox
>Affects Versions: 2.0.0
>Reporter: John Hewson
>Assignee: John Hewson
>Priority: Minor
> Fix For: 2.0.0
>
>
> I was analysing some PDF files recently and wanted to dump the repaired Type 
> 1 font stream; however, there's no API which provides access to this.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-2791) Provide access to Type 1 font data

2015-05-08 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535876#comment-14535876
 ] 

ASF subversion and git services commented on PDFBOX-2791:
-

Commit 1678458 from [~jahewson] in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1678458 ]

PDFBOX-2791: Provide access to Type 1 font segments

> Provide access to Type 1 font data
> --
>
> Key: PDFBOX-2791
> URL: https://issues.apache.org/jira/browse/PDFBOX-2791
> Project: PDFBox
>  Issue Type: Improvement
>  Components: FontBox
>Affects Versions: 2.0.0
>Reporter: John Hewson
>Assignee: John Hewson
>Priority: Minor
> Fix For: 2.0.0
>
>
> I was analysing some PDF files recently and wanted to dump the repaired Type 
> 1 font stream; however, there's no API which provides access to this.






[jira] [Created] (PDFBOX-2791) Provide access to Type 1 font data

2015-05-08 Thread John Hewson (JIRA)
John Hewson created PDFBOX-2791:
---

 Summary: Provide access to Type 1 font data
 Key: PDFBOX-2791
 URL: https://issues.apache.org/jira/browse/PDFBOX-2791
 Project: PDFBox
  Issue Type: Improvement
  Components: FontBox
Affects Versions: 2.0.0
Reporter: John Hewson
Assignee: John Hewson
Priority: Minor


I was analysing some PDF files recently and wanted to dump the repaired Type 1 
font stream; however, there's no API which provides access to this.
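
For readers unfamiliar with the "segments" mentioned in the commit for this issue, here is a minimal sketch. It assumes the layout the PDF specification describes for an embedded Type 1 font program: the first Length1 bytes are the clear-text portion and the next Length2 bytes are the eexec-encrypted portion. The class name Type1Segments and the method split are hypothetical and are not FontBox API; the sketch performs no repair of incorrect lengths.

```java
import java.util.Arrays;

/** Hypothetical helper (not part of FontBox): splits an embedded Type 1 font program. */
final class Type1Segments {
    final byte[] ascii;   // clear-text portion: the first Length1 bytes
    final byte[] binary;  // eexec-encrypted portion: the next Length2 bytes

    private Type1Segments(byte[] ascii, byte[] binary) {
        this.ascii = ascii;
        this.binary = binary;
    }

    /** Splits fontProgram using the Length1 and Length2 values from the font stream dictionary. */
    static Type1Segments split(byte[] fontProgram, int length1, int length2) {
        if (length1 < 0 || length2 < 0 || (long) length1 + length2 > fontProgram.length) {
            throw new IllegalArgumentException("Length1/Length2 do not fit the stream");
        }
        byte[] ascii = Arrays.copyOfRange(fontProgram, 0, length1);
        byte[] binary = Arrays.copyOfRange(fontProgram, length1, length1 + length2);
        return new Type1Segments(ascii, binary);
    }
}
```

Dumping either segment to a file is then a plain byte-array write. A real implementation would also have to cope with streams whose Length1 or Length2 values are wrong, which is presumably the "repaired" part of the request.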






[jira] [Commented] (PDFBOX-2272) Can't extract vertical text correctly

2015-05-08 Thread John Hewson (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535866#comment-14535866
 ] 

John Hewson commented on PDFBOX-2272:
-

The file from PDFBOX-2711 has the same problem: the vertical text is extracted 
in the wrong order.

> Can't extract vertical text correctly
> -
>
> Key: PDFBOX-2272
> URL: https://issues.apache.org/jira/browse/PDFBOX-2272
> Project: PDFBox
>  Issue Type: Bug
>  Components: Text extraction
>Affects Versions: 1.8.6, 2.0.0
>Reporter: Biligsaikhan Batjargal
> Attachments: test.pdf, test.txt
>
>
> - 1.8.6 can't extract the Unicode due to failing to map the UCS2 CMap for 
> 90ms-RKSJ-V.
> - 2.0 extracts the text but can't handle the vertical layout
> Also see the file from PDFBOX-2294 which contains both horizontal and 
> vertical text.






[jira] [Commented] (PDFBOX-2711) Japanese text not extracted

2015-05-08 Thread John Hewson (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535861#comment-14535861
 ] 

John Hewson commented on PDFBOX-2711:
-

The Unicode characters can now be extracted; however, they appear in the wrong 
order because ExtractText doesn't support vertical text. See PDFBOX-2272.

> Japanese text not extracted
> ---
>
> Key: PDFBOX-2711
> URL: https://issues.apache.org/jira/browse/PDFBOX-2711
> Project: PDFBox
>  Issue Type: Bug
>Affects Versions: 1.8.8, 2.0.0
>Reporter: Daniel Bonniot de Ruisselet
>Assignee: John Hewson
> Fix For: 2.0.0
>
> Attachments: 150218-pdfbox-1.8.8.txt, 150218-pdfbox-2.0.0.txt, 
> 150218-pdftotext.txt, 150218.pdf
>
>
> ExtractText does not return the text content of this PDF. There are just a 
> few real characters when running 1.8.8, and none with today's 2.0.0 snapshot.
> I also attach the output from pdftotext 0.26.5 (from poppler-utils), which 
> seems to get it mostly right.






[jira] [Resolved] (PDFBOX-2711) Japanese text not extracted

2015-05-08 Thread John Hewson (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

John Hewson resolved PDFBOX-2711.
-
   Resolution: Fixed
Fix Version/s: 2.0.0

> Japanese text not extracted
> ---
>
> Key: PDFBOX-2711
> URL: https://issues.apache.org/jira/browse/PDFBOX-2711
> Project: PDFBox
>  Issue Type: Bug
>Affects Versions: 1.8.8, 2.0.0
>Reporter: Daniel Bonniot de Ruisselet
>Assignee: John Hewson
> Fix For: 2.0.0
>
> Attachments: 150218-pdfbox-1.8.8.txt, 150218-pdfbox-2.0.0.txt, 
> 150218-pdftotext.txt, 150218.pdf
>
>
> ExtractText does not return the text content of this PDF. There are just a 
> few real characters when running 1.8.8, and none with today's 2.0.0 snapshot.
> I also attach the output from pdftotext 0.26.5 (from poppler-utils), which 
> seems to get it mostly right.






[jira] [Commented] (PDFBOX-2711) Japanese text not extracted

2015-05-08 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535857#comment-14535857
 ] 

ASF subversion and git services commented on PDFBOX-2711:
-

Commit 1678457 from [~jahewson] in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1678457 ]

PDFBOX-2711: Unicode extraction from Type 0 Identity-H/V fonts with CJK 
descendants

> Japanese text not extracted
> ---
>
> Key: PDFBOX-2711
> URL: https://issues.apache.org/jira/browse/PDFBOX-2711
> Project: PDFBox
>  Issue Type: Bug
>Affects Versions: 1.8.8, 2.0.0
>Reporter: Daniel Bonniot de Ruisselet
>Assignee: John Hewson
> Attachments: 150218-pdfbox-1.8.8.txt, 150218-pdfbox-2.0.0.txt, 
> 150218-pdftotext.txt, 150218.pdf
>
>
> ExtractText does not return the text content of this PDF. There are just a 
> few real characters when running 1.8.8, and none with today's 2.0.0 snapshot.
> I also attach the output from pdftotext 0.26.5 (from poppler-utils), which 
> seems to get it mostly right.






[jira] [Commented] (PDFBOX-2711) Japanese text not extracted

2015-05-08 Thread John Hewson (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535850#comment-14535850
 ] 

John Hewson commented on PDFBOX-2711:
-

This is related to the problem from PDFBOX-2509, where the PDF spec defines 
some extra rules for Type 0 fonts which use Identity-H/V encodings:

{quote}
if the font is composite and uses a predefined cmap (excluding Identity-H/V) 
or if its descendant font uses Adobe-GB1/CNS1/Japan1/Korea1 then ...
{quote}

So there's special handling where we need to check if the descendant font 
identifies itself as CJK in its CIDSystemInfo. If so, then we know which UCS-2 
CMap to use. I had actually been working on a patch for this already.
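
The lookup described above can be sketched as follows. This is a minimal illustration and not the committed patch: it assumes the PDF specification's rule that the UCS-2 CMap name is formed as registry-ordering-UCS2 (for example Adobe-Japan1-UCS2) when the descendant font's CIDSystemInfo names one of the four Adobe CJK character collections. The class and method names are hypothetical.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Optional;
import java.util.Set;

/** Hypothetical helper (not PDFBox API): picks the UCS-2 CMap name for a CJK descendant font. */
final class Ucs2CMapNames {
    // Orderings of the Adobe CJK character collections named in the quoted rule.
    private static final Set<String> CJK_ORDERINGS =
            new HashSet<>(Arrays.asList("GB1", "CNS1", "Japan1", "Korea1"));

    /**
     * Returns the name of the UCS-2 CMap for the given CIDSystemInfo registry and
     * ordering, or an empty Optional if the collection is not one of the CJK ones.
     */
    static Optional<String> forCidSystemInfo(String registry, String ordering) {
        if ("Adobe".equals(registry) && CJK_ORDERINGS.contains(ordering)) {
            return Optional.of(registry + "-" + ordering + "-UCS2");
        }
        return Optional.empty();
    }
}
```

An empty result means the caller has to fall back to whatever other mapping is available, such as a ToUnicode CMap.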

> Japanese text not extracted
> ---
>
> Key: PDFBOX-2711
> URL: https://issues.apache.org/jira/browse/PDFBOX-2711
> Project: PDFBox
>  Issue Type: Bug
>Affects Versions: 1.8.8, 2.0.0
>Reporter: Daniel Bonniot de Ruisselet
>Assignee: John Hewson
> Attachments: 150218-pdfbox-1.8.8.txt, 150218-pdfbox-2.0.0.txt, 
> 150218-pdftotext.txt, 150218.pdf
>
>
> ExtractText does not return the text content of this PDF. There are just a 
> few real characters when running 1.8.8, and none with today's 2.0.0 snapshot.
> I also attach the output from pdftotext 0.26.5 (from poppler-utils), which 
> seems to get it mostly right.






[jira] [Commented] (PDFBOX-2509) Korean Text font substitution issues

2015-05-08 Thread John Hewson (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535841#comment-14535841
 ] 

John Hewson commented on PDFBOX-2509:
-

I've implemented the missing functionality in PDFBOX-2711.

> Korean Text font substitution issues
> 
>
> Key: PDFBOX-2509
> URL: https://issues.apache.org/jira/browse/PDFBOX-2509
> Project: PDFBox
>  Issue Type: Bug
>  Components: Rendering
>Affects Versions: 2.0.0
>Reporter: simon steiner
>Assignee: John Hewson
> Fix For: 2.1.0
>
> Attachments: japan.patch, pdfbox147.png, pdfbox238.png, 
> pdfbox238_2.png, pdfbox328.png
>
>
> http://acroeng.adobe.com/Test_Files/fonts/asian%20font%20files/Korean/nonembedded/K4SystemFontsNotEmbeded218.PDF
> and
> http://acroeng.adobe.com/Test_Files/fonts/asian%20font%20files/Korean/nonembedded/KGulimcheNotembeded218.PDF
> and
> http://acroeng.adobe.com/Test_Files/fonts/asian%20font%20files/Korean/nonembedded/VariousKFontsNotembeded218.PDF
> and
> http://acroeng.adobe.com/Test_Files/fonts//EmbeddedCmap.pdf
> and
> http://acroeng.adobe.com/Test_Files/fonts/asian%20font%20files/Japanese/nonembedded/Jun101.pdf
> and
> http://acroeng.adobe.com/Test_Files/fonts/asian%20font%20files/Japanese/nonembedded/ACPTJ_WIN_MSGothic.DOC.pdf
> java -jar ~/pdf-box-svn/app/target/pdfbox-app-2.0.0-SNAPSHOT.jar PDFToImage 
> K4SystemFontsNotEmbeded218.PDF






[jira] [Commented] (PDFBOX-2711) Japanese text not extracted

2015-05-08 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2711?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535810#comment-14535810
 ] 

ASF subversion and git services commented on PDFBOX-2711:
-

Commit 1678454 from [~jahewson] in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1678454 ]

PDFBOX-2711: Log warning when Type 0 font text cannot be extracted

> Japanese text not extracted
> ---
>
> Key: PDFBOX-2711
> URL: https://issues.apache.org/jira/browse/PDFBOX-2711
> Project: PDFBox
>  Issue Type: Bug
>Affects Versions: 1.8.8, 2.0.0
>Reporter: Daniel Bonniot de Ruisselet
>Assignee: John Hewson
> Attachments: 150218-pdfbox-1.8.8.txt, 150218-pdfbox-2.0.0.txt, 
> 150218-pdftotext.txt, 150218.pdf
>
>
> ExtractText does not return the text content of this PDF. There are just a 
> few real characters when running 1.8.8, and none with today's 2.0.0 snapshot.
> I also attach the output from pdftotext 0.26.5 (from poppler-utils), which 
> seems to get it mostly right.






[jira] [Assigned] (PDFBOX-2711) Japanese text not extracted

2015-05-08 Thread John Hewson (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

John Hewson reassigned PDFBOX-2711:
---

Assignee: John Hewson

> Japanese text not extracted
> ---
>
> Key: PDFBOX-2711
> URL: https://issues.apache.org/jira/browse/PDFBOX-2711
> Project: PDFBox
>  Issue Type: Bug
>Affects Versions: 1.8.8, 2.0.0
>Reporter: Daniel Bonniot de Ruisselet
>Assignee: John Hewson
> Attachments: 150218-pdfbox-1.8.8.txt, 150218-pdfbox-2.0.0.txt, 
> 150218-pdftotext.txt, 150218.pdf
>
>
> ExtractText does not return the text content of this PDF. There are just a 
> few real characters when running 1.8.8, and none with today's 2.0.0 snapshot.
> I also attach the output from pdftotext 0.26.5 (from poppler-utils), which 
> seems to get it mostly right.






[jira] [Commented] (PDFBOX-2119) Possible printing bug for V2.00

2015-05-08 Thread John Hewson (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535715#comment-14535715
 ] 

John Hewson commented on PDFBOX-2119:
-

This particular issue never existed in the first place. Please open a new JIRA 
issue and attach the PDF file in question. I suspect the problem is that you're 
not taking into account that your default printer has margins - if you want to 
print at 100% scale, you'll need to set these to zero.
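
Setting the margins to zero can be sketched with the standard java.awt.print classes alone. This is a minimal illustration under stated assumptions, not PDFBox code: the class name ZeroMarginPage and the method fullBleed are hypothetical, and the page size is given in points (1/72 inch).

```java
import java.awt.print.PageFormat;
import java.awt.print.Paper;

/** Hypothetical helper: builds a PageFormat whose imageable area covers the whole sheet. */
final class ZeroMarginPage {
    static PageFormat fullBleed(double widthPts, double heightPts) {
        Paper paper = new Paper();
        paper.setSize(widthPts, heightPts);
        // A new Paper starts with margins; stretch the imageable area to the full sheet.
        paper.setImageableArea(0, 0, widthPts, heightPts);

        PageFormat format = new PageFormat(); // portrait orientation by default
        format.setPaper(paper);
        return format;
    }
}
```

The resulting PageFormat would be handed to the print job for the selected printer. The driver may still enforce hardware margins, so passing the format through PrinterJob.validatePage before printing is a reasonable precaution.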

> Possible printing bug for V2.00
> ---
>
> Key: PDFBOX-2119
> URL: https://issues.apache.org/jira/browse/PDFBOX-2119
> Project: PDFBox
>  Issue Type: Bug
>Affects Versions: 2.0.0
> Environment: Windows 7 Professional SP1, JRE 8.
>Reporter: You Liang
>  Labels: pdfbox, printer, printing
>
> Printing seems to use the paper size of the Windows default printer instead of 
> the paper size of the selected PrintService.
> For example, my default printer is an A4 printer, and the printer that I had chosen 
> to print to is a receipt printer.
> When I print to the receipt printer, it prints a blown-up version of 
> the original PDF, and when I change my default printer to the receipt 
> printer, everything works fine.






Website Updates

2015-05-08 Thread John Hewson
Hi All,

I made a few updates to the website, cleaning up some of our docs and adding a 
few links. The coding conventions have finally been updated, along with the 
build instructions and dependencies page for 2.0.

I’ve added boilerplate “Getting Started” and “Examples” pages for 2.0 which 
just link to the relevant information. This will be expanded in time, but we 
kept getting questions on the mailing list about these, so I wanted to add at 
least some pointers.

Probably the main thing that we’re in need of is a guide explaining how to 
migrate from 1.8 to 2.0.

— John



Re: extracting text from an "encrypted" pdf

2015-05-08 Thread John Hewson
Great!

— John

> On 8 May 2015, at 14:49, Tilman Hausherr  wrote:
> 
> On 08.05.2015 at 23:47, John Hewson wrote:
>> Can’t we make PDFBox open the document with an empty password? What’s the 
>> story for 2.0?
> 
> In 2.0 it opens immediately. Same in 1.8 when using the loadNonSeq().
> 
> Tilman


[jira] [Commented] (PDFBOX-2786) PDPageDestination page index off by one

2015-05-08 Thread John Hewson (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535660#comment-14535660
 ] 

John Hewson commented on PDFBOX-2786:
-

Why not just fix getPageNumber()? That method isn't any use otherwise.

> PDPageDestination page index off by one
> ---
>
> Key: PDFBOX-2786
> URL: https://issues.apache.org/jira/browse/PDFBOX-2786
> Project: PDFBox
>  Issue Type: Bug
>  Components: PDModel
>Affects Versions: 1.8.9, 2.0.0
>Reporter: Johanneke Lamberink
>Assignee: Tilman Hausherr
> Fix For: 1.8.10, 2.0.0
>
> Attachments: Archive.zip
>
>
> When creating a new bookmark with the same page number as an existing 
> bookmark, the resulting destination is offset by 1 compared to the old 
> destination.
> This results in the bookmark being set for the next page, which could be a 
> non-existing page.
> I've added a class with an example pdf and my own output pdf. Run with 
> argument of a path to where you have the pdf, including a trailing slash.






Re: extracting text from an "encrypted" pdf

2015-05-08 Thread Tilman Hausherr

On 08.05.2015 at 23:47, John Hewson wrote:

> Can’t we make PDFBox open the document with an empty password? What’s the 
> story for 2.0?

In 2.0 it opens immediately. Same in 1.8 when using the loadNonSeq().

Tilman






Re: extracting text from an "encrypted" pdf

2015-05-08 Thread John Hewson
Can’t we make PDFBox open the document with an empty password? What’s the story 
for 2.0?

— John

> On 8 May 2015, at 08:52, Tilman Hausherr  wrote:
> 
> On 08.05.2015 at 17:51, Clemens Wyss DEV wrote:
>> Thx for the very fast answer.
>>> new StandardDecryptionMaterial( password );
>> I have no password. The pdf is a public user manual.
> 
> Use an empty password :-)
> 
> Tilman
> 
>> 
>>> That is TIKA, isn't it?
>> True
>> 
>> 
>> -Original Message-
>> From: Tilman Hausherr [mailto:thaush...@t-online.de]
>> Sent: Friday, 8 May 2015 17:44
>> To: us...@pdfbox.apache.org
>> Subject: Re: extracting text from an "encrypted" pdf
>> 
>> On 08.05.2015 at 17:36, Clemens Wyss DEV wrote:
>>> When I try to extract an "encrypted" (which can be read in AcrobatReader) 
>>> document with:
>>> 
>>> pdfDocument = PDDocument.load( is );
>> add
>> if( document.isEncrypted() )
>> {
>>     StandardDecryptionMaterial sdm = new StandardDecryptionMaterial( password );
>>     document.openProtection( sdm );
>> }
>> 
>> or use loadNonSeq()
>> 
>>> PDFTextStripper pdfStripper = new PDFTextStripper();
>>> parsedText = pdfStripper.getText( pdfDocument );
>>> 
>>> I get an empty string, and " o.apache.pdfbox.pdfparser.PDFParser - Document 
>>> is encrypted" is logged.
>>> 
>>> When, on the other hand, I do:
>>> 
>>> ContentHandler handler = new BodyContentHandler( -1 );
>>> ParseContext context = new ParseContext();
>>> parser = new AutoDetectParser();
>>> context.set( Parser.class, parser );
>>> parser.parse( is, handler, metadata, context );
>>> parsedText = handler.toString();
>>> 
>>> I get to see the text/content of the very pdf.
>>> 
>>> 1) What is the preferred way to extract text from a 
>>> pdf("-that-can-be-read-in-AcrobatReader")?
>> https://svn.apache.org/viewvc/pdfbox/branches/1.8/pdfbox/src/main/java/org/apache/pdfbox/ExtractText.java?view=markup&sortby=date
>> 
>>>   2) Does the second approach possibly return "more than text"? Blobs? 
>>> Binary data?
>> That is TIKA, isn't it?
>> 
>> Tilman
>> 
>>> -
>>> To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org
>>> For additional commands, e-mail: users-h...@pdfbox.apache.org
>>> 
>> 


[jira] [Commented] (PDFBOX-2788) Seemingly good document gets semi-corrupted

2015-05-08 Thread John Hewson (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535637#comment-14535637
 ] 

John Hewson commented on PDFBOX-2788:
-

This fix will be in 1.8.10 when it is released, which will be in a few months. 
It's just a standard maintenance release.

> Seemingly good document gets semi-corrupted
> ---
>
> Key: PDFBOX-2788
> URL: https://issues.apache.org/jira/browse/PDFBOX-2788
> Project: PDFBox
>  Issue Type: Bug
>  Components: Writing
>Affects Versions: 1.8.9
> Environment: Ubuntu Linux 14.04, Java SE 7
>Reporter: Are Husby
>Assignee: Tilman Hausherr
> Fix For: 1.8.10
>
> Attachments: Reproduce possible bug in PdfBox.zip, 
> output_1430933423628-better.pdf
>
>
> I use PdfBox to insert a little bit of text at the top of my PDF-documents. I 
> have found one case (one input document)  where the resulting document 
> appears to become semi-corrupted by PdfBox.
> I will try to attach to this Jira issue a zip file with the PDF document in 
> case and a small, self-contained Java application that allows you to easily 
> reproduce the problem.
> The text I try to insert is inserted okay and is not the problem. The problem 
> is that other parts of the documents seem to get destroyed. You see this by 
> comparing the original document with the processed document in a PDF document 
> viewer.
> The problem manifests itself in different ways depending on which PDF 
> document viewer application I use. I have tried Evince (comes as default on 
> Ubuntu Linux 14.04), Firefox (also as default in Ubuntu), Google Chrome, and 
> Adobe Acrobat Reader v.11 (both on Windows 7 Enterprise and in Ubuntu with 
> Wine, the Windows emulator).
> If you use, for example, Adobe Acrobat Reader, look in particular at the logo 
> image in the upper right corner of both pages, and at the fonts and formatting 
> of the line near the bottom of the second page that says "Fakturasum: 2 
> 572,50".






[jira] [Commented] (PDFBOX-2576) Improve code quality

2015-05-08 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535344#comment-14535344
 ] 

ASF subversion and git services commented on PDFBOX-2576:
-

Commit 1678441 from [~tilman] in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1678441 ]

PDFBOX-2576: slight simplify; make method static that is called by static method

> Improve code quality
> 
>
> Key: PDFBOX-2576
> URL: https://issues.apache.org/jira/browse/PDFBOX-2576
> Project: PDFBox
>  Issue Type: Task
>Affects Versions: 2.0.0
>Reporter: Tilman Hausherr
> Attachments: ExtractText.2.patch, ExtractText.patch, 
> GraphicsOperatorProcessor.patch, SecuryHandlerFactory.patch, 
> Type5ShadingContext.patch, examples.arrayclone.patch, 
> fontbox.arrayclone.patch, org.apache.fontbox.afm.patch, 
> org.apache.fontbox.cff.cffparser.patch, org.apache.fontbox.cff.patch, 
> org.apache.fontbox.cmap.patch, 
> org.apache.pdfbox.contentstream.operator.state.patch, 
> org.apache.pdfbox.cos.patch, org.apache.pdfbox.filter-2.patch, 
> org.apache.pdfbox.filter.patch, org.apache.pdfbox.pdfwriter.COSWriter.patch, 
> org.apache.pdfbox.pdmodel.documentinterchange.logicalstructure.patch, 
> org.apache.pdfbox.pdmodel.documentinterchange.patch, 
> org.apache.pdfbox.preflight.graphic.patch, org.apache.pdfbox.resource.patch, 
> org.apache.pdfbox.text.testtextstripper.patch, pdfbox-override-patch.txt, 
> pdfbox-raw-type-patch.txt, pdfbox.arrayclone.patch, 
> pdfcloneutility-patch.txt, pdftextstripperbyarea-patch.txt, 
> ttfsubsetter-2.patch, ttfsubsetter-3.patch, ttfsubsetter-patch.txt
>
>
> This is a longterm issue for the task to improve code quality, by using the 
> [SonarQube 
> report|https://analysis.apache.org/dashboard/index/org.apache.pdfbox:pdfbox-reactor],
>  hints in different IDEs, the FindBugs tool and other code quality tools.






[jira] [Resolved] (PDFBOX-2672) Wrong code convention link on the website

2015-05-08 Thread John Hewson (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

John Hewson resolved PDFBOX-2672.
-
Resolution: Fixed

> Wrong code convention link on the website
> -
>
> Key: PDFBOX-2672
> URL: https://issues.apache.org/jira/browse/PDFBOX-2672
> Project: PDFBox
>  Issue Type: Bug
>  Components: Documentation
>Reporter: Andrea Vacondio
>Priority: Trivial
>
> Currently the website page https://pdfbox.apache.org/codingconventions.html 
> points to the Sun's code convention -> http://java.sun.com/docs/codeconv but 
> that page doesn't exist anymore. 






[jira] [Commented] (PDFBOX-2672) Wrong code convention link on the website

2015-05-08 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535292#comment-14535292
 ] 

ASF subversion and git services commented on PDFBOX-2672:
-

Commit 1678428 from [~jahewson] in branch 'cmssite/trunk'
[ https://svn.apache.org/r1678428 ]

PDFBOX-2672: Update coding conventions

> Wrong code convention link on the website
> -
>
> Key: PDFBOX-2672
> URL: https://issues.apache.org/jira/browse/PDFBOX-2672
> Project: PDFBox
>  Issue Type: Bug
>  Components: Documentation
>Reporter: Andrea Vacondio
>Priority: Trivial
>
> Currently the website page https://pdfbox.apache.org/codingconventions.html 
> points to the Sun's code convention -> http://java.sun.com/docs/codeconv but 
> that page doesn't exist anymore. 






[jira] [Commented] (PDFBOX-2790) Syntax error in DOAP file release section

2015-05-08 Thread Tilman Hausherr (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535219#comment-14535219
 ] 

Tilman Hausherr commented on PDFBOX-2790:
-

[~s...@apache.org] I'll leave that decision to our chairman :-)

> Syntax error in DOAP file release section
> -
>
> Key: PDFBOX-2790
> URL: https://issues.apache.org/jira/browse/PDFBOX-2790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Documentation
>Affects Versions: 2.0.0
> Environment: 
> http://svn.apache.org/repos/asf/pdfbox/trunk/doap_PDFBox.rdf
>Reporter: Sebb
>Assignee: Tilman Hausherr
>  Labels: DOAP
>
> DOAP files can contain details of multiple release Versions, however each 
> must be listed in a separate release section, for example:
> {code}
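> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2015-02-16</created>
>     <revision>1.6.2</revision>
>   </Version>
> </release>
> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2014-09-24</created>
>     <revision>1.6.1</revision>
>   </Version>
> </release>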
> {code}
> Please can the project DOAP be corrected accordingly?
> Thanks.






[jira] [Commented] (PDFBOX-2790) Syntax error in DOAP file release section

2015-05-08 Thread Tilman Hausherr (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535217#comment-14535217
 ] 

Tilman Hausherr commented on PDFBOX-2790:
-

Andreas wrote a text that appeared in the dev mailing list only:
{quote}
The DOAP file isn't related to a specific version. It is used to automagically 
provide some information for people.a.o; only the trunk version is needed.

IMHO you might remove the fixed version and close the ticket.
{quote}

> Syntax error in DOAP file release section
> -
>
> Key: PDFBOX-2790
> URL: https://issues.apache.org/jira/browse/PDFBOX-2790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Documentation
>Affects Versions: 2.0.0
> Environment: 
> http://svn.apache.org/repos/asf/pdfbox/trunk/doap_PDFBox.rdf
>Reporter: Sebb
>Assignee: Tilman Hausherr
>  Labels: DOAP
>
> DOAP files can contain details of multiple release Versions, however each 
> must be listed in a separate release section, for example:
> {code}
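> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2015-02-16</created>
>     <revision>1.6.2</revision>
>   </Version>
> </release>
> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2014-09-24</created>
>     <revision>1.6.1</revision>
>   </Version>
> </release>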
> {code}
> Please can the project DOAP be corrected accordingly?
> Thanks.






[jira] [Closed] (PDFBOX-2790) Syntax error in DOAP file release section

2015-05-08 Thread Tilman Hausherr (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr closed PDFBOX-2790.
---

> Syntax error in DOAP file release section
> -
>
> Key: PDFBOX-2790
> URL: https://issues.apache.org/jira/browse/PDFBOX-2790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Documentation
>Affects Versions: 2.0.0
> Environment: 
> http://svn.apache.org/repos/asf/pdfbox/trunk/doap_PDFBox.rdf
>Reporter: Sebb
>Assignee: Tilman Hausherr
>  Labels: DOAP
>
> DOAP files can contain details of multiple release Versions, however each 
> must be listed in a separate release section, for example:
> {code}
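> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2015-02-16</created>
>     <revision>1.6.2</revision>
>   </Version>
> </release>
> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2014-09-24</created>
>     <revision>1.6.1</revision>
>   </Version>
> </release>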
> {code}
> Please can the project DOAP be corrected accordingly?
> Thanks.






[jira] [Issue Comment Deleted] (PDFBOX-2790) Syntax error in DOAP file release section

2015-05-08 Thread Tilman Hausherr (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated PDFBOX-2790:

Comment: was deleted

(was: Please don't close issues; we set them to resolved if there is a code 
change and close them only after a release. Only non-code-change issues (e.g. 
"duplicate", "not a problem", etc.) may be closed immediately.)

> Syntax error in DOAP file release section
> -
>
> Key: PDFBOX-2790
> URL: https://issues.apache.org/jira/browse/PDFBOX-2790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Documentation
>Affects Versions: 2.0.0
> Environment: 
> http://svn.apache.org/repos/asf/pdfbox/trunk/doap_PDFBox.rdf
>Reporter: Sebb
>Assignee: Tilman Hausherr
>  Labels: DOAP
>
> DOAP files can contain details of multiple release Versions, however each 
> must be listed in a separate release section, for example:
> {code}
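> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2015-02-16</created>
>     <revision>1.6.2</revision>
>   </Version>
> </release>
> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2014-09-24</created>
>     <revision>1.6.1</revision>
>   </Version>
> </release>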
> {code}
> Please can the project DOAP be corrected accordingly?
> Thanks.






[jira] [Updated] (PDFBOX-2790) Syntax error in DOAP file release section

2015-05-08 Thread Tilman Hausherr (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated PDFBOX-2790:

Fix Version/s: (was: 2.0.0)

> Syntax error in DOAP file release section
> -
>
> Key: PDFBOX-2790
> URL: https://issues.apache.org/jira/browse/PDFBOX-2790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Documentation
>Affects Versions: 2.0.0
> Environment: 
> http://svn.apache.org/repos/asf/pdfbox/trunk/doap_PDFBox.rdf
>Reporter: Sebb
>Assignee: Tilman Hausherr
>  Labels: DOAP
>
> DOAP files can contain details of multiple release Versions, however each 
> must be listed in a separate release section, for example:
> {code}
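> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2015-02-16</created>
>     <revision>1.6.2</revision>
>   </Version>
> </release>
> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2014-09-24</created>
>     <revision>1.6.1</revision>
>   </Version>
> </release>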
> {code}
> Please can the project DOAP be corrected accordingly?
> Thanks.






[jira] [Commented] (PDFBOX-2790) Syntax error in DOAP file release section

2015-05-08 Thread Sebb (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535203#comment-14535203
 ] 

Sebb commented on PDFBOX-2790:
--

Or you could do as MINA does and use a separate area for the project metadata:

http://svn.apache.org/repos/asf/mina/metadata/

> Syntax error in DOAP file release section
> -
>
> Key: PDFBOX-2790
> URL: https://issues.apache.org/jira/browse/PDFBOX-2790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Documentation
>Affects Versions: 2.0.0
> Environment: 
> http://svn.apache.org/repos/asf/pdfbox/trunk/doap_PDFBox.rdf
>Reporter: Sebb
>Assignee: Tilman Hausherr
>  Labels: DOAP
> Fix For: 2.0.0
>
>
> DOAP files can contain details of multiple release Versions, however each 
> must be listed in a separate release section, for example:
> {code}
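> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2015-02-16</created>
>     <revision>1.6.2</revision>
>   </Version>
> </release>
> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2014-09-24</created>
>     <revision>1.6.1</revision>
>   </Version>
> </release>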
> {code}
> Please can the project DOAP be corrected accordingly?
> Thanks.






Re: [jira] [Resolved] (PDFBOX-2790) Syntax error in DOAP file release section

2015-05-08 Thread Andreas Lehmkühler
Hi, 

@Tilman: thanks for the fast fix.

The DOAP file isn't related to a specific version. It is used to automagically 
provide some information for people.a.o; only the trunk version is needed.

IMHO you might remove the fixed version and close the ticket.

BR, Andreas

On 8 May 2015 at 19:26:59 GMT+01:00, "Tilman Hausherr (JIRA)" wrote:
>
>[
>https://issues.apache.org/jira/browse/PDFBOX-2790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
>]
>
>Tilman Hausherr resolved PDFBOX-2790.
>-
>   Resolution: Fixed
>Fix Version/s: 2.0.0
>
>> Syntax error in DOAP file release section
>> -
>>
>> Key: PDFBOX-2790
>> URL:
>https://issues.apache.org/jira/browse/PDFBOX-2790
>> Project: PDFBox
>>  Issue Type: Bug
>>  Components: Documentation
>>Affects Versions: 2.0.0
>> Environment:
>http://svn.apache.org/repos/asf/pdfbox/trunk/doap_PDFBox.rdf
>>Reporter: Sebb
>>Assignee: Tilman Hausherr
>>  Labels: DOAP
>> Fix For: 2.0.0
>>
>>
>> DOAP files can contain details of multiple release Versions, however
>each must be listed in a separate release section, for example:
>> {code}
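>> <release>
>>   <Version>
>>     <name>Apache XYZ</name>
>>     <created>2015-02-16</created>
>>     <revision>1.6.2</revision>
>>   </Version>
>> </release>
>> <release>
>>   <Version>
>>     <name>Apache XYZ</name>
>>     <created>2014-09-24</created>
>>     <revision>1.6.1</revision>
>>   </Version>
>> </release>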
>> {code}
>> Please can the project DOAP be corrected accordingly?
>> Thanks.
>
>
>


[jira] [Commented] (PDFBOX-2790) Syntax error in DOAP file release section

2015-05-08 Thread Sebb (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535198#comment-14535198
 ] 

Sebb commented on PDFBOX-2790:
--

The DOAP files don't make much sense in code releases, so I don't think there's 
any point in fixing them in branches.

Rather than store the DOAP in a code location, why not store it somewhere else, 
e.g. in the source area for building the site?
The projects file 
https://svn.apache.org/repos/asf/infrastructure/site-tools/trunk/projects/files.xml
 can be changed to pick up the DOAP from somewhere like

http://pdfbox.apache.org/doap_PDFbox.rdf

This has the advantage that any changes to repo URLs won't affect the build of 
the projects.apache.org site.
But of course updates won't be seen unless the site is republished.

> Syntax error in DOAP file release section
> -
>
> Key: PDFBOX-2790
> URL: https://issues.apache.org/jira/browse/PDFBOX-2790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Documentation
>Affects Versions: 2.0.0
> Environment: 
> http://svn.apache.org/repos/asf/pdfbox/trunk/doap_PDFBox.rdf
>Reporter: Sebb
>Assignee: Tilman Hausherr
>  Labels: DOAP
> Fix For: 2.0.0
>
>
> DOAP files can contain details of multiple release Versions, however each 
> must be listed in a separate release section, for example:
> {code}
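> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2015-02-16</created>
>     <revision>1.6.2</revision>
>   </Version>
> </release>
> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2014-09-24</created>
>     <revision>1.6.1</revision>
>   </Version>
> </release>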
> {code}
> Please can the project DOAP be corrected accordingly?
> Thanks.






[jira] [Resolved] (PDFBOX-2790) Syntax error in DOAP file release section

2015-05-08 Thread Tilman Hausherr (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr resolved PDFBOX-2790.
-
   Resolution: Fixed
Fix Version/s: 2.0.0

> Syntax error in DOAP file release section
> -
>
> Key: PDFBOX-2790
> URL: https://issues.apache.org/jira/browse/PDFBOX-2790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Documentation
>Affects Versions: 2.0.0
> Environment: 
> http://svn.apache.org/repos/asf/pdfbox/trunk/doap_PDFBox.rdf
>Reporter: Sebb
>Assignee: Tilman Hausherr
>  Labels: DOAP
> Fix For: 2.0.0
>
>
> DOAP files can contain details of multiple release Versions, however each 
> must be listed in a separate release section, for example:
> {code}
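> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2015-02-16</created>
>     <revision>1.6.2</revision>
>   </Version>
> </release>
> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2014-09-24</created>
>     <revision>1.6.1</revision>
>   </Version>
> </release>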
> {code}
> Please can the project DOAP be corrected accordingly?
> Thanks.






[jira] [Comment Edited] (PDFBOX-2790) Syntax error in DOAP file release section

2015-05-08 Thread Tilman Hausherr (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535159#comment-14535159
 ] 

Tilman Hausherr edited comment on PDFBOX-2790 at 5/8/15 6:26 PM:
-

Now it validates with http://www.w3.org/RDF/Validator/ . (There was also 
another bug)

Should this also be corrected in the next 1.8.* release? That one is outdated: 
the last version it mentions is 1.8.1, and it has the same bugs. Is this file 
important for released jars, or only for the repository?

https://svn.apache.org/repos/asf/pdfbox/branches/1.8/doap_PDFBox.rdf


was (Author: tilman):
Now it validates with http://www.w3.org/RDF/Validator/ . (There was also 
another bug)

Should this also be corrected in the next 1.8.* release? That one is outdated: 
the last version it mentions is 1.8.1, and it has the same bugs. Is this file 
important for released jars, or only for the repository?

> Syntax error in DOAP file release section
> -
>
> Key: PDFBOX-2790
> URL: https://issues.apache.org/jira/browse/PDFBOX-2790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Documentation
>Affects Versions: 2.0.0
> Environment: 
> http://svn.apache.org/repos/asf/pdfbox/trunk/doap_PDFBox.rdf
>Reporter: Sebb
>Assignee: Tilman Hausherr
>  Labels: DOAP
> Fix For: 2.0.0
>
>
> DOAP files can contain details of multiple release Versions, however each 
> must be listed in a separate release section, for example:
> {code}
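> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2015-02-16</created>
>     <revision>1.6.2</revision>
>   </Version>
> </release>
> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2014-09-24</created>
>     <revision>1.6.1</revision>
>   </Version>
> </release>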
> {code}
> Please can the project DOAP be corrected accordingly?
> Thanks.






[jira] [Reopened] (PDFBOX-2790) Syntax error in DOAP file release section

2015-05-08 Thread Tilman Hausherr (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr reopened PDFBOX-2790:
-

Please don't close issues; we set them to resolved if there is a code change 
and close them only after a release. Only non-code-change issues (e.g. 
"duplicate", "not a problem", etc.) may be closed immediately.

> Syntax error in DOAP file release section
> -
>
> Key: PDFBOX-2790
> URL: https://issues.apache.org/jira/browse/PDFBOX-2790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Documentation
>Affects Versions: 2.0.0
> Environment: 
> http://svn.apache.org/repos/asf/pdfbox/trunk/doap_PDFBox.rdf
>Reporter: Sebb
>Assignee: Tilman Hausherr
>  Labels: DOAP
>
> DOAP files can contain details of multiple release Versions, however each 
> must be listed in a separate release section, for example:
> {code}
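> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2015-02-16</created>
>     <revision>1.6.2</revision>
>   </Version>
> </release>
> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2014-09-24</created>
>     <revision>1.6.1</revision>
>   </Version>
> </release>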
> {code}
> Please can the project DOAP be corrected accordingly?
> Thanks.






[jira] [Commented] (PDFBOX-2790) Syntax error in DOAP file release section

2015-05-08 Thread Tilman Hausherr (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535159#comment-14535159
 ] 

Tilman Hausherr commented on PDFBOX-2790:
-

Now it validates with http://www.w3.org/RDF/Validator/ . (There was also 
another bug)

Should this also be corrected in the next 1.8.* release? That one is outdated: 
the last version it mentions is 1.8.1, and it has the same bugs. Is this file 
important for released jars, or only for the repository?

> Syntax error in DOAP file release section
> -
>
> Key: PDFBOX-2790
> URL: https://issues.apache.org/jira/browse/PDFBOX-2790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Documentation
>Affects Versions: 2.0.0
> Environment: 
> http://svn.apache.org/repos/asf/pdfbox/trunk/doap_PDFBox.rdf
>Reporter: Sebb
>Assignee: Tilman Hausherr
>  Labels: DOAP
>
> DOAP files can contain details of multiple release Versions, however each 
> must be listed in a separate release section, for example:
> {code}
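> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2015-02-16</created>
>     <revision>1.6.2</revision>
>   </Version>
> </release>
> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2014-09-24</created>
>     <revision>1.6.1</revision>
>   </Version>
> </release>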
> {code}
> Please can the project DOAP be corrected accordingly?
> Thanks.






[jira] [Closed] (PDFBOX-2790) Syntax error in DOAP file release section

2015-05-08 Thread Sebb (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sebb closed PDFBOX-2790.

Resolution: Fixed

Thanks, looks good now

> Syntax error in DOAP file release section
> -
>
> Key: PDFBOX-2790
> URL: https://issues.apache.org/jira/browse/PDFBOX-2790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Documentation
>Affects Versions: 2.0.0
> Environment: 
> http://svn.apache.org/repos/asf/pdfbox/trunk/doap_PDFBox.rdf
>Reporter: Sebb
>Assignee: Tilman Hausherr
>  Labels: DOAP
>
> DOAP files can contain details of multiple release Versions, however each 
> must be listed in a separate release section, for example:
> {code}
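> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2015-02-16</created>
>     <revision>1.6.2</revision>
>   </Version>
> </release>
> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2014-09-24</created>
>     <revision>1.6.1</revision>
>   </Version>
> </release>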
> {code}
> Please can the project DOAP be corrected accordingly?
> Thanks.






[jira] [Updated] (PDFBOX-2790) Syntax error in DOAP file release section

2015-05-08 Thread Tilman Hausherr (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated PDFBOX-2790:

Description: 
DOAP files can contain details of multiple release Versions, however each must 
be listed in a separate release section, for example:
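{code}
<release>
  <Version>
    <name>Apache XYZ</name>
    <created>2015-02-16</created>
    <revision>1.6.2</revision>
  </Version>
</release>
<release>
  <Version>
    <name>Apache XYZ</name>
    <created>2014-09-24</created>
    <revision>1.6.1</revision>
  </Version>
</release>
{code}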
Please can the project DOAP be corrected accordingly?

Thanks.

  was:
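DOAP files can contain details of multiple release Versions, however each must 
be listed in a separate release section, for example:

<release>
  <Version>
    <name>Apache XYZ</name>
    <created>2015-02-16</created>
    <revision>1.6.2</revision>
  </Version>
</release>
<release>
  <Version>
    <name>Apache XYZ</name>
    <created>2014-09-24</created>
    <revision>1.6.1</revision>
  </Version>
</release>
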
Please can the project DOAP be corrected accordingly?

Thanks.


> Syntax error in DOAP file release section
> -
>
> Key: PDFBOX-2790
> URL: https://issues.apache.org/jira/browse/PDFBOX-2790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Documentation
>Affects Versions: 2.0.0
> Environment: 
> http://svn.apache.org/repos/asf/pdfbox/trunk/doap_PDFBox.rdf
>Reporter: Sebb
>Assignee: Tilman Hausherr
>  Labels: DOAP
>
> DOAP files can contain details of multiple release Versions, however each 
> must be listed in a separate release section, for example:
> {code}
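> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2015-02-16</created>
>     <revision>1.6.2</revision>
>   </Version>
> </release>
> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2014-09-24</created>
>     <revision>1.6.1</revision>
>   </Version>
> </release>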
> {code}
> Please can the project DOAP be corrected accordingly?
> Thanks.






[jira] [Updated] (PDFBOX-2790) Syntax error in DOAP file release section

2015-05-08 Thread Tilman Hausherr (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated PDFBOX-2790:

Labels: DOAP  (was: )

> Syntax error in DOAP file release section
> -
>
> Key: PDFBOX-2790
> URL: https://issues.apache.org/jira/browse/PDFBOX-2790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Documentation
>Affects Versions: 2.0.0
> Environment: 
> http://svn.apache.org/repos/asf/pdfbox/trunk/doap_PDFBox.rdf
>Reporter: Sebb
>Assignee: Tilman Hausherr
>  Labels: DOAP
>
> DOAP files can contain details of multiple release Versions, however each 
> must be listed in a separate release section, for example:
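> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2015-02-16</created>
>     <revision>1.6.2</revision>
>   </Version>
> </release>
> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2014-09-24</created>
>     <revision>1.6.1</revision>
>   </Version>
> </release>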
> Please can the project DOAP be corrected accordingly?
> Thanks.






[jira] [Updated] (PDFBOX-2790) Syntax error in DOAP file release section

2015-05-08 Thread Tilman Hausherr (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated PDFBOX-2790:

Component/s: Documentation

> Syntax error in DOAP file release section
> -
>
> Key: PDFBOX-2790
> URL: https://issues.apache.org/jira/browse/PDFBOX-2790
> Project: PDFBox
>  Issue Type: Bug
>  Components: Documentation
>Affects Versions: 2.0.0
> Environment: 
> http://svn.apache.org/repos/asf/pdfbox/trunk/doap_PDFBox.rdf
>Reporter: Sebb
>Assignee: Tilman Hausherr
>  Labels: DOAP
>
> DOAP files can contain details of multiple release Versions, however each 
> must be listed in a separate release section, for example:
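> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2015-02-16</created>
>     <revision>1.6.2</revision>
>   </Version>
> </release>
> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2014-09-24</created>
>     <revision>1.6.1</revision>
>   </Version>
> </release>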
> Please can the project DOAP be corrected accordingly?
> Thanks.






[jira] [Updated] (PDFBOX-2790) Syntax error in DOAP file release section

2015-05-08 Thread Tilman Hausherr (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated PDFBOX-2790:

Affects Version/s: 2.0.0

> Syntax error in DOAP file release section
> -
>
> Key: PDFBOX-2790
> URL: https://issues.apache.org/jira/browse/PDFBOX-2790
> Project: PDFBox
>  Issue Type: Bug
>Affects Versions: 2.0.0
> Environment: 
> http://svn.apache.org/repos/asf/pdfbox/trunk/doap_PDFBox.rdf
>Reporter: Sebb
>Assignee: Tilman Hausherr
>
> DOAP files can contain details of multiple release Versions, however each 
> must be listed in a separate release section, for example:
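> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2015-02-16</created>
>     <revision>1.6.2</revision>
>   </Version>
> </release>
> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2014-09-24</created>
>     <revision>1.6.1</revision>
>   </Version>
> </release>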
> Please can the project DOAP be corrected accordingly?
> Thanks.






[jira] [Commented] (PDFBOX-2790) Syntax error in DOAP file release section

2015-05-08 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535141#comment-14535141
 ] 

ASF subversion and git services commented on PDFBOX-2790:
-

Commit 1678410 from [~tilman] in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1678410 ]

PDFBOX-2790: fix syntax errors in DOAP file, as suggested by Sebb

> Syntax error in DOAP file release section
> -
>
> Key: PDFBOX-2790
> URL: https://issues.apache.org/jira/browse/PDFBOX-2790
> Project: PDFBox
>  Issue Type: Bug
>Affects Versions: 2.0.0
> Environment: 
> http://svn.apache.org/repos/asf/pdfbox/trunk/doap_PDFBox.rdf
>Reporter: Sebb
>Assignee: Tilman Hausherr
>
> DOAP files can contain details of multiple release Versions, however each 
> must be listed in a separate release section, for example:
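> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2015-02-16</created>
>     <revision>1.6.2</revision>
>   </Version>
> </release>
> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2014-09-24</created>
>     <revision>1.6.1</revision>
>   </Version>
> </release>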
> Please can the project DOAP be corrected accordingly?
> Thanks.






[jira] [Assigned] (PDFBOX-2790) Syntax error in DOAP file release section

2015-05-08 Thread Tilman Hausherr (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-2790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr reassigned PDFBOX-2790:
---

Assignee: Tilman Hausherr

> Syntax error in DOAP file release section
> -
>
> Key: PDFBOX-2790
> URL: https://issues.apache.org/jira/browse/PDFBOX-2790
> Project: PDFBox
>  Issue Type: Bug
> Environment: 
> http://svn.apache.org/repos/asf/pdfbox/trunk/doap_PDFBox.rdf
>Reporter: Sebb
>Assignee: Tilman Hausherr
>
> DOAP files can contain details of multiple release Versions, however each 
> must be listed in a separate release section, for example:
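> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2015-02-16</created>
>     <revision>1.6.2</revision>
>   </Version>
> </release>
> <release>
>   <Version>
>     <name>Apache XYZ</name>
>     <created>2014-09-24</created>
>     <revision>1.6.1</revision>
>   </Version>
> </release>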
> Please can the project DOAP be corrected accordingly?
> Thanks.






[jira] [Created] (PDFBOX-2790) Syntax error in DOAP file release section

2015-05-08 Thread Sebb (JIRA)
Sebb created PDFBOX-2790:


 Summary: Syntax error in DOAP file release section
 Key: PDFBOX-2790
 URL: https://issues.apache.org/jira/browse/PDFBOX-2790
 Project: PDFBox
  Issue Type: Bug
 Environment: 
http://svn.apache.org/repos/asf/pdfbox/trunk/doap_PDFBox.rdf
Reporter: Sebb


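DOAP files can contain details of multiple release Versions, however each must 
be listed in a separate release section, for example:

<release>
  <Version>
    <name>Apache XYZ</name>
    <created>2015-02-16</created>
    <revision>1.6.2</revision>
  </Version>
</release>
<release>
  <Version>
    <name>Apache XYZ</name>
    <created>2014-09-24</created>
    <revision>1.6.1</revision>
  </Version>
</release>
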
Please can the project DOAP be corrected accordingly?

Thanks.
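
The rule described above (one Version per release section) can be checked 
mechanically. What follows is only a minimal sketch and not part of the 
original report: it assumes the DOAP vocabulary's element names (release, 
Version) in the http://usefulinc.com/ns/doap# namespace, and the function 
name, helper and sample documents are hypothetical.

```python
import xml.etree.ElementTree as ET

# Namespace of the DOAP vocabulary; the element names used below are assumed from it.
DOAP_NS = "http://usefulinc.com/ns/doap#"


def misplaced_release_indexes(doap_xml):
    """Return indexes of release elements that do not hold exactly one Version."""
    # ET.fromstring raises ParseError if the document is not well-formed XML.
    root = ET.fromstring(doap_xml)
    bad = []
    for index, release in enumerate(root.iter("{%s}release" % DOAP_NS)):
        versions = release.findall("{%s}Version" % DOAP_NS)
        if len(versions) != 1:
            bad.append(index)
    return bad


def make_doap(release_sections):
    """Wrap hypothetical release sections in a minimal DOAP project document."""
    return (
        '<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"'
        ' xmlns="%s">'
        '<Project rdf:about="http://example.org/xyz">%s</Project>'
        "</rdf:RDF>" % (DOAP_NS, release_sections)
    )


VERSION_162 = (
    "<Version><name>Apache XYZ</name>"
    "<created>2015-02-16</created><revision>1.6.2</revision></Version>"
)
VERSION_161 = (
    "<Version><name>Apache XYZ</name>"
    "<created>2014-09-24</created><revision>1.6.1</revision></Version>"
)

# One Version per release section: the layout the report asks for.
SEPARATE = make_doap(
    "<release>%s</release><release>%s</release>" % (VERSION_162, VERSION_161)
)
# Two Versions sharing one release section: presumably the layout being objected to.
COMBINED = make_doap("<release>%s%s</release>" % (VERSION_162, VERSION_161))

print(misplaced_release_indexes(SEPARATE))
print(misplaced_release_indexes(COMBINED))
```

This only covers well-formedness and the one-Version-per-release rule; it is 
not a substitute for a full RDF validator.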






Re: svn commit: r950554 - in /websites/staging/pdfbox/trunk/content: ./ 1.8/ 1.8/cookbook/ 2.0/ FontAwesome/ docs/1.8.2/ errors/

2015-05-08 Thread Maruan Sahyoun
Hi,

> On 08.05.2015 at 17:38, Tilman Hausherr wrote:
> 
> On 08.05.2015 at 10:28, build...@apache.org wrote:
>> Modified: websites/staging/pdfbox/trunk/content/FontAwesome/README.html
> 
> What is this?
> 
> https://pdfbox.apache.org/FontAwesome/
> https://pdfbox.apache.org/FontAwesome/docs/
> 

I removed the unneeded files and disabled directory listing in 950602

BR
Maruan

> 
> 
> 





Re: svn commit: r950554 - in /websites/staging/pdfbox/trunk/content: ./ 1.8/ 1.8/cookbook/ 2.0/ FontAwesome/ docs/1.8.2/ errors/

2015-05-08 Thread Maruan Sahyoun
It's a webfont for icons; I'm removing the parts that aren't needed.

BR
Maruan


> On 08.05.2015 at 17:38, Tilman Hausherr wrote:
> 
> On 08.05.2015 at 10:28, build...@apache.org wrote:
>> Modified: websites/staging/pdfbox/trunk/content/FontAwesome/README.html
> 
> What is this?
> 
> https://pdfbox.apache.org/FontAwesome/
> https://pdfbox.apache.org/FontAwesome/docs/
> 
> 
> 
> 





[jira] [Commented] (PDFBOX-2788) Seemingly good document gets semi-corrupted

2015-05-08 Thread Tilman Hausherr (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534753#comment-14534753
 ] 

Tilman Hausherr commented on PDFBOX-2788:
-

There is no roadmap, this isn't a commercial project. The plan is to release 
2.0 when the issues have been resolved
https://issues.apache.org/jira/issues/?jql=project%20%3D%20PDFBOX%20AND%20fixVersion%20%3D%202.0.0%20AND%20resolution%20%3D%20Unresolved%20ORDER%20BY%20updated%20DESC%2C%20due%20ASC%2C%20priority%20DESC%2C%20created%20ASC
or at least the blocker issues, but this hasn't happened :-(

The 1.8.* versions are usually updated about every three months. So you'd 
either have to wait, or change your own source code so that you derive a new 
class from PDPageContentStream whose constructor doesn't have the problem that 
I solved. Another possibility would be to detect files with global resources 
and not do the modification there.

> Seemingly good document gets semi-corrupted
> ---
>
> Key: PDFBOX-2788
> URL: https://issues.apache.org/jira/browse/PDFBOX-2788
> Project: PDFBox
>  Issue Type: Bug
>  Components: Writing
>Affects Versions: 1.8.9
> Environment: Ubuntu Linux 14.04, Java SE 7
>Reporter: Are Husby
>Assignee: Tilman Hausherr
> Fix For: 1.8.10
>
> Attachments: Reproduce possible bug in PdfBox.zip, 
> output_1430933423628-better.pdf
>
>
> I use PdfBox to insert a little bit of text at the top of my PDF-documents. I 
> have found one case (one input document)  where the resulting document 
> appears to become semi-corrupted by PdfBox.
> I will try to attach to this Jira issue a zip file with the PDF document in 
> question and a small, self-contained Java application that allows you to 
> easily reproduce the problem.
> The text I try to insert is inserted okay and is not the problem. The problem 
> is that other parts of the documents seem to get destroyed. You see this by 
> comparing the original document with the processed document in a PDF document 
> viewer.
> The problem manifests itself in different ways depending on which PDF 
> document viewer application I use. I have tried Evince (comes as default on 
> Ubuntu Linux 14.04), Firefox (also as default in Ubuntu), Google Chrome, and 
> Adobe Acrobat Reader v.11 (both on Windows 7 Enterprise and in Ubuntu with 
> Wine, the Windows emulator).
> If you use, for example, Adobe Acrobat Reader, look in particular at the logo 
> image in the upper right corner of both pages, the fonts, and the formatting 
> of the line near the bottom of the second page that says 
> "Fakturasum: 2 572,50".






Re: svn commit: r950554 - in /websites/staging/pdfbox/trunk/content: ./ 1.8/ 1.8/cookbook/ 2.0/ FontAwesome/ docs/1.8.2/ errors/

2015-05-08 Thread Tilman Hausherr

On 08.05.2015 at 10:28, build...@apache.org wrote:

Modified: websites/staging/pdfbox/trunk/content/FontAwesome/README.html


What is this?

https://pdfbox.apache.org/FontAwesome/
https://pdfbox.apache.org/FontAwesome/docs/






[jira] [Commented] (PDFBOX-2785) Support filling in landscape oriented AcroForms

2015-05-08 Thread Maruan Sahyoun (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534575#comment-14534575
 ] 

Maruan Sahyoun commented on PDFBOX-2785:


Could you attach the empty form template too? Some values are OK, e.g. 
DATDELIVRY. I would like to see whether there is something special about the 
field definition in the template.

> Support filling in landscape oriented AcroForms
> ---
>
> Key: PDFBOX-2785
> URL: https://issues.apache.org/jira/browse/PDFBOX-2785
> Project: PDFBox
>  Issue Type: Improvement
>  Components: AcroForm
>Affects Versions: 1.8.9, 2.0.0
>Reporter: Maruan Sahyoun
> Attachments: newPdfCodeBarre.pdf
>
>
> From the users mailing list 
> {quote}
> Hello,
> when you add an AcroForm to a document that has a 90 degree orientation, does 
> the form automatically have a 90 degree orientation?
> What about its fields?
> My problem with the code below is that the fields' data are printed 
> vertically instead of horizontally.
> {quote}
> And another description on Stack Overflow:
> http://stackoverflow.com/questions/16952710/filling-landscape-pdf-with-pdfbox






[jira] [Commented] (PDFBOX-2785) Support filling in landscape oriented AcroForms

2015-05-08 Thread Philippe de Rochambeau (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534510#comment-14534510
 ] 

Philippe de Rochambeau commented on PDFBOX-2785:


All data.







[jira] [Commented] (PDFBOX-2788) Seemingly good document gets semi-corrupted

2015-05-08 Thread Are Husby (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534405#comment-14534405
 ] 

Are Husby commented on PDFBOX-2788:
---

Thanks for the rapid response. The organization I work for is reluctant to use any 
libraries (especially snapshot versions) that are not properly and officially 
released to the Maven Central Repository. Is there a roadmap for DropWizard or 
an estimate for when version 2 will be officially released?

> Seemingly good document gets semi-corrupted
> ---
>
> Key: PDFBOX-2788
> URL: https://issues.apache.org/jira/browse/PDFBOX-2788
> Project: PDFBox
>  Issue Type: Bug
>  Components: Writing
>Affects Versions: 1.8.9
> Environment: Ubuntu Linux 14.04, Java SE 7
>Reporter: Are Husby
>Assignee: Tilman Hausherr
> Fix For: 1.8.10
>
> Attachments: Reproduce possible bug in PdfBox.zip, 
> output_1430933423628-better.pdf
>
>
> I use PDFBox to insert a little bit of text at the top of my PDF documents. I 
> have found one case (one input document) where the resulting document 
> appears to become semi-corrupted by PDFBox.
> I will try to attach to this Jira issue a zip file with the PDF document in 
> question and a small, self-contained Java application that allows you to easily 
> reproduce the problem.
> The text I try to insert is inserted okay and is not the problem. The problem 
> is that other parts of the document seem to get destroyed. You can see this by 
> comparing the original document with the processed document in a PDF document 
> viewer.
> The problem manifests itself in different ways depending on which PDF 
> document viewer application I use. I have tried Evince (comes as default on 
> Ubuntu Linux 14.04), Firefox (also as default in Ubuntu), Google Chrome, and 
> Adobe Acrobat Reader v.11 (both on Windows 7 Enterprise and in Ubuntu with 
> Wine, the Windows emulator).
> If you use, for example, Adobe Acrobat Reader, look in particular at the logo 
> image in the upper right corner of both pages, the fonts, and the formatting 
> of the line on the second page near the bottom that says "Fakturasum: 2 
> 572,50".






[jira] [Commented] (PDFBOX-2785) Support filling in landscape oriented AcroForms

2015-05-08 Thread Maruan Sahyoun (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-2785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14534349#comment-14534349
 ] 

Maruan Sahyoun commented on PDFBOX-2785:


[~phiroc] Has all data been set using PDFBox, or only some?



