[jira] [Updated] (PDFBOX-2377) Apparent regression in character mapping in a few files from govdocs1

2014-09-25 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tilman Hausherr updated PDFBOX-2377: Attachment: 357094-1.8.8.txt 357094-1.8.6.txt 357094.pdf Sa

[jira] [Closed] (PDFBOX-2259) PDFTextStripper has problem with semi-space characters

2014-09-25 Thread John Hewson (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Hewson closed PDFBOX-2259. --- Resolution: Not a Problem I took another look at this PDF, the "marked content" does not include any

Jenkins build is back to stable : PDFBox-trunk » Apache PDFBox #1305

2014-09-25 Thread Apache Jenkins Server
See

Jenkins build is back to stable : PDFBox-trunk » Apache PDFBox tools #1305

2014-09-25 Thread Apache Jenkins Server
See

Jenkins build is back to stable : PDFBox-trunk #1305

2014-09-25 Thread Apache Jenkins Server
See

[jira] [Commented] (PDFBOX-2380) Glyphlist .properties are not ordered

2014-09-25 Thread ASF subversion and git services (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14148640#comment-14148640 ] ASF subversion and git services commented on PDFBOX-2380: - Commit

[jira] [Commented] (PDFBOX-2380) Glyphlist .properties are not ordered

2014-09-25 Thread ASF subversion and git services (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14148639#comment-14148639 ] ASF subversion and git services commented on PDFBOX-2380: - Commit

Jenkins build became unstable: PDFBox-trunk #1304

2014-09-25 Thread Apache Jenkins Server
See

Jenkins build became unstable: PDFBox-trunk » Apache PDFBox tools #1304

2014-09-25 Thread Apache Jenkins Server
See

Jenkins build became unstable: PDFBox-trunk » Apache PDFBox #1304

2014-09-25 Thread Apache Jenkins Server
See

[jira] [Commented] (PDFBOX-2372) Trash Glyphs: Regressions 19.9.2014

2014-09-25 Thread John Hewson (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14148426#comment-14148426 ] John Hewson commented on PDFBOX-2372: - Finally :) > Trash Glyphs: Regressions 19.9.2

[jira] [Comment Edited] (PDFBOX-2380) Glyphlist .properties are not ordered

2014-09-25 Thread John Hewson (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14148419#comment-14148419 ] John Hewson edited comment on PDFBOX-2380 at 9/25/14 10:44 PM:

[jira] [Commented] (PDFBOX-2380) Glyphlist .properties are not ordered

2014-09-25 Thread John Hewson (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14148419#comment-14148419 ] John Hewson commented on PDFBOX-2380: - I've refactored the way that glyph lists are l

[jira] [Commented] (PDFBOX-2380) Glyphlist .properties are not ordered

2014-09-25 Thread ASF subversion and git services (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14148410#comment-14148410 ] ASF subversion and git services commented on PDFBOX-2380: - Commit

[jira] [Updated] (PDFBOX-1094) Pattern colorspace support

2014-09-25 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-1094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tilman Hausherr updated PDFBOX-1094: Attachment: gs-bugzilla694385.pdf gs-bugzilla692503.ai gs-bu

[jira] [Commented] (PDFBOX-1094) Pattern colorspace support

2014-09-25 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-1094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14148267#comment-14148267 ] Tilman Hausherr commented on PDFBOX-1094: - A few files are clearly better (e.g. p

[jira] [Resolved] (PDFBOX-2372) Trash Glyphs: Regressions 19.9.2014

2014-09-25 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tilman Hausherr resolved PDFBOX-2372. - Resolution: Fixed > Trash Glyphs: Regressions 19.9.2014 > ---

[jira] [Commented] (PDFBOX-2372) Trash Glyphs: Regressions 19.9.2014

2014-09-25 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14148232#comment-14148232 ] Tilman Hausherr commented on PDFBOX-2372: - I probably mixed up two files, the "ß"

[jira] [Updated] (PDFBOX-2376) Small regression in text extraction with PDFBox 1.8.7 vs. 1.8.6

2014-09-25 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2376?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tilman Hausherr updated PDFBOX-2376: Fix Version/s: 2.0.0 1.8.8 > Small regression in text extraction with PD

Jenkins build is back to normal : PDFBox-trunk » PDFBox parent #1303

2014-09-25 Thread Apache Jenkins Server
See

Jenkins build is back to normal : PDFBox-trunk #1303

2014-09-25 Thread Apache Jenkins Server
See

Build failed in Jenkins: PDFBox-trunk » PDFBox parent #1302

2014-09-25 Thread Apache Jenkins Server
See -- [...truncated 824 lines...] Downloaded: http://repo.maven.apache.org/maven2/org/apache/maven/doxia/doxia-sink-api/1.4/doxia-sink-api-1.4.jar (11 KB at 683.8 KB/sec)

Build failed in Jenkins: PDFBox-trunk #1302

2014-09-25 Thread Apache Jenkins Server
See Changes: [jahewson] PDFBOX-2372: Fix: Load symbolic/non-symbolic flags from AFM into FontDescriptor [jahewson] PDFBOX-2372: Fix: don't use Standard 14 mapping without encoding -- [...truncated

[jira] [Commented] (PDFBOX-2350) Type1 Parser hangs indefinitely

2014-09-25 Thread ASF subversion and git services (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14148158#comment-14148158 ] ASF subversion and git services commented on PDFBOX-2350: - Commit

[jira] [Commented] (PDFBOX-2350) Type1 Parser hangs indefinitely

2014-09-25 Thread John Hewson (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14148159#comment-14148159 ] John Hewson commented on PDFBOX-2350: - [~daniel.scheibe] I've applied a minimal versi

[jira] [Assigned] (PDFBOX-2350) Type1 Parser hangs indefinitely

2014-09-25 Thread John Hewson (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Hewson reassigned PDFBOX-2350: --- Assignee: John Hewson > Type1 Parser hangs indefinitely > --- >

[jira] [Updated] (PDFBOX-2372) Trash Glyphs: Regressions 19.9.2014

2014-09-25 Thread John Hewson (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Hewson updated PDFBOX-2372: Summary: Trash Glyphs: Regressions 19.9.2014 (was: Regressions 19.9.2014) > Trash Glyphs: Regressi

[jira] [Commented] (PDFBOX-2372) Regressions 19.9.2014

2014-09-25 Thread ASF subversion and git services (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14148102#comment-14148102 ] ASF subversion and git services commented on PDFBOX-2372: - Commit

[jira] [Commented] (PDFBOX-2372) Regressions 19.9.2014

2014-09-25 Thread John Hewson (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14148103#comment-14148103 ] John Hewson commented on PDFBOX-2372: - These should be fixed now, except for: {quote

[jira] [Commented] (PDFBOX-2372) Regressions 19.9.2014

2014-09-25 Thread ASF subversion and git services (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14148100#comment-14148100 ] ASF subversion and git services commented on PDFBOX-2372: - Commit

[jira] [Commented] (PDFBOX-2381) BaseParser - IOException: Push back buffer is full

2014-09-25 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14148051#comment-14148051 ] Tilman Hausherr commented on PDFBOX-2381: - Can happen if a stream with wrong leng

Re: Arabic compound characters not recognized by pdfbox

2014-09-25 Thread John Hewson
Hi Ahmet We’re currently looking into a similar problem https://issues.apache.org/jira/browse/PDFBOX-2259 If you think this is the *exact* same problem that you’re seeing, please attach your PDF file to that JIRA issue, if not then please open a new JIRA issue and attach your file. (You can at

[jira] [Created] (PDFBOX-2381) BaseParser - IOException: Push back buffer is full

2014-09-25 Thread John Hewson (JIRA)
John Hewson created PDFBOX-2381: --- Summary: BaseParser - IOException: Push back buffer is full Key: PDFBOX-2381 URL: https://issues.apache.org/jira/browse/PDFBOX-2381 Project: PDFBox Issue Type:

[jira] [Updated] (PDFBOX-2320) IOException: Could not read embedded TTF for font TimesNewRoman

2014-09-25 Thread John Hewson (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Hewson updated PDFBOX-2320: Attachment: TEST_SetCharSpacing_Error.pdf > IOException: Could not read embedded TTF for font Times

[jira] [Updated] (PDFBOX-2320) IOException: Could not read embedded TTF for font TimesNewRoman

2014-09-25 Thread John Hewson (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Hewson updated PDFBOX-2320: Description: java -jar ~/pdf-box-svn/app/target/pdfbox-app-2.0.0-SNAPSHOT.jar PDFToImage -nonSeq T

[jira] [Updated] (PDFBOX-52) DCTFilter is not implemented yet

2014-09-25 Thread John Hewson (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-52?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Hewson updated PDFBOX-52: -- Attachment: amyuni2_05d__pdf1_3_acro4x.pdf > DCTFilter is not implemented yet > -

[jira] [Resolved] (PDFBOX-52) DCTFilter is not implemented yet

2014-09-25 Thread John Hewson (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-52?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Hewson resolved PDFBOX-52. --- Resolution: Fixed > DCTFilter is not implemented yet > > >

[jira] [Reopened] (PDFBOX-52) DCTFilter is not implemented yet

2014-09-25 Thread John Hewson (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-52?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Hewson reopened PDFBOX-52: --- Assignee: (was: Tilman Hausherr) Re-opening to attach PDF file > DCTFilter is not implemented ye

[jira] [Updated] (PDFBOX-2350) Type1 Parser hangs indefinitely

2014-09-25 Thread John Hewson (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] John Hewson updated PDFBOX-2350: Attachment: bad_length1.pdf I've attached a PDF file which I altered with my hex editor to have a L

[jira] [Comment Edited] (PDFBOX-2350) Type1 Parser hangs indefinitely

2014-09-25 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14147658#comment-14147658 ] Tilman Hausherr edited comment on PDFBOX-2350 at 9/25/14 3:45 PM: -

[jira] [Comment Edited] (PDFBOX-2350) Type1 Parser hangs indefinitely

2014-09-25 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14147671#comment-14147671 ] Tilman Hausherr edited comment on PDFBOX-2350 at 9/25/14 3:45 PM: -

Arabic compound characters not recognized by pdfbox

2014-09-25 Thread Ahmet Aker
Hi, I am using pdfBox (1.8.6) for converting Arabic pdf files (not images of texts but real texts) to html. PdfBox works really good in most cases however, it does have problems in recognizing compound characters. I am attaching you a sample pdf file. In that e.g. I get الفغاني but I should be ge

[jira] [Commented] (PDFBOX-2350) Type1 Parser hangs indefinitely

2014-09-25 Thread Daniel Scheibe (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14147713#comment-14147713 ] Daniel Scheibe commented on PDFBOX-2350: Did a bit more of testing and if we want

[jira] [Commented] (PDFBOX-2350) Type1 Parser hangs indefinitely

2014-09-25 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14147671#comment-14147671 ] Tilman Hausherr commented on PDFBOX-2350: - Even the nonSeq parser can't help if,

[jira] [Commented] (PDFBOX-2350) Type1 Parser hangs indefinitely

2014-09-25 Thread Daniel Scheibe (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14147666#comment-14147666 ] Daniel Scheibe commented on PDFBOX-2350: I don't see a difference in using load v

[jira] [Commented] (PDFBOX-2350) Type1 Parser hangs indefinitely

2014-09-25 Thread Daniel Scheibe (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14147663#comment-14147663 ] Daniel Scheibe commented on PDFBOX-2350: Ah this might be important i'm doing my

[jira] [Commented] (PDFBOX-2350) Type1 Parser hangs indefinitely

2014-09-25 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14147658#comment-14147658 ] Tilman Hausherr commented on PDFBOX-2350: - The nonseq parser does use the /length

[jira] [Commented] (PDFBOX-2350) Type1 Parser hangs indefinitely

2014-09-25 Thread Daniel Scheibe (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14147655#comment-14147655 ] Daniel Scheibe commented on PDFBOX-2350: [~tilman] no my trick does only seem to

[jira] [Commented] (PDFBOX-2350) Type1 Parser hangs indefinitely

2014-09-25 Thread Daniel Scheibe (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14147639#comment-14147639 ] Daniel Scheibe commented on PDFBOX-2350: [~jahewson] using PDFDebugger i was able

[jira] [Commented] (PDFBOX-2380) Glyphlist .properties are not ordered

2014-09-25 Thread JIRA
[ https://issues.apache.org/jira/browse/PDFBOX-2380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14147599#comment-14147599 ] Andreas Lehmkühler commented on PDFBOX-2380: I've create LEGAL-208 to ask for

[jira] [Commented] (PDFBOX-2350) Type1 Parser hangs indefinitely

2014-09-25 Thread Daniel Scheibe (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14147568#comment-14147568 ] Daniel Scheibe commented on PDFBOX-2350: In the meantime: http://feliam.wordpre

[jira] [Comment Edited] (PDFBOX-2372) Regressions 19.9.2014

2014-09-25 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14147464#comment-14147464 ] Tilman Hausherr edited comment on PDFBOX-2372 at 9/25/14 7:13 AM: -

[jira] [Comment Edited] (PDFBOX-2372) Regressions 19.9.2014

2014-09-25 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-2372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14147464#comment-14147464 ] Tilman Hausherr edited comment on PDFBOX-2372 at 9/25/14 7:12 AM: -