[jira] Created: (PDFBOX-915) some pdf file for chinese can't extracted by correct encode

2010-12-07 Thread chenlong (JIRA)
some pdf file for chinese can't extracted by correct encode Key: PDFBOX-915 URL: https://issues.apache.org/jira/browse/PDFBOX-915 Project: PDFBox Issue Type: Bug

[jira] Updated: (PDFBOX-915) some pdf file for chinese can't extracted by correct encode

2010-12-07 Thread chenlong (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] chenlong updated PDFBOX-915: Attachment: 821-2302.pdf Please help, this file can't be extracted correctly. some pdf file for chinese can't

Problems with colors when extracting CMYK images from pdf files.

2010-12-07 Thread Pontus Hulin
Hello, I have a problem extracting images from PDF files using the code below. All images in RGB format look OK, but images in CMYK format do not. What can I do to fix this problem? Do I need to convert the image after export, or can I do it during export? Best regards / Pontus

[jira] Reopened: (PDFBOX-909) Add support for a 6 element matrix

2010-12-07 Thread Jeremias Maerki (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeremias Maerki reopened PDFBOX-909: Andreas, are you sure these changes are right? Right now, I have a case reported to me by a

[jira] Commented: (PDFBOX-909) Add support for a 6 element matrix

2010-12-07 Thread JIRA
[ https://issues.apache.org/jira/browse/PDFBOX-909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968698#action_12968698 ] Andreas Lehmkühler commented on PDFBOX-909: I'm pretty sure that my changes are

RE: 1.3.2 release?

2010-12-07 Thread Martinez, Mel - 1004 - MITLL
+1 Next Thursday works for me. Thanks, Andreas! -Mel -Original Message- From: Andreas Lehmkuehler [mailto:andr...@lehmi.de] Sent: Tuesday, December 07, 2010 2:17 AM To: dev@pdfbox.apache.org Subject: Re: 1.3.2 release? Hi, Am 06.12.2010 14:36, schrieb Jukka Zitting: Hi, On

[jira] Created: (PDFBOX-916) Create embedded index

2010-12-07 Thread nielsen (JIRA)
Create embedded index - Key: PDFBOX-916 URL: https://issues.apache.org/jira/browse/PDFBOX-916 Project: PDFBox Issue Type: New Feature Components: Utilities Affects Versions: 1.3.1 Environment:

[jira] Resolved: (PDFBOX-909) Add support for a 6 element matrix

2010-12-07 Thread JIRA
[ https://issues.apache.org/jira/browse/PDFBOX-909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler resolved PDFBOX-909. Resolution: Fixed. I added the proposed fix in revision 1043175. It uses the same

[jira] Commented: (PDFBOX-521) Improved PDF Text Extraction that notes paragraph boundaries

2010-12-07 Thread JIRA
[ https://issues.apache.org/jira/browse/PDFBOX-521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968930#action_12968930 ] Andreas Lehmkühler commented on PDFBOX-521: Ted, sounds interesting. There are

Re: Conforming parser

2010-12-07 Thread martijn.list
I'm sorry I cannot help you with the startxref issue, but I have some thoughts about parsing non-conforming PDFs. Any other suggestions, words of warning, etc.? Like, how should I deal with violations of the spec? I think it's important to gracefully handle non-conforming PDFs. Currently PDFBox