Hmm, Ok, trying to reduce this to the simplest test case, I ran podofotxtextract.exe on the pdf.
C:\Users\frank\mydev\test_artifacts>podofotxtextract.exe C:\tmp\Math_Standards.pdf <</ID[<9E60894142914DC68917FB9871F53C8E><67EAD78FF7364ACB9B039E9B1F9E41F5>]/Info 320 0 R/Root 1 0 R/Size 400>> Error: An error 40 ocurred during processing the pdf file. PoDoFo encounter an error. Error: 40 ePdfError_Flate Error Description: ZLib returned an error. Callstack: #0 Error Source: ..\..\podofo-0.9.3\src\base\PdfFiltersPrivate.cpp:549 Any thoughts ?? Cheers / Frank On Oct 22, 2015, at 10:28 AM, Frank Singleton <fsingle...@iscanonline.com<mailto:fsingle...@iscanonline.com>> wrote: Hi, First of all, thanks for PoDoFo. Anyone who seeks to reliably parse (among other things) PDF documents is truly brave ;-) So, I downloaded podofo-0.9.3 and have it compiled on Fedora, Mac and (after some juggling) also Windows 7 Ultimate with CMake and VS 2010. Test PDF location: www.nasa.gov/pdf/703154main_earth_art-ebook.pdf<http://www.nasa.gov/pdf/703154main_earth_art-ebook.pdf> www.corestandards.org/wp-content/uploads/Math_Standards.pd<http://www.corestandards.org/wp-content/uploads/Math_Standards.pd> Example Mac Usage $ ./bin/podofopdfinfo D ~/vmshared/703154main_earth_art-ebook.pdf Document Info ------------- File: /Users/frank/vmshared/703154main_earth_art-ebook.pdf PDF Version: 1.6 Page Count: 172 Page Size: 720 x 720 pts Fast Web View Enabled: Yes Tagged: Yes Encrypted: No Printing Allowed: Yes Modification Allowed: Yes Copy&Paste Allowed: Yes Add/Modify Annotations Allowed: Yes Fill&Sign Allowed: Yes Accessibility Allowed: Yes Document Assembly Allowed: Yes High Quality Print Allowed: Yes Example Windows Usage on same file podofopdfinfo.exe D c:\tmp\703154main_earth_art-ebook.pdf Error: An error 40 ocurred during uncompressing the pdf file. PoDoFo encounter an error. Error: 40 ePdfError_Flate Error Description: ZLib returned an error. Callstack: #0 Error Source: ..\..\podofo-0.9.3\src\base\PdfParser.cpp:226 Information: Unable to load objects from file. #1 Error Source: ..\..\podofo-0.9.3\src\base\PdfParser.cpp:326 Information: Unable to read linearized XRef section. #2 Error Source: ..\..\podofo-0.9.3\src\base\PdfFiltersPrivate.cpp:549 Example windows usage on another PDF file works. podofopdfinfo.exe D c:\tmp\Math_Standards.pdf Document Info ------------- File: c:\tmp\Math_Standards.pdf PDF Version: 1.3 Page Count: 93 Page Size: 612 x 792 pts Fast Web View Enabled: No Tagged: No Encrypted: No Printing Allowed: Yes Modification Allowed: Yes Copy&Paste Allowed: Yes Add/Modify Annotations Allowed: Yes Fill&Sign Allowed: Yes Accessibility Allowed: Yes Document Assembly Allowed: Yes High Quality Print Allowed: Yes I used the following artifacts during compilation cmake 2.8.12.2 freetype 2.6 zlib128 lpng1618 podofo-0.9.3 VS2010 I notice also there have been quite a few commits between the 0.9.3 release and the present day. So I have some questions. Q1. Any thoughts on why the first windows PDF example fails? Unable to read linearized XRef section ?? Q2. Should I try a later version of podofo Q3. How stable or not is trunk (LOL) Q4. Recommendations on how to solve this issue for Windows. Q5. Is anyone else able to parse this failing PDF on Windows with Cmake/VS2010. If so, what versions of the required library artifacts are being used? Thanks for your time. Cheers / Frank ------------------------------------------------------------------------------ _______________________________________________ Podofo-users mailing list Podofo-users@lists.sourceforge.net<mailto:Podofo-users@lists.sourceforge.net> https://lists.sourceforge.net/lists/listinfo/podofo-users
------------------------------------------------------------------------------
_______________________________________________ Podofo-users mailing list Podofo-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/podofo-users