Hi, I have tried your command with the given PDF, the extraction succeeded on my environment. I'm under a Fedora with an OpenJDK (version 1.6.0_22 64Bits)
Could you give us some details about the configuration of your environment? Best regards, Eric 2012/10/19 Peter Williams <[email protected]> > Hi, > > Your web page seemed to say that bugs should be reported by emailing this > address. > > Steps to Reproduce > > Download GAM-OptimalScaling2.pdf from > http://www.math.vu.nl/sto/onderwijs/statlearn/GAM-OptimalScaling2.pdf > > java -jar pdfbox-app-1.7.1.jar ExtractText -sort GAM-OptimalScaling2.pdf > GAM-OptimalScaling2.sorted.txt > ExtractText failed with the following exception: > java.lang.IllegalArgumentException: Comparison method violates its general > contract! > at java.util.TimSort.mergeHi(Unknown Source) > at java.util.TimSort.mergeAt(Unknown Source) > at java.util.TimSort.mergeCollapse(Unknown Source) > at java.util.TimSort.sort(Unknown Source) > at java.util.TimSort.sort(Unknown Source) > at java.util.Arrays.sort(Unknown Source) > at java.util.Collections.sort(Unknown Source) > at > org.apache.pdfbox.util.PDFTextStripper.writePage(PDFTextStripper.java:558) > at > > org.apache.pdfbox.util.PDFTextStripper.processPage(PDFTextStripper.java:449) > at > > org.apache.pdfbox.util.PDFTextStripper.processPages(PDFTextStripper.java:372) > at > org.apache.pdfbox.util.PDFTextStripper.writeText(PDFTextStripper.java:328) > at > org.apache.pdfbox.ExtractText.startExtraction(ExtractText.java:274) > at org.apache.pdfbox.ExtractText.main(ExtractText.java:84) > at org.apache.pdfbox.PDFBox.main(PDFBox.java:42) > > > ---------------------------------------------- > Peter Williams > 0488 783 700 / +61 488 783 700 >

