[ https://issues.apache.org/jira/browse/TIKA-758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13404833#comment-13404833 ]
Michael McCandless commented on TIKA-758: ----------------------------------------- Looks like the TODOs are all in PDF2XHTML.java, currently: {noformat} mike@vine:/l/tika.trunk$ grep -r TODO . | grep -i PDFBOX | grep .java: ./tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java: // TODO: remove once PDFBOX-1130 is fixed: ./tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java: // TODO: remove once PDFBOX-1143 is fixed: ./tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java: // TODO: remove once PDFBOX-1130 is fixed ./tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDF2XHTML.java: // TODO: remove once PDFBOX-1130 is fixed {noformat} > Address TODOs when we upgrade to next PDFBox release > ---------------------------------------------------- > > Key: TIKA-758 > URL: https://issues.apache.org/jira/browse/TIKA-758 > Project: Tika > Issue Type: Improvement > Reporter: Michael McCandless > > Like TIKA-757 for POI, I'm opening this blanket issue to address any TODOs in > the code when we next upgrade PDFBox. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira