[jira] [Created] (TIKA-757) Address TODOs when we upgrade to next POI release (3.8 beta 5)

2011-10-20 Thread Michael McCandless (Created) (JIRA)
Address TODOs when we upgrade to next POI release (3.8 beta 5) -- Key: TIKA-757 URL: https://issues.apache.org/jira/browse/TIKA-757 Project: Tika Issue Type: Improvement

[jira] [Created] (TIKA-758) Address TODOs when we upgrade to next PDFBox release

2011-10-20 Thread Michael McCandless (Created) (JIRA)
Address TODOs when we upgrade to next PDFBox release Key: TIKA-758 URL: https://issues.apache.org/jira/browse/TIKA-758 Project: Tika Issue Type: Improvement Reporter: Michael

[jira] [Created] (TIKA-753) Improve performance when parsing embedded Office docs

2011-10-14 Thread Michael McCandless (Created) (JIRA)
Improve performance when parsing embedded Office docs - Key: TIKA-753 URL: https://issues.apache.org/jira/browse/TIKA-753 Project: Tika Issue Type: Improvement Components: parser

[jira] [Created] (TIKA-751) Small improvements to how embedded docs are parsed in AbstractPOIFSExtractor.handleEmbeddedOfficeDoc

2011-10-12 Thread Michael McCandless (Created) (JIRA)
Small improvements to how embedded docs are parsed in AbstractPOIFSExtractor.handleEmbeddedOfficeDoc Key: TIKA-751 URL:

[jira] [Created] (TIKA-742) PDF2XHTML fails to insert p nor space around page marker

2011-10-04 Thread Michael McCandless (Created) (JIRA)
PDF2XHTML fails to insert p nor space around page marker -- Key: TIKA-742 URL: https://issues.apache.org/jira/browse/TIKA-742 Project: Tika Issue Type: Bug Components: parser

[jira] [Created] (TIKA-738) Tika fails to extract text from PDF annotations

2011-10-03 Thread Michael McCandless (Created) (JIRA)
Tika fails to extract text from PDF annotations --- Key: TIKA-738 URL: https://issues.apache.org/jira/browse/TIKA-738 Project: Tika Issue Type: Bug Components: parser

[jira] [Created] (TIKA-736) OpenOffice parser: master footer text isn't extracted

2011-10-01 Thread Michael McCandless (Created) (JIRA)
OpenOffice parser: master footer text isn't extracted - Key: TIKA-736 URL: https://issues.apache.org/jira/browse/TIKA-736 Project: Tika Issue Type: Bug Components: parser