[jira] [Resolved] (TIKA-875) Temporary file leak in ImageParser

2012-03-13 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-875. - Resolution: Fixed Fix Version/s: 1.2 Temporary file leak in ImageParser

[jira] [Resolved] (TIKA-870) Allow to use call parseToString with a additional parameter of MaxStringLength, so it can be changed per call

2012-03-11 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-870. - Resolution: Fixed Fix Version/s: 1.2 Thanks Shay! Allow to use

[jira] [Resolved] (TIKA-801) ContentHandlerDecorator outputs invalid element

2011-12-09 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-801. - Resolution: Fixed Fix Version/s: 1.1 ContentHandlerDecorator outputs invalid

[jira] [Resolved] (TIKA-738) Tika fails to extract text from PDF annotations

2011-11-26 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-738. - Resolution: Fixed Per discussion on tika-dev I'll leave this issue closed, and commit this

[jira] [Resolved] (TIKA-778) NullPointerException in tika-app, parsing PDF content

2011-11-26 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-778. - Resolution: Fixed Fix Version/s: 1.1 NullPointerException in tika-app, parsing

[jira] [Resolved] (TIKA-736) OpenOffice parser: master footer text isn't extracted

2011-10-28 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-736. - Resolution: Fixed Fix Version/s: 1.0 OpenOffice parser: master footer text

[jira] [Resolved] (TIKA-582) Lithuanian language identification

2011-10-26 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-582. - Resolution: Fixed Fix Version/s: (was: 0.9) 1.0 Thanks

[jira] [Resolved] (TIKA-753) Improve performance when parsing embedded Office docs

2011-10-20 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-753. - Resolution: Fixed I opened TIKA-757 as the blanket issue for addressing TODOs on next POI

[jira] [Resolved] (TIKA-724) PDF text sometimes has extra space between letters

2011-10-20 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-724. - Resolution: Fixed Fix Version/s: 1.0 PDF text sometimes has extra space

[jira] [Resolved] (TIKA-718) PDF bookmark text isn't extracted

2011-10-18 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-718. - Resolution: Invalid My bad: I'm using OS X's Preview to view PDFs, which lets you add

[jira] [Resolved] (TIKA-751) Small improvements to how embedded docs are parsed in AbstractPOIFSExtractor.handleEmbeddedOfficeDoc

2011-10-12 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-751. - Resolution: Fixed Small improvements to how embedded docs are parsed in

[jira] [Resolved] (TIKA-742) PDF2XHTML fails to insert p nor space around page marker

2011-10-05 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-742. - Resolution: Fixed PDF2XHTML fails to insert p nor space around page marker

[jira] [Resolved] (TIKA-717) Comment/annotation is sometimes not extracted

2011-10-03 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-717. - Resolution: Fixed Fix Version/s: 1.0 Comment/annotation is sometimes not

[jira] [Resolved] (TIKA-722) Arabic PDF doesn't extract correctly

2011-10-03 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-722. - Resolution: Won't Fix OK resolving as Won't Fix. I don't see how Tika can recover when

[jira] [Resolved] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException

2011-10-03 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-733. - Resolution: Fixed Thanks Jeremy! [PATCH] RTF TextExtractor

[jira] [Resolved] (TIKA-711) Word parser doesn't extract optional hyphen correctly

2011-10-03 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-711. - Resolution: Fixed Fix Version/s: 1.0 Word parser doesn't extract optional

[jira] [Resolved] (TIKA-632) Rtf parsing ignores links

2011-10-01 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-632. - Resolution: Fixed Fix Version/s: 1.0 Rtf parsing ignores links

[jira] [Resolved] (TIKA-720) EBCDIC encoding not detected

2011-10-01 Thread Michael McCandless (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved TIKA-720. - Resolution: Fixed Fix Version/s: 0.10 I think this was fixed in 0.10?