[jira] [Assigned] (TIKA-875) Temporary file leak in ImageParser

2012-03-13 Thread Michael McCandless (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned TIKA-875: --- Assignee: Michael McCandless > Temporary file leak in ImageParser > ---

[jira] [Assigned] (TIKA-870) Allow to use call parseToString with a additional parameter of MaxStringLength, so it can be changed per call

2012-03-07 Thread Michael McCandless (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned TIKA-870: --- Assignee: Michael McCandless > Allow to use call parseToString with a additional pa

[jira] [Assigned] (TIKA-801) ContentHandlerDecorator outputs invalid element

2011-12-08 Thread Michael McCandless (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned TIKA-801: --- Assignee: Michael McCandless > ContentHandlerDecorator outputs invalid element > --

[jira] [Assigned] (TIKA-778) NullPointerException in tika-app, parsing PDF content

2011-11-26 Thread Michael McCandless (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned TIKA-778: --- Assignee: Michael McCandless > NullPointerException in tika-app, parsing PDF conten

[jira] [Assigned] (TIKA-782) Add support for parsing binary data in RTF files

2011-11-15 Thread Michael McCandless (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned TIKA-782: --- Assignee: Michael McCandless > Add support for parsing binary data in RTF files > -

[jira] [Assigned] (TIKA-781) RTF parser should ignore most control words in ignore groups

2011-11-11 Thread Michael McCandless (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned TIKA-781: --- Assignee: Michael McCandless > RTF parser should ignore most control words in ignor

[jira] [Assigned] (TIKA-529) IBM420 charset detection's isLamAlef is allocation-happy

2011-11-08 Thread Michael McCandless (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned TIKA-529: --- Assignee: Michael McCandless (was: Ken Krugler) > IBM420 charset detection's isLam

[jira] [Assigned] (TIKA-777) RTF parser incorrectly applies fonts to complete group

2011-11-08 Thread Michael McCandless (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned TIKA-777: --- Assignee: Michael McCandless > RTF parser incorrectly applies fonts to complete gro

[jira] [Assigned] (TIKA-714) Word art isn't extracted for various doc types

2011-11-06 Thread Michael McCandless (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned TIKA-714: --- Assignee: Michael McCandless > Word art isn't extracted for various doc types > ---

[jira] [Assigned] (TIKA-736) OpenOffice parser: master footer text isn't extracted

2011-10-26 Thread Michael McCandless (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned TIKA-736: --- Assignee: Michael McCandless > OpenOffice parser: master footer text isn't extracte

[jira] [Assigned] (TIKA-724) PDF text sometimes has extra space between letters

2011-10-19 Thread Michael McCandless (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned TIKA-724: --- Assignee: Michael McCandless > PDF text sometimes has extra space between letters >

[jira] [Assigned] (TIKA-738) Tika fails to extract text from PDF annotations

2011-10-18 Thread Michael McCandless (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-738?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned TIKA-738: --- Assignee: Michael McCandless > Tika fails to extract text from PDF annotations > --

[jira] [Assigned] (TIKA-748) RTF parser fails to extract the body

2011-10-09 Thread Michael McCandless (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned TIKA-748: --- Assignee: Michael McCandless > RTF parser fails to extract the body > -

[jira] [Assigned] (TIKA-721) UTF16-LE not detected

2011-10-02 Thread Michael McCandless (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned TIKA-721: --- Assignee: Michael McCandless > UTF16-LE not detected > - > >

[jira] [Assigned] (TIKA-711) Word parser doesn't extract optional hyphen correctly

2011-10-02 Thread Michael McCandless (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned TIKA-711: --- Assignee: Michael McCandless > Word parser doesn't extract optional hyphen correctl

[jira] [Assigned] (TIKA-733) [PATCH] RTF TextExtractor processGroupEnd() NoSuchElementException

2011-09-28 Thread Michael McCandless (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned TIKA-733: --- Assignee: Michael McCandless > [PATCH] RTF TextExtractor processGroupEnd() NoSuchEl