Re: svn commit: r1163970 - in /tika/trunk: tika-core/src/main/java/org/apache/tika/extractor/ tika-core/src/main/java/org/apache/tika/io/ tika-core/src/main/java/org/apache/tika/parser/ tika-core/src/

2011-09-02 Thread Michael McCandless
On Thu, Sep 1, 2011 at 12:39 PM, Jukka Zitting wrote: > Hi, > > On Thu, Sep 1, 2011 at 5:08 PM, Michael McCandless > wrote: >> We might want to mark APIs like TemporaryResources "internal" in the >> javadocs, ie, that we reseve the right to suddenly change them and >> they are just public so that

[jira] [Commented] (TIKA-701) Fix problems with TemporaryFiles

2011-09-02 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13095878#comment-13095878 ] Michael McCandless commented on TIKA-701: - bq. The idea behind that logic is that if

[jira] [Commented] (TIKA-683) RTF Parser issues with non european characters

2011-09-02 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13095904#comment-13095904 ] Jukka Zitting commented on TIKA-683: +1, I'm eager to see us drop the javax.swing depend

[jira] [Resolved] (TIKA-207) MS word doc containing tracked changes produces incorrect text

2011-09-02 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-207. Resolution: Fixed Fix Version/s: 1.0 Assignee: Jukka Zitting Thanks, Curt! Patch comm

[jira] [Commented] (TIKA-683) RTF Parser issues with non european characters

2011-09-02 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13095978#comment-13095978 ] Michael McCandless commented on TIKA-683: - Thanks Jukka! That's a good idea to move

[jira] [Resolved] (TIKA-704) PDF and Outlook docs embedded in MS Word documents not parsed

2011-09-02 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-704. Resolution: Fixed Fix Version/s: 1.0 Assignee: Jukka Zitting Thanks for bringing this

[jira] [Resolved] (TIKA-702) Cannot compile Tika with Java 7 (ImageMetadataExtractor.java)

2011-09-02 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-702. Resolution: Fixed Assignee: Jukka Zitting Fixed in revision 1164617 by no longer using the com

[jira] [Resolved] (TIKA-698) "Invalid UTF-16 surrogate detected:" parsing PowerPoint 97-2003

2011-09-02 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-698. Resolution: Fixed Assignee: Jukka Zitting Thanks for reporting this! Fixed in revision 1164655.

[jira] [Commented] (TIKA-612) Specify PDFBox options via ParseContext

2011-09-02 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13096266#comment-13096266 ] Jukka Zitting commented on TIKA-612: +1 looks good to me. A possible design improvement