[jira] [Assigned] (TIKA-782) Add support for parsing binary data in RTF files

2011-11-15 Thread Michael McCandless (Assigned) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned TIKA-782: --- Assignee: Michael McCandless > Add support for parsing binary data in RTF files > -

[jira] [Commented] (TIKA-782) Add support for parsing binary data in RTF files

2011-11-15 Thread Arjohn Kampman (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13150719#comment-13150719 ] Arjohn Kampman commented on TIKA-782: - I've attached an improved patch that actually rea

[jira] [Updated] (TIKA-782) Add support for parsing binary data in RTF files

2011-11-15 Thread Arjohn Kampman (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arjohn Kampman updated TIKA-782: Attachment: bin2.patch improved patch > Add support for parsing binary data in RTF f

[jira] [Commented] (TIKA-778) NullPointerException in tika-app, parsing PDF content

2011-11-15 Thread Bastian Mathes (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13150570#comment-13150570 ] Bastian Mathes commented on TIKA-778: - Calling the extraction directly on the command li

[jira] [Updated] (TIKA-612) Specify PDFBox options via ParseContext

2011-11-15 Thread Michael McCandless (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated TIKA-612: Attachment: TIKA-612.patch Patch, just adding setSortByPosition to PDFParser. I think this i

[jira] [Commented] (TIKA-773) .NET version of Tika

2011-11-15 Thread Chris A. Mattmann (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13150535#comment-13150535 ] Chris A. Mattmann commented on TIKA-773: That is awesome, Jukka! >

[jira] [Commented] (TIKA-778) NullPointerException in tika-app, parsing PDF content

2011-11-15 Thread Jukka Zitting (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13150395#comment-13150395 ] Jukka Zitting commented on TIKA-778: Looks like the problem is coming from the HTML seri

[jira] [Commented] (TIKA-773) .NET version of Tika

2011-11-15 Thread Jukka Zitting (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13150392#comment-13150392 ] Jukka Zitting commented on TIKA-773: There's now an ikvm profile in the tika-app POM tha

[jira] [Resolved] (TIKA-783) MD5 and SHA1 values posted on the download page for the .jar do not match actual computed values

2011-11-15 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-783. Resolution: Invalid No, the values on the web site are correct. I suspect the jar you downloaded may

[jira] [Resolved] (TIKA-779) Detection of Microsoft Works 2000 Word Processor files

2011-11-15 Thread Nick Burch (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch resolved TIKA-779. - Resolution: Fixed Fix Version/s: 1.1 > Detection of Microsoft Works 2000 Word Processor files >

[jira] [Commented] (TIKA-779) Detection of Microsoft Works 2000 Word Processor files

2011-11-15 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-779?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13150331#comment-13150331 ] Nick Burch commented on TIKA-779: - Thanks, patch applied (with some tweaks) in r1202109.

Re: Tika-605 GDAL Parser

2011-11-15 Thread Nick Burch
On Thu, 10 Nov 2011, Ramirez, Paul M (388J) wrote: I'm really interested in working on Tika-605 and just wondered if anyone else is already out there trying to finish this off. I'd suggest the silence is a sign you should just go for it! Nick

[jira] [Commented] (TIKA-663) JSP files data extraction failed

2011-11-15 Thread Dave Meikle (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13150324#comment-13150324 ] Dave Meikle commented on TIKA-663: -- Thanks Nick. Was going to add it last night but forgot

[jira] [Commented] (TIKA-663) JSP files data extraction failed

2011-11-15 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13150306#comment-13150306 ] Nick Burch commented on TIKA-663: - The mimetype entry looks good to me, so I've added it (wi