[jira] [Created] (TIKA-1430) CHM parser gets faulty text (fix found)

2014-09-28 Thread Bin Hawking (JIRA)
Bin Hawking created TIKA-1430: - Summary: CHM parser gets faulty text (fix found) Key: TIKA-1430 URL: https://issues.apache.org/jira/browse/TIKA-1430 Project: Tika Issue Type: Bug

Re: Apache Tika - JSON?

2014-09-28 Thread Lewis John Mcgibbney
Hi Vineet, On Sun, Sep 28, 2014 at 1:21 AM, dev-digest-h...@tika.apache.org wrote: I was wondering if there any in built parser to get help in conversion from XHTML to JSON. My research showed that there is one named org.apache.io.json which just one method implemented. Also, I tried GJSON

[jira] [Updated] (TIKA-1220) Parser implementration for IFC files

2014-09-28 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle updated TIKA-1220: -- Assignee: Lewis John McGibbney Parser implementration for IFC files

[jira] [Commented] (TIKA-1220) Parser implementration for IFC files

2014-09-28 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14151204#comment-14151204 ] Dave Meikle commented on TIKA-1220: --- Hi [~lewismc] - this is now assigned to you. I have

[jira] [Commented] (TIKA-1420) Add Metadata Extraction to Arbitrary Parsers

2014-09-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14151207#comment-14151207 ] Chris A. Mattmann commented on TIKA-1420: - Any reason this is in examples? I think

[jira] [Commented] (TIKA-1220) Parser implementration for IFC files

2014-09-28 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14151209#comment-14151209 ] Lewis John McGibbney commented on TIKA-1220: Dynamite [~davemeikle] some man.

[jira] [Commented] (TIKA-1420) Add Metadata Extraction to Arbitrary Parsers

2014-09-28 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14151228#comment-14151228 ] Nick Burch commented on TIKA-1420: -- Now it's dependency free, don't see why it can't be in

[jira] [Commented] (TIKA-1420) Add Metadata Extraction to Arbitrary Parsers

2014-09-28 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14151230#comment-14151230 ] Chris A. Mattmann commented on TIKA-1420: - Thanks Nick! [~tpalsulich] when you get

[jira] [Commented] (TIKA-1420) Add Metadata Extraction to Arbitrary Parsers

2014-09-28 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14151232#comment-14151232 ] Tyler Palsulich commented on TIKA-1420: --- Thanks [~gagravarr] and [~chrismattmann].

Re: Apache Tika - JSON?

2014-09-28 Thread Vineet Ghatge Hemantkumar
Hi Lewis, Thanks. I did take a look at this. I was looking for ways to jsonify data extracted by tika, but that seems like nothing out of box is present with tika (other than getting metadata parsed). The way I have done is by extending one of the SAXHandler and overriding the existing functions