[ 
https://issues.apache.org/jira/browse/TIKA-2224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16743319#comment-16743319
 ] 

Tim Allison commented on TIKA-2224:
-----------------------------------

If I understand correctly, the reason to go with a custom parser that calls the 
c++ executable instead of configuring an ExternalParser is so that Tika would 
convert the JSON into our usual xhtml?

Thank you for the ping and the pointer.  I just searched around briefly for 
java and OneNote...to no avail...

> Mime magic for OneNote formats
> ------------------------------
>
>                 Key: TIKA-2224
>                 URL: https://issues.apache.org/jira/browse/TIKA-2224
>             Project: Tika
>          Issue Type: Improvement
>          Components: mime
>    Affects Versions: 1.14
>            Reporter: Nick Burch
>            Priority: Major
>         Attachments: Sample1.json, Sample1.one, note-ssn-test-mmmm.one
>
>
> As raised at 
> http://stackoverflow.com/questions/41272195/onenote-support-for-apache-tika-parsers,
>  we don't have any magic for the OneNote formats. Several years ago we dug 
> out the file format specs (see 
> http://lucene.472066.n3.nabble.com/Tika-OneNote-Support-td4020393.html), but 
> didn't have volunteer energy to implement a parser. However, armed with those 
> specs, we should be able to come up with some mime magic for detection



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to