[
https://issues.apache.org/jira/browse/TIKA-3104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17123801#comment-17123801
]
Tim Allison commented on TIKA-3104:
-----------------------------------
[~Tamane] I have no idea if I was decoding it properly because I don't
understand what the binary is encoding. I did call Tika's AutoDetectParser on
the data.getBytes(), but that was useless, because as you point out this is
some proprietary graph info.
I would encourage downloading url content in a separate process than running
Tika. If you must do both at once, try something like this.
{noformat}
Parser p = new AutoDetectParser();
Metadata m = new Metadata();
URL url = new
URL("https://issues.apache.org/jira/secure/attachment/13004188/DeepScroll_Example%5B4988%5D.memgraph");
ContentHandler contentHandler = new ToXMLContentHandler();
try (TikaInputStream tis = TikaInputStream.get(url)) {
p.parse(tis, contentHandler, m, new ParseContext());
}
System.out.println(contentHandler.toString());
{noformat}
> Detection of memgraph files exported from Xcode
> -----------------------------------------------
>
> Key: TIKA-3104
> URL: https://issues.apache.org/jira/browse/TIKA-3104
> Project: Tika
> Issue Type: Wish
> Components: core
> Affects Versions: 1.24
> Reporter: Parth
> Assignee: Tim Allison
> Priority: Major
> Labels: detection, features, new-parser
> Attachments: DeepScroll_Example[4988].memgraph, memgraph.xml,
> out.memgraph.json, out.memgraph.xhtml
>
>
> I wanted to detect a memgraph file linked by a url. But currently detection
> of memgraph file is not supported. I tried adding to custom-mimetypes but
> that did not help.
> <mime-info>
> <mime-type type="application/memgraph">
> <glob pattern="*.memgraph"/>
> </mime-type>
> </mime-info>
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)