Re: Does tika support "HWP"?

2015-09-01 Thread Mungeol Heo
Thank you for your reply. I will try to write a customized parser for HWP file. And if my code is "pretty enough", I will consider to contribute it. Again, thank you. On Tue, Sep 1, 2015 at 7:58 PM, Nick Burch wrote: > On Tue, 1 Sep 2015, Mungeol Heo wrote: >>> >>> java -jar tika-app-1.10.jar --l

Re: Can I add custom detector to be called last to parse common containers' subtypes?

2015-09-01 Thread Nick Burch
On Thu, 27 Aug 2015, Mikhail Titov wrote: On Wed, Aug 26, 2015 at 6:11 AM, Nick Burch wrote: You probably shouldn't be defining additional mimetypes to DefaultParser. I had an impression that indeed there should be no explicit definition and new types should be hooked up to a default parse

Re: Does tika support "HWP"?

2015-09-01 Thread Nick Burch
On Tue, 1 Sep 2015, Mungeol Heo wrote: java -jar tika-app-1.10.jar --list-supported-types | grep hwp application/x-hwp That means the mime type has been defined in some way java -jar tika-app-1.10.jar --detect sample.hwp application/x-tika-msoffice That means that the HWP file is based on t

Re: Use TikaJAXRS with HDD offsets instead of urls

2015-09-01 Thread Sergey Beryozkin
Hi the server accepts InputStream form a multipart attachment or fromj the immediate request body, in the latter case it is HTTP PUT, so you can use the client library to PUT bytes to the server Cheers, Sergey On 01/09/15 09:44, zahlenm...@gmx.de wrote: Hey everyone, I am parsing file system

Use TikaJAXRS with HDD offsets instead of urls

2015-09-01 Thread zahlenmeer
Hey everyone, I am parsing file systems in hdd images in a c++ program. For further analysis I would like to parse the files I find with Tika. The recommended way I found was setting up a Tika server and send and recieve files and responses with its RESTful interface. Unfortunately I can only se