[ 
https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14738502#comment-14738502
 ] 

mungeol heo edited comment on TIKA-1728 at 9/10/15 9:47 AM:
------------------------------------------------------------

Yes, I know. That is why I used the term "file header" in the first place.
My question is whether we can make good use of the "HWP Document File" 
string to make sure HWP 5.0 files are detected correctly.
Please excuse my poor English if I confused you.
I am just trying to make sure Tika does not mis-detect other files as HWP.
Anyway, I will stop discussing this and trust that the experts will handle 
it well.
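
To illustrate the idea in the comment above, here is a minimal sketch in plain Java. It is not Tika's detector. It assumes, based on the attached HWP 5.0 format document, that an HWP 5.0 file is an OLE2 container holding a stream named "FileHeader" whose content begins with the ASCII text "HWP Document File". The class name Hwp5Sniffer and its methods are hypothetical, and reading the stream out of the container (for example with an OLE2 reader such as Apache POI) is left to the caller. The media types are the ones quoted in this thread; reusing application/x-hwp for HWP 5.0 is also an assumption.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

/** Hypothetical helper for illustration only; not part of Tika. */
final class Hwp5Sniffer {

    // Magic bytes at the start of every OLE2 compound document.
    private static final byte[] OLE2_MAGIC = {
        (byte) 0xD0, (byte) 0xCF, 0x11, (byte) 0xE0,
        (byte) 0xA1, (byte) 0xB1, 0x1A, (byte) 0xE1
    };

    // Assumed signature at the start of the "FileHeader" stream.
    private static final byte[] HWP5_SIGNATURE =
        "HWP Document File".getBytes(StandardCharsets.US_ASCII);

    private static boolean startsWith(byte[] data, byte[] prefix) {
        if (data == null || data.length < prefix.length) {
            return false;
        }
        return Arrays.equals(Arrays.copyOf(data, prefix.length), prefix);
    }

    /** True if the first bytes of the file carry the OLE2 magic. */
    static boolean looksLikeOle2(byte[] fileStart) {
        return startsWith(fileStart, OLE2_MAGIC);
    }

    /** True if the given "FileHeader" stream content starts with the signature. */
    static boolean isHwp5FileHeader(byte[] fileHeaderStream) {
        return startsWith(fileHeaderStream, HWP5_SIGNATURE);
    }

    /**
     * Two-step check: container first, then signature. Passing null for
     * fileHeaderStream means the container has no "FileHeader" stream.
     */
    static String detect(byte[] fileStart, byte[] fileHeaderStream) {
        if (!looksLikeOle2(fileStart)) {
            return "application/octet-stream";
        }
        return isHwp5FileHeader(fileHeaderStream)
            ? "application/x-hwp"
            : "application/x-tika-msoffice";
    }
}
```

Requiring both the stream and its signature is what would keep other OLE2 files from being reported as HWP: a container without a matching "FileHeader" stream falls back to the generic type.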


was (Author: mungeol):
Yes, I know. That is why I used the term "file header" in the first place.
Please excuse my poor English if I confused you.
I am just trying to make sure Tika does not mis-detect other files as HWP.
Anyway, I will stop discussing this and trust that the experts will handle 
it well.

> Detection is not working properly for detecting HWP 5.0 file
> ------------------------------------------------------------
>
>                 Key: TIKA-1728
>                 URL: https://issues.apache.org/jira/browse/TIKA-1728
>             Project: Tika
>          Issue Type: Bug
>         Environment: OS: windows 7 and centos 6
> Java: 1.7
> Tika jar: tika-app-1.10.jar
> File: HWP 5.0
>            Reporter: mungeol heo
>         Attachments: HWP-document-file-formats-3.0-Korean.pdf, 
> HWP-document-file-formats-5.0-Korean.pdf, error-message.png, test_3.0.hwp, 
> test_5.0.hwp
>
>
> The HWP file format has two versions: HWP 3.0 and HWP 5.0.
> 'tika-app-1.10.jar' detects HWP 3.0 files correctly,
> but not HWP 5.0 files.
> The commands used and the results returned are listed below.
> > java -jar tika-app-1.10.jar --detect test_3.0.hwp
> > application/x-hwp
> > java -jar tika-app-1.10.jar --detect test_5.0.hwp
> > application/x-tika-msoffice



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
