[jira] [Commented] (TIKA-1728) Detection is not working properly for detecting HWP 5.0 file

2015-09-10 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14738454#comment-14738454 ] Nick Burch commented on TIKA-1728: -- That's the header of one of the OLE2 streams, not of t

[jira] [Commented] (TIKA-1728) Detection is not working properly for detecting HWP 5.0 file

2015-09-10 Thread mungeol heo (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14738502#comment-14738502 ] mungeol heo commented on TIKA-1728: --- Yes, I know. It is the reason why I used "file heade

[jira] [Comment Edited] (TIKA-1728) Detection is not working properly for detecting HWP 5.0 file

2015-09-10 Thread mungeol heo (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14738502#comment-14738502 ] mungeol heo edited comment on TIKA-1728 at 9/10/15 9:41 AM: Yes

[jira] [Comment Edited] (TIKA-1728) Detection is not working properly for detecting HWP 5.0 file

2015-09-10 Thread mungeol heo (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14738502#comment-14738502 ] mungeol heo edited comment on TIKA-1728 at 9/10/15 9:47 AM: Yes

[jira] [Comment Edited] (TIKA-1728) Detection is not working properly for detecting HWP 5.0 file

2015-09-10 Thread mungeol heo (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14738502#comment-14738502 ] mungeol heo edited comment on TIKA-1728 at 9/10/15 9:49 AM: Yes

[jira] [Created] (TIKA-1733) RuntimeException when parsing some word (.doc) documents

2015-09-10 Thread Christophe Lacroix (JIRA)
Christophe Lacroix created TIKA-1733: Summary: RuntimeException when parsing some word (.doc) documents Key: TIKA-1733 URL: https://issues.apache.org/jira/browse/TIKA-1733 Project: Tika I

[jira] [Commented] (TIKA-1731) Try to integrate java-hwp into Tika

2015-09-10 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14738638#comment-14738638 ] Tim Allison commented on TIKA-1731: --- Thank you for looking into this. bq. can Tika+POI

[jira] [Updated] (TIKA-1733) RuntimeException when parsing some word (.doc) documents

2015-09-10 Thread Christophe Lacroix (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christophe Lacroix updated TIKA-1733: - Description: I'm using Tika to extract text for Solr indexing. For some word documents, Ti

[jira] [Commented] (TIKA-1731) Try to integrate java-hwp into Tika

2015-09-10 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14738660#comment-14738660 ] Tim Allison commented on TIKA-1731: --- Great. Thank you so much! It would be helpful to k

[jira] [Comment Edited] (TIKA-1731) Try to integrate java-hwp into Tika

2015-09-10 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14738660#comment-14738660 ] Tim Allison edited comment on TIKA-1731 at 9/10/15 12:26 PM: - G

[jira] [Commented] (TIKA-1731) Try to integrate java-hwp into Tika

2015-09-10 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14738663#comment-14738663 ] Tim Allison commented on TIKA-1731: --- [~mungeol], out of curiosity, what is your gut feeli

Get DEBUG level log in tika-server

2015-09-10 Thread rahulk09
Hi, I want tika server to give DEBUG level logs and it should be written to a log file. I am running tika-server via this command java -jar tika-server.jar -h 0.0.0.0 -p 9998 but for every file it only displays just one message to stdout- "Tika auto detecting type" i used "java -jar tika-serv

[jira] [Commented] (TIKA-1733) RuntimeException when parsing some word (.doc) documents

2015-09-10 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14738809#comment-14738809 ] Tim Allison commented on TIKA-1733: --- Thank you for submitting a document that triggers th

[jira] [Commented] (TIKA-1733) RuntimeException when parsing some word (.doc) documents

2015-09-10 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14739005#comment-14739005 ] Tim Allison commented on TIKA-1733: --- Can't figure out what's going wrong, I've opened: h

[jira] [Commented] (TIKA-1733) RuntimeException when parsing some word (.doc) documents

2015-09-10 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14739022#comment-14739022 ] Tim Allison commented on TIKA-1733: --- And, y, in Tika 1.4 we grabbed footer text with this

RE: ApacheCon Europe meetup and/or hackathon?

2015-09-10 Thread Allison, Timothy B.
Ditto! Cheers! -Original Message- From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov] Sent: Wednesday, September 09, 2015 7:52 PM To: dev@tika.apache.org; d...@poi.apache.org Subject: Re: ApacheCon Europe meetup and/or hackathon? Sounds awesome. I won’t be there, but I

[jira] [Commented] (TIKA-1731) Try to integrate java-hwp into Tika

2015-09-10 Thread mungeol heo (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14740141#comment-14740141 ] mungeol heo commented on TIKA-1731: --- I believe HWP similar with microsoft word. e.g. HW

[jira] [Commented] (TIKA-1731) Try to integrate java-hwp into Tika

2015-09-10 Thread mungeol heo (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14740144#comment-14740144 ] mungeol heo commented on TIKA-1731: --- I will tell ddoleye about the 5 steps. > Try to int