[jira] [Commented] (TIKA-1472) Warning on Tika Server startup - Failed to load class org.slf4j.impl.StaticLoggerBinder

2014-11-18 Thread Darya Arbuzova (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215883#comment-14215883 ] Darya Arbuzova commented on TIKA-1472: -- Thank you! Just to be sure, I want to ask: can

[jira] [Created] (TIKA-1480) TikaJAXRS

2014-11-18 Thread Darya Arbuzova (JIRA)
Darya Arbuzova created TIKA-1480: Summary: TikaJAXRS Key: TIKA-1480 URL: https://issues.apache.org/jira/browse/TIKA-1480 Project: Tika Issue Type: Bug Components: server

[jira] [Updated] (TIKA-1480) TikaJAXRS get all resourses call fail

2014-11-18 Thread Darya Arbuzova (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darya Arbuzova updated TIKA-1480: - Summary: TikaJAXRS get all resourses call fail (was: TikaJAXRS ) TikaJAXRS get all resourses

[jira] [Created] (TIKA-1481) TikaJAXRS get metadata calls give different results

2014-11-18 Thread Darya Arbuzova (JIRA)
Darya Arbuzova created TIKA-1481: Summary: TikaJAXRS get metadata calls give different results Key: TIKA-1481 URL: https://issues.apache.org/jira/browse/TIKA-1481 Project: Tika Issue Type:

[jira] [Updated] (TIKA-1481) TikaJAXRS get metadata calls give different results

2014-11-18 Thread Darya Arbuzova (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darya Arbuzova updated TIKA-1481: - Description: Hello! I'm trying to use Tika in server mode. I downloaded tika-server-1.6.jar from

[jira] [Updated] (TIKA-1481) TikaJAXRS get metadata calls give different results

2014-11-18 Thread Darya Arbuzova (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darya Arbuzova updated TIKA-1481: - Description: Hello! I'm trying to use Tika in server mode. I downloaded tika-server-1.6.jar from

[jira] [Commented] (TIKA-1472) Warning on Tika Server startup - Failed to load class org.slf4j.impl.StaticLoggerBinder

2014-11-18 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215947#comment-14215947 ] Konstantin Gribov commented on TIKA-1472: - If you want to _upload_ jar, you have to

[jira] [Commented] (TIKA-1472) Warning on Tika Server startup - Failed to load class org.slf4j.impl.StaticLoggerBinder

2014-11-18 Thread Darya Arbuzova (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215954#comment-14215954 ] Darya Arbuzova commented on TIKA-1472: -- [~grossws], awesome, it works, thanks!

[jira] [Commented] (TIKA-1480) TikaJAXRS get all resourses call fail

2014-11-18 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14215961#comment-14215961 ] Konstantin Gribov commented on TIKA-1480: - You can browse [http://localhost:9998/]

[jira] [Created] (TIKA-1482) ForkParser throws exceptions when process some large pdf files

2014-11-18 Thread Sean Zhao (JIRA)
Sean Zhao created TIKA-1482: --- Summary: ForkParser throws exceptions when process some large pdf files Key: TIKA-1482 URL: https://issues.apache.org/jira/browse/TIKA-1482 Project: Tika Issue Type:

[jira] [Updated] (TIKA-1482) ForkParser throws exceptions when process some large pdf files

2014-11-18 Thread Sean Zhao (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Zhao updated TIKA-1482: Attachment: SRCH-13412.pdf Sample File will cause ForkParser throw exception. ForkParser throws exceptions

[jira] [Updated] (TIKA-1482) ForkParser throws exceptions when process some large pdf files

2014-11-18 Thread Sean Zhao (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Zhao updated TIKA-1482: Description: In Tika 1.6, ForkParser throws org.apache.tika.exception.TikaException , message:Unexpected

[jira] [Commented] (TIKA-1480) TikaJAXRS get all resourses call fail

2014-11-18 Thread Darya Arbuzova (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14216040#comment-14216040 ] Darya Arbuzova commented on TIKA-1480: -- Thank you! I'll write a letter asking someone

[jira] [Closed] (TIKA-1480) TikaJAXRS get all resourses call fail

2014-11-18 Thread Darya Arbuzova (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Darya Arbuzova closed TIKA-1480. Resolution: Fixed TikaJAXRS get all resourses call fail -

[jira] [Commented] (TIKA-1480) TikaJAXRS get all resourses call fail

2014-11-18 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14216130#comment-14216130 ] Dave Meikle commented on TIKA-1480: --- I have updated the Wiki page. TikaJAXRS get all

[jira] [Assigned] (TIKA-595) HtmlHandler does not support multivalue metadata

2014-11-18 Thread Dave Meikle (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dave Meikle reassigned TIKA-595: Assignee: Dave Meikle HtmlHandler does not support multivalue metadata

[jira] [Commented] (TIKA-1469) Upgrade to POI 3.11-beta3 when available

2014-11-18 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14216238#comment-14216238 ] Nick Burch commented on TIKA-1469: -- 3.11 beta 3 has finally hit maven central (took a

RE: TIKA-1445 and having multiple Parsers (as many as needed) work on the same MediaType

2014-11-18 Thread Allison, Timothy B.
Chris, Thank you for moving this to the dev list. This would be a fairly large change, and the discussion is valuable. -----Original Message----- From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov] Sent: Monday, November 17, 2014 5:25 PM To: dev@tika.apache.org Subject:

[jira] [Commented] (TIKA-1473) Apache Tika is not working for .docx documents

2014-11-18 Thread Milan Zivkovic (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14216337#comment-14216337 ] Milan Zivkovic commented on TIKA-1473: -- I have the similar problem with .docx file.

[jira] [Commented] (TIKA-1445) Figure out how to add Image metadata extraction to Tesseract parser

2014-11-18 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14216351#comment-14216351 ] Tim Allison commented on TIKA-1445: --- [~gagravarr], thank you for explaining the original

[jira] [Commented] (TIKA-595) HtmlHandler does not support multivalue metadata

2014-11-18 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14216358#comment-14216358 ] Tim Allison commented on TIKA-595: -- +1 to supporting multivalue metadata. Be careful to

[jira] [Commented] (TIKA-595) HtmlHandler does not support multivalue metadata

2014-11-18 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14216359#comment-14216359 ] Tim Allison commented on TIKA-595: -- Doh! Sorry, you're just dealing with String keys not

[jira] [Commented] (TIKA-1445) Figure out how to add Image metadata extraction to Tesseract parser

2014-11-18 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14216365#comment-14216365 ] Tim Allison commented on TIKA-1445: --- Copied from dev discussion to record points on this

[jira] [Commented] (TIKA-1445) Figure out how to add Image metadata extraction to Tesseract parser

2014-11-18 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14216444#comment-14216444 ] Nick Burch commented on TIKA-1445: -- I think it's fairly common for people to have 4-5

[jira] [Commented] (TIKA-1445) Figure out how to add Image metadata extraction to Tesseract parser

2014-11-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14216451#comment-14216451 ] Chris A. Mattmann commented on TIKA-1445: - Hey Guys, to be honest, the way I see

[jira] [Commented] (TIKA-1445) Figure out how to add Image metadata extraction to Tesseract parser

2014-11-18 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14216466#comment-14216466 ] Nick Burch commented on TIKA-1445: -- Anyone using tika-parser OOTB has two parsers services

[jira] [Commented] (TIKA-1445) Figure out how to add Image metadata extraction to Tesseract parser

2014-11-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14216960#comment-14216960 ] Chris A. Mattmann commented on TIKA-1445: - Hi Nick: I think we need to be careful

[jira] [Commented] (TIKA-1445) Figure out how to add Image metadata extraction to Tesseract parser

2014-11-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14217100#comment-14217100 ] Lewis John McGibbney commented on TIKA-1445: We can run many extractors against

[jira] [Commented] (TIKA-1482) ForkParser throws exceptions when process some large pdf files

2014-11-18 Thread Sean Zhao (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14217218#comment-14217218 ] Sean Zhao commented on TIKA-1482: - Hello Nick, Thank you very much for quick response. And

[jira] [Created] (TIKA-1483) Create a general raw string parser

2014-11-18 Thread Luis Filipe Nassif (JIRA)
Luis Filipe Nassif created TIKA-1483: Summary: Create a general raw string parser Key: TIKA-1483 URL: https://issues.apache.org/jira/browse/TIKA-1483 Project: Tika Issue Type: New

[jira] [Commented] (TIKA-1445) Figure out how to add Image metadata extraction to Tesseract parser

2014-11-18 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14217407#comment-14217407 ] Lewis John McGibbney commented on TIKA-1445: OK so in Any23, if we were to take

[jira] [Commented] (TIKA-1358) Add support for newer iWork file formats

2014-11-18 Thread Trejkaz (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14217468#comment-14217468 ] Trejkaz commented on TIKA-1358: --- And of course now iWork is using zip files [again? There is