tika-trunk-jdk1.7 - Build # 54 - Failure

2014-06-18 Thread Apache Jenkins Server
The Apache Jenkins build system has built tika-trunk-jdk1.7 (build #54) Status: Failure Check console output at https://builds.apache.org/job/tika-trunk-jdk1.7/54/ to view the results.

tika-trunk-jdk1.6 - Build # 54 - Still Failing

2014-06-18 Thread Apache Jenkins Server
The Apache Jenkins build system has built tika-trunk-jdk1.6 (build #54) Status: Still Failing Check console output at https://builds.apache.org/job/tika-trunk-jdk1.6/54/ to view the results.

[jira] [Commented] (TIKA-1063) OpenDocument basic style support

2014-06-18 Thread JIRA
[ https://issues.apache.org/jira/browse/TIKA-1063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14035362#comment-14035362 ] Axel Dörfler commented on TIKA-1063: The patch still applies on branches/1.5 as well as

Re: Can some of tika-parsers module dependencies be made optional ?

2014-06-18 Thread Sergey Beryozkin
Though we can exclude some of the dependencies directly in our pom.xml, that can work too. We'll experiment a bit with which dependencies are absolutely needed for tika-parsers and which may be excluded. Any feedback will be apreciated Thanks, Sergey On 17/06/14 21:16, Sergey Beryozkin wrote:

Re: Can some of tika-parsers module dependencies be made optional ?

2014-06-18 Thread Ray Gauss
I think for 2.0 we should consider splitting out parsers into their own projects for a streamlined dependency hierarchy then reassembling them with something like a tika-parsers-all artifact. On June 17, 2014 at 5:08:38 PM, Nick Burch (apa...@gagravarr.org) wrote: On Tue, 17 Jun 2014, Sergey

Re: Can some of tika-parsers module dependencies be made optional ?

2014-06-18 Thread Sergey Beryozkin
Hi On 18/06/14 13:52, Ray Gauss wrote: I think for 2.0 we should consider splitting out parsers into their own projects for a streamlined dependency hierarchy then reassembling them with something like a tika-parsers-all artifact. something like that will make sense IMHO On June 17, 2014

Re: Working on a new Translation plugin using Joshua

2014-06-18 Thread Lewis John Mcgibbney
Nice Chris. On Tue, Jun 17, 2014 at 5:59 PM, dev-digest-h...@tika.apache.org wrote: In the meanwhile I should have a review board patch up soon too for the JoshuaTranslator. I'll keep my eyes peeled for this one. Thanks Lewis

[jira] [Commented] (TIKA-994) Type Detection Fault

2014-06-18 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14035714#comment-14035714 ] Tyler Palsulich commented on TIKA-994: -- application/vnd.solent.sdkm+xml is the proper

Patch: self-contained HTML using Data URI

2014-06-18 Thread Andrew Skiba
Hi, In the current code, the images from Word documents are referenced by embedded:xxx links in the generated HTML. This causes the browsers display x icon instead of the image. The proposed patch encodes the images using Data URI, if there is -Dtika.parsers.urlimages system property.

Re: Can some of tika-parsers module dependencies be made optional ?

2014-06-18 Thread Nick Burch
On Wed, 18 Jun 2014, Ray Gauss wrote: I think for 2.0 we should consider splitting out parsers into their own projects for a streamlined dependency hierarchy then reassembling them with something like a tika-parsers-all artifact. We had another thread on that not that long ago, where someone

Re: Can some of tika-parsers module dependencies be made optional ?

2014-06-18 Thread Sergey Beryozkin
Hi Nick On 18/06/14 16:38, Nick Burch wrote: On Wed, 18 Jun 2014, Ray Gauss wrote: I think for 2.0 we should consider splitting out parsers into their own projects for a streamlined dependency hierarchy then reassembling them with something like a tika-parsers-all artifact. We had another

Re: Can some of tika-parsers module dependencies be made optional ?

2014-06-18 Thread Sergey Beryozkin
On 18/06/14 16:58, Sergey Beryozkin wrote: Hi Nick On 18/06/14 16:38, Nick Burch wrote: On Wed, 18 Jun 2014, Ray Gauss wrote: I think for 2.0 we should consider splitting out parsers into their own projects for a streamlined dependency hierarchy then reassembling them with something like a

Re: Can some of tika-parsers module dependencies be made optional ?

2014-06-18 Thread Nick Burch
On Wed, 18 Jun 2014, Sergey Beryozkin wrote: The reason we need it is that CXF can not ship all of Tika Parser dependencies because CXF will only offer a light-weight Tika-aware handler. Sounds like you just want to depend on tika-core then, and not tika-parsers. That'll give you mime magic

Re: Can some of tika-parsers module dependencies be made optional ?

2014-06-18 Thread Nick Burch
On Wed, 18 Jun 2014, Sergey Beryozkin wrote: Can we start with adding a section to Tika docs documenting the core dependencies of the tike-parsers module to make the life a bit easier for developers who do not expect the specific parser implementations immediately downloaded ? Are you not

[jira] [Commented] (TIKA-1063) OpenDocument basic style support

2014-06-18 Thread JIRA
[ https://issues.apache.org/jira/browse/TIKA-1063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14035883#comment-14035883 ] Axel Dörfler commented on TIKA-1063: Hi Tyler, if I had known that a little nudge

[jira] [Commented] (TIKA-1063) OpenDocument basic style support

2014-06-18 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14035971#comment-14035971 ] Tyler Palsulich commented on TIKA-1063: --- Hi, ;) I tried switching it and they still

Re: Can some of tika-parsers module dependencies be made optional ?

2014-06-18 Thread Ken Krugler
Hi Nick, On Jun 18, 2014, at 9:07am, Nick Burch apa...@gagravarr.org wrote: On Wed, 18 Jun 2014, Sergey Beryozkin wrote: The reason we need it is that CXF can not ship all of Tika Parser dependencies because CXF will only offer a light-weight Tika-aware handler. Sounds like you just want

[jira] [Created] (TIKA-1343) Create a Tika Translator implementation that uses JoshuaDecoder

2014-06-18 Thread Chris A. Mattmann (JIRA)
Chris A. Mattmann created TIKA-1343: --- Summary: Create a Tika Translator implementation that uses JoshuaDecoder Key: TIKA-1343 URL: https://issues.apache.org/jira/browse/TIKA-1343 Project: Tika

Review Request 22761: Create a Tika Translator implementation that uses JoshuaDecoder

2014-06-18 Thread Chris Mattmann
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/22761/ --- Review request for tika. Bugs: tika-1343

Re: Review Request 22761: Create a Tika Translator implementation that uses JoshuaDecoder

2014-06-18 Thread Chris Mattmann
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/22761/ --- (Updated June 18, 2014, 10:04 p.m.) Review request for tika. Bugs: tika-1343

[jira] [Commented] (TIKA-1343) Create a Tika Translator implementation that uses JoshuaDecoder

2014-06-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14036503#comment-14036503 ] Chris A. Mattmann commented on TIKA-1343: - Review Board up here:

[jira] [Resolved] (TIKA-994) Type Detection Fault

2014-06-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved TIKA-994. Resolution: Fixed Thanks to [~tpalsulich] for the resolution, closing. Type Detection

Re: Can some of tika-parsers module dependencies be made optional ?

2014-06-18 Thread Ray Gauss
The point of a tika-parsers-all artifact would be a single dependency that re-aggregates everything so that downstream projects could work the same way they do now and not worry about missing dependencies. Meanwhile people that just want PDF parsing could declare only the tike-parser-pdf

Is TikaExceptionMapper in tika-server actually used?

2014-06-18 Thread Allison, Timothy B.
All, In working on adding the stacktrace from a parse exception to the server response, I'm trying to find the most jax-rsly elegant way of handling exceptions. There seems to be a bit of duplicated code, some with good reason, for exception handling. Is TikaExceptionMapper actually used