[jira] [Created] (TIKA-1298) testEmbeddedPDFEmbeddingAnotherDocument fails with PDFBox 1.8.5 and java 1.6

2014-05-16 Thread Tim Allison (JIRA)
Tim Allison created TIKA-1298: - Summary: testEmbeddedPDFEmbeddingAnotherDocument fails with PDFBox 1.8.5 and java 1.6 Key: TIKA-1298 URL: https://issues.apache.org/jira/browse/TIKA-1298 Project: Tika

Extended fix for TIKA-1169

2014-05-16 Thread Matthias Krueger
I came across some other .jnilib binaries which were detected as .class files and caused issues. It seems there are more Mach-o binary magic variants depending on 32/64 Bit architecture and endianness. Fix is attached. Let me know if I should rather clone the closed TIKA-1169 and attach it

Re: Extended fix for TIKA-1169

2014-05-16 Thread Ken Krugler
Hi Matthias, A new issue would be great, as that's what we use to tag changes in SVN. Also a test case (a .jnilib binary that currently fails) would be good. -- Ken On May 15, 2014, at 12:55pm, Matthias Krueger c...@mkr.io wrote: I came across some other .jnilib binaries which were detected

[jira] [Commented] (TIKA-1295) Make some Dublin Core items multi-valued

2014-05-16 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1295?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13998861#comment-13998861 ] Tim Allison commented on TIKA-1295: --- Fixed bug that initially made me notice this issue

[GitHub] tika pull request: WIP TIKA-1292: Fixing the MimeTypes class to co...

2014-05-16 Thread cstamas
GitHub user cstamas opened a pull request: https://github.com/apache/tika/pull/7 WIP TIKA-1292: Fixing the MimeTypes class to consider clusters of magics by priority This changes MimeTypes class to consider clusters of magics instead of first found to resolve priority clashes like

[jira] [Commented] (TIKA-1169) Fails to parse jnilib file

2014-05-16 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13999825#comment-13999825 ] ASF GitHub Bot commented on TIKA-1169: -- Github user asfgit closed the pull request at:

[jira] [Commented] (TIKA-1169) Fails to parse jnilib file

2014-05-16 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13999705#comment-13999705 ] ASF GitHub Bot commented on TIKA-1169: -- GitHub user mkr opened a pull request:

Re: [DISCUSS] Nightly Jenkins Builds for Trunk

2014-05-16 Thread Lewis John Mcgibbney
Hi Nick/Others, Please see link below for Tika trunk build on Oracle JDK's (latest) 6 and 7 respectively. We also have a now deprecated Tika trunk build which was doing zilch... we also have a currently disabled cob configured to run with Oracle JDK8 (latest) when this become available to build

[jira] [Assigned] (TIKA-1300) Switch default PDFBox parser to NonSequentialParser

2014-05-16 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison reassigned TIKA-1300: - Assignee: Tim Allison Switch default PDFBox parser to NonSequentialParser

Re: JAXRS, endpoints and a / welcome page - any ideas why it's broken?

2014-05-16 Thread Chris Mattmann
Hi Guys, Some thoughts here: -Original Message- From: Nick Burch apa...@gagravarr.org Reply-To: dev@tika.apache.org dev@tika.apache.org Date: Wednesday, May 14, 2014 6:22 AM To: dev@tika.apache.org dev@tika.apache.org Subject: Re: JAXRS, endpoints and a / welcome page - any ideas why

[jira] [Updated] (TIKA-1302) Let's run Tika against a large batch of docs nightly

2014-05-16 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1302: -- Description: Many thanks to [~lewismc] for TIKA-1301! Once we get that up and running for nightly

[jira] [Created] (TIKA-1302) Let's run Tika against a large batch of docs nightly

2014-05-16 Thread Tim Allison (JIRA)
Tim Allison created TIKA-1302: - Summary: Let's run Tika against a large batch of docs nightly Key: TIKA-1302 URL: https://issues.apache.org/jira/browse/TIKA-1302 Project: Tika Issue Type:

[jira] [Updated] (TIKA-1302) Let's run Tika against a large batch of docs nightly

2014-05-16 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1302: -- Description: Many thanks to [~lewismc] for TIKA-1301! Once we get nightly builds up and running

Re: JAXRS, endpoints and a / welcome page - any ideas why it's broken?

2014-05-16 Thread Sergey Beryozkin
Hi Nick, On 14/05/14 14:22, Nick Burch wrote: On Wed, 14 May 2014, Sergey Beryozkin wrote: UnpackerResource has no Path annotation so it is defaulted to /. Every endpoint method within the class does have one though. I would've expected it to match based on those, is that not the case?

[GitHub] tika pull request: [TIKA-1247] WIP: Exploded parsers: asm, audio, ...

2014-05-16 Thread cstamas
Github user cstamas closed the pull request at: https://github.com/apache/tika/pull/5 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

Re: parser metadata empty after tika detect

2014-05-16 Thread Nick Burch
On Fri, 16 May 2014, aliosha79 wrote: For this purpose i have write these few code lines: File f = new File(MyEmail.eml); is= new FileInputStream(f); Tika tika = new Tika(); String mimeType = tika.detect(is); This will most likely use a fair bit (to possibly all) of

[jira] [Commented] (TIKA-1298) testEmbeddedPDFEmbeddingAnotherDocument fails with PDFBox 1.8.5 and java 1.6

2014-05-16 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13998779#comment-13998779 ] Tim Allison commented on TIKA-1298: --- One reason to upgrade to Java 1.7 if you haven't

[jira] [Updated] (TIKA-1302) Let's run Tika against a large batch of docs nightly

2014-05-16 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1302: -- Description: Many thanks to [~lewismc] for TIKA-1301! Once we get nightly builds up and running

Re: tika install fail on os x 10.9.2

2014-05-16 Thread Ramirez, Paul M (398J)
Annie, I haven't built tika in a while but if it's a typical maven build the details of the test output will be captured in one of the files in the target directory. If you find those details and post them here that would help troubleshoot what is going on. Thanks, Paul Ramirez On May 8,