[jira] [Updated] (TIKA-1311) Centralize JSON handling of Metadata

2014-06-02 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1311: -- Attachment: TIKA-1311.patch [~gagravarr] and [~rgauss], if you have a chance to review this, I'd

Re: [jira] [Commented] (TIKA-93) OCR support

2014-06-02 Thread Tyler Palsulich
Hi, Tesseract is by itself a project that written on C/C++ and should be compiled differently for each platform. Good point! We should figure out a way to fail gracefully when Tesseract isn't installed, right? Unless there is, in fact, some pure Java OCR implementation. Another thought, we

Re: [jira] [Commented] (TIKA-93) OCR support

2014-06-02 Thread Nick Burch
On Mon, 2 Jun 2014, Tyler Palsulich wrote: Good point! We should figure out a way to fail gracefully when Tesseract isn't installed, right? Unless there is, in fact, some pure Java OCR implementation. I believe the standard policy is that a parser which can't work should either thrown an

Re: [jira] [Commented] (TIKA-93) OCR support

2014-06-02 Thread Tyler Palsulich
I believe the standard policy is that a parser which can't work should either thrown an exception during construction, or return an empty set of types to a call to getSupportedTypes. Either one lets it be gracefully skipped over How do we know when Tesseract is installed? There isn't an

Re: [jira] [Commented] (TIKA-93) OCR support

2014-06-02 Thread Nick Burch
On Mon, 2 Jun 2014, Tyler Palsulich wrote: How do we know when Tesseract is installed? There isn't an easy, cross-platform Java method to check if a given program is installed. Maybe, we make the user specify the install location in some config file? Then, don't have to worry about Tesseract

Re: [jira] [Commented] (TIKA-93) OCR support

2014-06-02 Thread Mattmann, Chris A (3980)
+1, I talked to Tyler a little bit ago and told him to grep for exiftool in Tika :) he will scope. -Original Message- From: Nick Burch apa...@gagravarr.org Reply-To: dev@tika.apache.org dev@tika.apache.org Date: Monday, June 2, 2014 8:36 AM To: dev@tika.apache.org dev@tika.apache.org

[jira] [Commented] (TIKA-1311) Centralize JSON handling of Metadata

2014-06-02 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14015501#comment-14015501 ] Nick Burch commented on TIKA-1311: -- I've had a quick look, and at first glance it looks

[jira] [Commented] (TIKA-1258) Update NetCDF dependency

2014-06-02 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14015531#comment-14015531 ] Lewis John McGibbney commented on TIKA-1258: Some one can commit this patch?

Re: Hello

2014-06-02 Thread Lewis John Mcgibbney
Hi Tyler, On Fri, May 30, 2014 at 8:55 AM, dev-digest-h...@tika.apache.org wrote: Thanks, Tim! I'm more of an IntelliJ guy myself. IDEA has a feature where you can check out a project directly from Subversion, which works pretty well. Eclipse also has this feature. Just for the heads up.

[jira] [Commented] (TIKA-1168) The IWork NumbersContentHandler returns unsupported Metadata PropertyType

2014-06-02 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14015538#comment-14015538 ] Nick Burch commented on TIKA-1168: -- I've just tried with a recent svn snapshot build, and

[jira] [Resolved] (TIKA-1258) Update NetCDF dependency

2014-06-02 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch resolved TIKA-1258. -- Resolution: Fixed Patch applied in r1599263, thanks! Update NetCDF dependency

[jira] [Commented] (TIKA-1311) Centralize JSON handling of Metadata

2014-06-02 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14015560#comment-14015560 ] Chris A. Mattmann commented on TIKA-1311: - Hey Tim this would be a perfect

[jira] [Resolved] (TIKA-1212) Recursive Extraction of Archive File

2014-06-02 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch resolved TIKA-1212. -- Resolution: Invalid The problem is that you're not tracking how far down the rabbit hole you've gone

[jira] [Commented] (TIKA-1258) Update NetCDF dependency

2014-06-02 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14015608#comment-14015608 ] Hudson commented on TIKA-1258: -- SUCCESS: Integrated in tika-trunk-jdk1.6 #16 (See

[jira] [Commented] (TIKA-1258) Update NetCDF dependency

2014-06-02 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14015661#comment-14015661 ] Hudson commented on TIKA-1258: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #16 (See

[jira] [Updated] (TIKA-1158) Wrong info on site for Container Aware Detector

2014-06-02 Thread Tyler Palsulich (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tyler Palsulich updated TIKA-1158: -- Attachment: TIKA-1158.patch Here is a patch for the site. After applying it, run {{mvn site}}

Example code in documentation?

2014-06-02 Thread Nick Burch
Hi All Currently, we have some example code on the website, and some in the wiki, neither of which gets checked to ensure it compiles, neither unit tested. However, it is easy to add I've noticed that a couple of ASF projects now have their example code in svn, and use a new-ish cms feature

[jira] [Resolved] (TIKA-1158) Wrong info on site for Container Aware Detector

2014-06-02 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch resolved TIKA-1158. -- Resolution: Fixed Fix Version/s: (was: 1.4) 1.6 Thanks for this, patch

Re: JAXRS, endpoints and a / welcome page - any ideas why it's broken?

2014-06-02 Thread Nick Burch
On Tue, 20 May 2014, Sergey Beryozkin wrote: Maybe we should post to users@, and see if anyone says they do? Sounds good, please ask or I can do it, let me know please As our jaxrs guru, can you? :) I've just asked at the users list Based on the silence, I don't think the unpacker

Re: JAXRS, endpoints and a / welcome page - any ideas why it's broken?

2014-06-02 Thread Mattmann, Chris A (3980)
I got it :) Sent from my iPhone On Jun 2, 2014, at 12:35 PM, Nick Burch apa...@gagravarr.org wrote: On Tue, 20 May 2014, Sergey Beryozkin wrote: Maybe we should post to users@, and see if anyone says they do? Sounds good, please ask or I can do it, let me know please As our jaxrs guru,

[jira] [Created] (TIKA-1316) Old Site Code in Trunk

2014-06-02 Thread Tyler Palsulich (JIRA)
Tyler Palsulich created TIKA-1316: - Summary: Old Site Code in Trunk Key: TIKA-1316 URL: https://issues.apache.org/jira/browse/TIKA-1316 Project: Tika Issue Type: Improvement Affects

Re: Unpacker resource

2014-06-02 Thread Mattmann, Chris A (3980)
+1 to file a separate issue, Tim. ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-5th floor