[GitHub] tika pull request: TIKA-1369 Avoid ThreadLocal usage from Memory L...

2014-10-06 Thread vilmospapp
GitHub user vilmospapp opened a pull request: https://github.com/apache/tika/pull/17 TIKA-1369 Avoid ThreadLocal usage from Memory Leak Hi @chrismattmann, Based on our discussion from https://github.com/apache/tika/pull/15 I've added the ThreadLocal clean up part, so theor

[jira] [Commented] (TIKA-1369) Date parsing and thread safety in ImageMetadataExtractor

2014-10-06 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14160234#comment-14160234 ] ASF GitHub Bot commented on TIKA-1369: -- GitHub user vilmospapp opened a pull request:

[jira] [Comment Edited] (TIKA-1437) encoding issue in AutoDetectReader

2014-10-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14160312#comment-14160312 ] Tim Allison edited comment on TIKA-1437 at 10/6/14 2:04 PM: No

[jira] [Commented] (TIKA-1437) encoding issue in AutoDetectReader

2014-10-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14160312#comment-14160312 ] Tim Allison commented on TIKA-1437: --- No encoding detector will be perfect. Are you sur

[GitHub] tika pull request: TIKA-1354 Add test method with nonfunctional fo...

2014-10-06 Thread hlavki
GitHub user hlavki opened a pull request: https://github.com/apache/tika/pull/18 TIKA-1354 Add test method with nonfunctional fork parser There is something wrong with pax commons logging so ForkParser doesn't work in general. Test method: testForkParserPdf() I suppose

[jira] [Commented] (TIKA-1354) ForkParser doesn't work in OSGI container

2014-10-06 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14160340#comment-14160340 ] ASF GitHub Bot commented on TIKA-1354: -- GitHub user hlavki opened a pull request:

Re: OCR with tika-server

2014-10-06 Thread kevin slote
Ok, I am signed up. https://wiki.apache.org/tika/Kevin%20Slote On Fri, Oct 3, 2014 at 11:02 PM, Mattmann, Chris A (3980) < chris.a.mattm...@jpl.nasa.gov> wrote: > Kevin glad it is now fixed with you! > > If you get a chance, please feel free to document > this on the wiki: > > https://wiki.apach

[jira] [Commented] (TIKA-1435) Update rome dependency to 1.5

2014-10-06 Thread Johannes Mockenhaupt (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14160745#comment-14160745 ] Johannes Mockenhaupt commented on TIKA-1435: Chris thanks. Here's what I found

[jira] [Comment Edited] (TIKA-1435) Update rome dependency to 1.5

2014-10-06 Thread Johannes Mockenhaupt (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14160745#comment-14160745 ] Johannes Mockenhaupt edited comment on TIKA-1435 at 10/6/14 7:13 PM:

[jira] [Updated] (TIKA-1435) Update rome dependency to 1.5

2014-10-06 Thread Johannes Mockenhaupt (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Johannes Mockenhaupt updated TIKA-1435: --- Attachment: netcdf-deps-changes.diff Changed dependencies netcdf 4.2.20 -> 4.3.22 > Up

[jira] [Commented] (TIKA-1435) Update rome dependency to 1.5

2014-10-06 Thread Johannes Mockenhaupt (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14160756#comment-14160756 ] Johannes Mockenhaupt commented on TIKA-1435: Hm, the netcdf upgrade also about

[jira] [Comment Edited] (TIKA-1435) Update rome dependency to 1.5

2014-10-06 Thread Johannes Mockenhaupt (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14160756#comment-14160756 ] Johannes Mockenhaupt edited comment on TIKA-1435 at 10/6/14 7:15 PM:

[jira] [Issue Comment Deleted] (TIKA-1435) Update rome dependency to 1.5

2014-10-06 Thread Johannes Mockenhaupt (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Johannes Mockenhaupt updated TIKA-1435: --- Comment: was deleted (was: Changed dependencies netcdf 4.2.20 -> 4.3.22) > Update rome

Tesseract OCR always activated parser for images

2014-10-06 Thread Lewis John Mcgibbney
Hi Folks, Now, once I install Tesseract, it is run for every image I pass through Tika server or Tika app. This is not okay as it does not give me the type of MD I am looking for. This is just a note to folks, to say that AFAIK you would need to unregister the parser from [0] then rebuild fro

Re: Tesseract OCR always activated parser for images

2014-10-06 Thread Tyler Palsulich
Confirmed. This is why we ran into TIKA-1422. But, Chris' patch may provide the backwards compatibility you're looking for. What do you think? Tyler On Mon, Oct 6, 2014 at 7:47 PM, Lewis John Mcgibbney < lewis.mcgibb...@gmail.com> wrote: > Hi Folks, > Now, once I install Tesseract, it is run for

[jira] [Commented] (TIKA-1437) encoding issue in AutoDetectReader

2014-10-06 Thread Shuai Liu (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14161285#comment-14161285 ] Shuai Liu commented on TIKA-1437: - Thanks Tim, but I embedded my response below, I hope you

[jira] [Comment Edited] (TIKA-1437) encoding issue in AutoDetectReader

2014-10-06 Thread Shuai Liu (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14161285#comment-14161285 ] Shuai Liu edited comment on TIKA-1437 at 10/7/14 12:43 AM: --- Thank

[jira] [Comment Edited] (TIKA-1437) encoding issue in AutoDetectReader

2014-10-06 Thread Shuai Liu (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14161285#comment-14161285 ] Shuai Liu edited comment on TIKA-1437 at 10/7/14 12:44 AM: --- Thank

[jira] [Comment Edited] (TIKA-1437) encoding issue in AutoDetectReader

2014-10-06 Thread Shuai Liu (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14161285#comment-14161285 ] Shuai Liu edited comment on TIKA-1437 at 10/7/14 12:44 AM: --- Thank