[GitHub] tika pull request: TIKA-1369 Avoid ThreadLocal usage from Memory L...

2014-10-06 Thread vilmospapp
GitHub user vilmospapp opened a pull request: https://github.com/apache/tika/pull/17 TIKA-1369 Avoid ThreadLocal usage from Memory Leak Hi @chrismattmann , Based on our discussion from https://github.com/apache/tika/pull/15 I've added the ThreadLocal clean up part, so

[jira] [Commented] (TIKA-1369) Date parsing and thread safety in ImageMetadataExtractor

2014-10-06 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14160234#comment-14160234 ] ASF GitHub Bot commented on TIKA-1369: -- GitHub user vilmospapp opened a pull request:

[jira] [Comment Edited] (TIKA-1437) encoding issue in AutoDetectReader

2014-10-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14160312#comment-14160312 ] Tim Allison edited comment on TIKA-1437 at 10/6/14 2:04 PM: No

[jira] [Commented] (TIKA-1437) encoding issue in AutoDetectReader

2014-10-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14160312#comment-14160312 ] Tim Allison commented on TIKA-1437: --- No encoding detector will be perfect. Are you

[GitHub] tika pull request: TIKA-1354 Add test method with nonfunctional fo...

2014-10-06 Thread hlavki
GitHub user hlavki opened a pull request: https://github.com/apache/tika/pull/18 TIKA-1354 Add test method with nonfunctional fork parser There is something wrong with pax commons logging so ForkParser doesn't work in general. Test method: testForkParserPdf() I suppose

[jira] [Commented] (TIKA-1354) ForkParser doesn't work in OSGI container

2014-10-06 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14160340#comment-14160340 ] ASF GitHub Bot commented on TIKA-1354: -- GitHub user hlavki opened a pull request:

Re: OCR with tika-server

2014-10-06 Thread kevin slote
Ok, I am signed up. https://wiki.apache.org/tika/Kevin%20Slote On Fri, Oct 3, 2014 at 11:02 PM, Mattmann, Chris A (3980) chris.a.mattm...@jpl.nasa.gov wrote: Kevin glad it is now fixed with you! If you get a chance, please feel free to document this on the wiki:

[jira] [Commented] (TIKA-1435) Update rome dependency to 1.5

2014-10-06 Thread Johannes Mockenhaupt (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14160745#comment-14160745 ] Johannes Mockenhaupt commented on TIKA-1435: Chris thanks. Here's what I found

[jira] [Comment Edited] (TIKA-1435) Update rome dependency to 1.5

2014-10-06 Thread Johannes Mockenhaupt (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14160745#comment-14160745 ] Johannes Mockenhaupt edited comment on TIKA-1435 at 10/6/14 7:13 PM:

[jira] [Updated] (TIKA-1435) Update rome dependency to 1.5

2014-10-06 Thread Johannes Mockenhaupt (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Johannes Mockenhaupt updated TIKA-1435: --- Attachment: netcdf-deps-changes.diff Changed dependencies netcdf 4.2.20 - 4.3.22

[jira] [Commented] (TIKA-1435) Update rome dependency to 1.5

2014-10-06 Thread Johannes Mockenhaupt (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14160756#comment-14160756 ] Johannes Mockenhaupt commented on TIKA-1435: Hm, the netcdf upgrade also about

[jira] [Comment Edited] (TIKA-1435) Update rome dependency to 1.5

2014-10-06 Thread Johannes Mockenhaupt (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14160756#comment-14160756 ] Johannes Mockenhaupt edited comment on TIKA-1435 at 10/6/14 7:15 PM:

[jira] [Issue Comment Deleted] (TIKA-1435) Update rome dependency to 1.5

2014-10-06 Thread Johannes Mockenhaupt (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Johannes Mockenhaupt updated TIKA-1435: --- Comment: was deleted (was: Changed dependencies netcdf 4.2.20 - 4.3.22) Update rome

Tesseract OCR always activeated parser for images

2014-10-06 Thread Lewis John Mcgibbney
Hi Folks, Now, once I install Tesseract, it is run for every image I pass through Tika server or Tika app. This is not okay as it does not give me the type of MD I am looking for. This is a just a note to folks, to say that AFAIK you would need to unregister the the parser from [0] then rebuild

Re: Tesseract OCR always activeated parser for images

2014-10-06 Thread Tyler Palsulich
Confirmed. This is why we ran into TIKA-1422. But, Chris' patch may provide the backwards compatibility you're looking for. What do you think? Tyler On Mon, Oct 6, 2014 at 7:47 PM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: Hi Folks, Now, once I install Tesseract, it is run for

[jira] [Commented] (TIKA-1437) encoding issue in AutoDetectReader

2014-10-06 Thread Shuai Liu (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14161285#comment-14161285 ] Shuai Liu commented on TIKA-1437: - Thanks Tim, but i imbedded my response below, i hope

[jira] [Comment Edited] (TIKA-1437) encoding issue in AutoDetectReader

2014-10-06 Thread Shuai Liu (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14161285#comment-14161285 ] Shuai Liu edited comment on TIKA-1437 at 10/7/14 12:44 AM: ---

[jira] [Comment Edited] (TIKA-1437) encoding issue in AutoDetectReader

2014-10-06 Thread Shuai Liu (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14161285#comment-14161285 ] Shuai Liu edited comment on TIKA-1437 at 10/7/14 12:44 AM: ---