[GitHub] tika pull request: Corrected and Improved

2016-04-06 Thread reevapp
GitHub user reevapp closed the pull request at: https://github.com/apache/tika/pull/99

[jira] [Commented] (TIKA-1936) Clean up parsers not cleaning up resources

2016-04-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15228852#comment-15228852 ] Tim Allison commented on TIKA-1936: --- Looks like an unhappy day in Hudson's world, probabl

[GitHub] tika pull request: Corrected and Improved

2016-04-06 Thread reevapp
GitHub user reevapp reopened a pull request: https://github.com/apache/tika/pull/99 Corrected and Improved The original code did not work at all: the WebClient was an instance variable, and it was not only not thread-safe but would also work only for the very first request (all s

[GitHub] tika pull request: Corrected and Improved

2016-04-06 Thread reevapp
GitHub user reevapp opened a pull request: https://github.com/apache/tika/pull/99 Corrected and Improved The original code did not work at all: the WebClient was an instance variable, and it was not only not thread-safe but would also work only for the very first request (all sub

[jira] [Commented] (TIKA-1936) Clean up parsers not cleaning up resources

2016-04-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15228818#comment-15228818 ] Hudson commented on TIKA-1936: -- FAILURE: Integrated in tika-2.x #78 (See [https://builds.apac

tika-2.x - Build # 78 - Failure

2016-04-06 Thread Apache Jenkins Server
The Apache Jenkins build system has built tika-2.x (build #78) Status: Failure Check console output at https://builds.apache.org/job/tika-2.x/78/ to view the results.

Re: FW: Apache Tika used to parse the Panama papers!

2016-04-06 Thread Tilman Hausherr
Yes I read about that too :-) It would be interesting to hear whether they had any problems, and whether they made any support requests, and were these answered successfully? Were there any files that failed or did poorly? Or was everything so good that no help was needed at all? I'm delight

[jira] [Commented] (TIKA-1936) Clean up parsers not cleaning up resources

2016-04-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15228745#comment-15228745 ] Hudson commented on TIKA-1936: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #947 (See [https://b

FW: Apache Tika used to parse the Panama papers!

2016-04-06 Thread Allison, Timothy B.
Looks like quite a few MSG files! -Original Message- From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.gov] Sent: Tuesday, April 05, 2016 6:47 PM To: dev@tika.apache.org Cc: pr...@apache.org Subject: Apache Tika used to parse the Panama papers! FYI: http://www.forbes.com/

FW: Apache Tika used to parse the Panama papers!

2016-04-06 Thread Allison, Timothy B.
Looks like quite a few PDFs [0]... Couldn't have done it without you! Cheers, Tim P.S. Tip of the hat to Andreas for rt the link! [0] https://twitter.com/bigdata/status/717346207312392192 -Original Message- From: Mattmann, Chris A (3980) [mailto:chris.a.mattm...@jpl.nasa.

[jira] [Updated] (TIKA-1935) ISArchiveParser not releasing resources

2016-04-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1935: -- Issue Type: Sub-task (was: Improvement) Parent: TIKA-1936 > ISArchiveParser not releasing resour

[jira] [Updated] (TIKA-1934) GeographicInformationParserTest leaving behind temp file in trunk

2016-04-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1934: -- Issue Type: Sub-task (was: Improvement) Parent: TIKA-1936 > GeographicInformationParserTest leav

[jira] [Updated] (TIKA-1932) Clear resources in ParserDecorator

2016-04-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1932: -- Issue Type: Sub-task (was: Bug) Parent: TIKA-1936 > Clear resources in ParserDecorator > ---

[jira] [Updated] (TIKA-1933) ForkParser leaves tmp jars behind on Windows (at least)

2016-04-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1933: -- Issue Type: Sub-task (was: Improvement) Parent: TIKA-1936 > ForkParser leaves tmp jars behind on

[jira] [Updated] (TIKA-1929) Need to close resources on exception in sqlite parser

2016-04-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1929: -- Issue Type: Sub-task (was: Bug) Parent: TIKA-1936 > Need to close resources on exception in sqli

[jira] [Updated] (TIKA-1930) Clean up resources from grib parser

2016-04-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1930: -- Issue Type: Sub-task (was: Bug) Parent: TIKA-1936 > Clean up resources from grib parser > --

[jira] [Commented] (TIKA-1936) Clean up parsers not cleaning up resources

2016-04-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15228633#comment-15228633 ] Tim Allison commented on TIKA-1936: --- Made a few modifications to: * MATParser * NetCDFPa

[jira] [Resolved] (TIKA-1935) ISArchiveParser not releasing resources

2016-04-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-1935. --- Resolution: Fixed > ISArchiveParser not releasing resources > --- >

[jira] [Commented] (TIKA-1935) ISArchiveParser not releasing resources

2016-04-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15228405#comment-15228405 ] Hudson commented on TIKA-1935: -- FAILURE: Integrated in tika-trunk-jdk1.7 #946 (See [https://b

tika-trunk-jdk1.7 - Build # 946 - Failure

2016-04-06 Thread Apache Jenkins Server
The Apache Jenkins build system has built tika-trunk-jdk1.7 (build #946) Status: Failure Check console output at https://builds.apache.org/job/tika-trunk-jdk1.7/946/ to view the results.

[jira] [Commented] (TIKA-1934) GeographicInformationParserTest leaving behind temp file in trunk

2016-04-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15228377#comment-15228377 ] Hudson commented on TIKA-1934: -- SUCCESS: Integrated in tika-2.x #77 (See [https://builds.apac

[jira] [Commented] (TIKA-1935) ISArchiveParser not releasing resources

2016-04-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15228376#comment-15228376 ] Hudson commented on TIKA-1935: -- SUCCESS: Integrated in tika-2.x #77 (See [https://builds.apac

Re: @ApacheTika , and release related tweets question

2016-04-06 Thread Mattmann, Chris A (3980)
FYI I updated the front page with a news item link to the Panama papers. ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office:

Re: @ApacheTika , and release related tweets question

2016-04-06 Thread Mattmann, Chris A (3980)
++1 on all the feedback from you two below :)

[jira] [Commented] (TIKA-1934) GeographicInformationParserTest leaving behind temp file in trunk

2016-04-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15228271#comment-15228271 ] Hudson commented on TIKA-1934: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #945 (See [https://b

[jira] [Commented] (TIKA-1932) Clear resources in ParserDecorator

2016-04-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15228272#comment-15228272 ] Hudson commented on TIKA-1932: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #945 (See [https://b

[jira] [Created] (TIKA-1936) Clean up parsers not cleaning up resources

2016-04-06 Thread Tim Allison (JIRA)
Tim Allison created TIKA-1936: - Summary: Clean up parsers not cleaning up resources Key: TIKA-1936 URL: https://issues.apache.org/jira/browse/TIKA-1936 Project: Tika Issue Type: Improvement

[jira] [Commented] (TIKA-1932) Clear resources in ParserDecorator

2016-04-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15228262#comment-15228262 ] Hudson commented on TIKA-1932: -- SUCCESS: Integrated in tika-2.x #76 (See [https://builds.apac

[jira] [Created] (TIKA-1935) ISArchiveParser not releasing resources

2016-04-06 Thread Tim Allison (JIRA)
Tim Allison created TIKA-1935: - Summary: ISArchiveParser not releasing resources Key: TIKA-1935 URL: https://issues.apache.org/jira/browse/TIKA-1935 Project: Tika Issue Type: Improvement

[jira] [Updated] (TIKA-1934) GeographicInformationParserTest leaving behind temp file in trunk

2016-04-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1934: -- Description: GeographicInformationParser needs to release TemporaryResources. (was: Need to close TikaIn

[jira] [Commented] (TIKA-1835) LinkContentHandler skips iframe and rel tags

2016-04-06 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15228229#comment-15228229 ] Markus Jelsma commented on TIKA-1835: - Hello Ken - I agree, script src is indeed missin

[jira] [Resolved] (TIKA-1934) GeographicInformationParserTest leaving behind temp file in trunk

2016-04-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-1934. --- Resolution: Fixed > GeographicInformationParserTest leaving behind temp file in trunk > ---

[jira] [Created] (TIKA-1934) GeographicInformationParserTest leaving behind temp file in trunk

2016-04-06 Thread Tim Allison (JIRA)
Tim Allison created TIKA-1934: - Summary: GeographicInformationParserTest leaving behind temp file in trunk Key: TIKA-1934 URL: https://issues.apache.org/jira/browse/TIKA-1934 Project: Tika Issue

Re: @ApacheTika , and release related tweets question

2016-04-06 Thread Bob Paulin
Hi Nick, This is awesome and I think should be great for the community! I looked to commons as an example https://twitter.com/ApacheCommons . Looks like they tweet out the releases with a link to the mailing list comments. Might be a good precedent to follow to bring attention to the fact th

[jira] [Created] (TIKA-1933) ForkParser leaves tmp jars behind on Windows (at least)

2016-04-06 Thread Tim Allison (JIRA)
Tim Allison created TIKA-1933: - Summary: ForkParser leaves tmp jars behind on Windows (at least) Key: TIKA-1933 URL: https://issues.apache.org/jira/browse/TIKA-1933 Project: Tika Issue Type: Impr

[jira] [Commented] (TIKA-1932) Clear resources in ParserDecorator

2016-04-06 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15228196#comment-15228196 ] Hudson commented on TIKA-1932: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #944 (See [https://b

[jira] [Resolved] (TIKA-1932) Clear resources in ParserDecorator

2016-04-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-1932. --- Resolution: Fixed Fix Version/s: 1.13 2.0 I think I finally got it right. Ar

@ApacheTika , and release related tweets question

2016-04-06 Thread Nick Burch
Hi All Firstly, in case you haven't heard, we've set up a Twitter account for the project! It's @ApacheTika - https://twitter.com/ApacheTika One thing we'll want to use it for is project publicity, linking to interesting things going on around the project, such as today's post on how the pan

[jira] [Created] (TIKA-1932) Clear resources in ParserDecorator

2016-04-06 Thread Tim Allison (JIRA)
Tim Allison created TIKA-1932: - Summary: Clear resources in ParserDecorator Key: TIKA-1932 URL: https://issues.apache.org/jira/browse/TIKA-1932 Project: Tika Issue Type: Bug Reporter:

[jira] [Commented] (TIKA-1931) Revert mp4 parser version because of new permanent hangs with 1.1.18

2016-04-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15228128#comment-15228128 ] Tim Allison commented on TIKA-1931: --- https://github.com/sannies/mp4parser/issues/187 > R

[jira] [Updated] (TIKA-1931) Revert mp4 parser version because of new permanent hangs with 1.1.18

2016-04-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1931: -- Attachment: mp4_parser_timeouts.zip Triggering files > Revert mp4 parser version because of new permanen

[jira] [Created] (TIKA-1931) Revert mp4 parser version because of new permanent hangs with 1.1.18

2016-04-06 Thread Tim Allison (JIRA)
Tim Allison created TIKA-1931: - Summary: Revert mp4 parser version because of new permanent hangs with 1.1.18 Key: TIKA-1931 URL: https://issues.apache.org/jira/browse/TIKA-1931 Project: Tika Is

RE: Apache Tika used to parse the Panama papers!

2016-04-06 Thread Vasu Jain
Looks like someone took USC-CS572's assignments to a new level. ;) > Date: Wed, 6 Apr 2016 09:28:49 +0200 > Subject: Re: Apache Tika used to parse the Panama papers! > From: bdelacre...@apache.org > To: chris.a.mattm...@jpl.nasa.gov; pr...@apache.org > CC: dev@tika.apache.org > > Hi, > > On Wed,

Re: Apache Tika used to parse the Panama papers!

2016-04-06 Thread Bertrand Delacretaz
Hi, On Wed, Apr 6, 2016 at 12:46 AM, Mattmann, Chris A (3980) > http://www.forbes.com/sites/thomasbrewster/2016/04/05/panama-papers-amazon-encryption-epic-leak Note that this also mentions Apache Solr. -Bertrand