Re: 1.20?

2018-11-30 Thread loompa
Hi,
On Wed, 21 Nov 2018 at 13:00, Tim Allison  wrote:

> Dave,
>   Should I try to get the Docker plugin working again?
>

That would be great. I think I may have gone down the wrong path building
an image at package time, as there doesn't seem to be an easy way to
publish it under an Apache-labelled org on Docker Hub unless it builds from
source.

I have some time over the weekend, so could update to where I got to and
see what you think.

Cheers,
Dave


Re: [VOTE] Release Apache Tika 1.19.1 Candidate #2

2018-10-08 Thread loompa
Hello,

On Thu, 4 Oct 2018 at 23:03, Tim Allison  wrote:

> A candidate for the Tika 1.19.1 release is available at:
>   https://dist.apache.org/repos/dist/dev/tika/
>
> The release candidate is a zip archive of the sources in:
>   https://github.com/apache/tika/tree/1.19.1-rc2/
>
> The SHA-512 checksum of the archive is
>
> 4f89216eb3332288c4839139e4af78395fefb3c03be4a6d41a8c9ffadebf69e1732afced25e7fe3c563fb6ce95726a89bd9924c69ddab8e6875a45eec1564fcb
>
> In addition, a staged maven repository is available here:
>
> https://repository.apache.org/content/repositories/orgapachetika-1045/org/apache/tika
>
> Please vote on releasing this package as Apache Tika 1.19.1.
>
> The vote is open for the next 72 hours and passes if a majority of at
> least three +1 Tika PMC votes are cast.
>
> [ ] +1 Release this package as Apache Tika 1.19.1
> [ ] -1 Do not release this package because...
>

+1 from me.

Thanks for rolling the release Tim!

Cheers,
Dave


Re: ***UNCHECKED*** Fwd: MODERATE for annou...@apache.org

2018-09-27 Thread loompa
+1 from me too.

On Wed, 26 Sep 2018, 13:46 Tim Allison,  wrote:

> All,
>
> >It is ok to include the sha512 checksums in text on the page but you also
> >need an https link to the checksum.
>
> It feels from the above like the checksums on the page are ok, but
> what really matters are the checksums via the https links.  If this is
> the case, would anyone object to getting rid of the checksums on our
> webpage and just using the https links?  This would save a tedious
> manual step of updating the website with each release.
>
> Thank you.
>
> Cheers,
>
>   Tim
> On Wed, Sep 19, 2018 at 1:25 PM Craig Russell 
> wrote:
> >
> > Hi Tim,
> >
> > Download page looks good now. Thanks for taking care of this so
> > expeditiously.
> >
> > Regards,
> >
> > Craig
> >
> > On Sep 19, 2018, at 8:35 AM, Tim Allison  wrote:
> >
> > Accidentally dropped Craig in the last email.  Doh!
> >
> > Craig,
> >
> > I just fixed our downloads page...I think.  Let me know if we need to
> > do anything else...or if I botched anything in the announcement email.
> >
> > Thank you, again.
> > On Wed, Sep 19, 2018 at 11:10 AM Tim Allison 
> wrote:
> >
> >
> > Thank you, Craig.
> >
> > To confirm, I got the info right in the announcement email...what we
> > need to fix is our downloads page.  I can do that now.
> >
> > Thank you, again.
> >
> > Cheers,
> >
> >  Tim
> > On Wed, Sep 19, 2018 at 10:50 AM Private List Moderation
> >  wrote:
> >
> >
> > Hi Tika devs,
> >
> > I've moderated this announcement due to the urgency of the release.
> >
> > For future releases, please change the downloads page:
> >
> > It is ok to include the sha512 checksums in text on the page but you
> > also need an https link to the checksum.
> >
> > The link to the KEYS should link to the KEYS file in your distribution
> > directory. The people.apache.org site should not be used.
> >
> > Regards,
> >
> > Craig
> >
> > Announcements of Apache project releases must contain a link to the relevant
> > download page, which might be hosted on an Apache site or a third party site
> > such as github.com. [1]
> >
> > The download page must provide public download links where current official
> > source releases and accompanying cryptographic files may be obtained. [2]
> >
> > Links to the download artifacts must support downloads from mirrors. Links to
> > metadata (SHA, ASC) must be from https://www.apache.org/dist/
> > ** MD5 is no longer considered useful and should not be used. SHA is required. **
> > Links to KEYS must be from https://www.apache.org/dist/ not release
> > specific.
> >
> > Announcements that contain a link to the dyn/closer page alone will be
> > rejected by the moderators.
> >
> > Announcements that contain a link to a web page that does not include a link
> > to a mirror to the artifact plus links to the signature and at least one sha
> > checksum will be rejected.
> >
> > Announcements that link to dist.apache.org will not be accepted.
> > Likewise ones which link to SVN or Git code repos.
> >
> > [1] http://www.apache.org/legal/release-policy.html#release-announcements
> > [2] https://www.apache.org/dev/release-distribution#download-links
> >
> >
> > Begin forwarded message:
> >
> > From: announce-reject-1537286294.48587.jgagknhepoajmbbkh...@apache.org
> > Subject: MODERATE for annou...@apache.org
> > Date: September 18, 2018 at 8:58:14 AM PDT
> > To: Recipient list not shown: ;
> > Cc: announce-allow-tc.1537286294.efngohokkjgkacicfpnk-tallison=
> apache@apache.org
> > Reply-To:
> announce-accept-1537286294.48587.jgagknhepoajmbbkh...@apache.org
> >
> >
> > To approve:
> >  announce-accept-1537286294.48587.jgagknhepoajmbbkh...@apache.org
> > To reject:
> >  announce-reject-1537286294.48587.jgagknhepoajmbbkh...@apache.org
> > To give a reason to reject:
> > %%% Start comment
> > %%% End comment
> >
> >
> > From: Tim Allison 
> > Subject: [ANNOUNCE] Apache Tika 1.19 released
> > Date: September 18, 2018 at 8:58:02 AM PDT
> > To: dev@tika.apache.org, u...@tika.apache.org, annou...@apache.org
> >
> >
> > The Apache Tika project is pleased to announce the release of Apache
> > Tika 1.19. The release contents have been pushed out to the main
> > Apache release site and to the Maven Central sync, so the releases
> > should be available as soon as the mirrors get the syncs.
> >
> > Apache Tika is a toolkit for detecting and extracting metadata and
> > structured text content from various documents using existing parser
> > libraries.
> >
> > Apache Tika 1.19 contains a number of improvements and bug fixes.
> > Details can be found in the changes file:
> > http://www.apache.org/dist/tika/CHANGES-1.19.txt
> >
> > Apache Tika is available on the download page:
> > http://ti

Re: [VOTE] Release Apache Tika 1.19.1 Candidate #1

2018-09-27 Thread loompa
On Wed, 26 Sep 2018 at 20:20, Tim Allison  wrote:

> A candidate for the Tika 1.19.1 release is available at:
>   https://dist.apache.org/repos/dist/dev/tika/
>
> The release candidate is a zip archive of the sources in:
>   https://github.com/apache/tika/tree/1.19.1-rc1/
>
> The SHA-512 checksum of the archive is
>
> 88c79c106d78983effc9b41147b46b3722cb7afb8c847d340d3504f56488b8a7267fd634efe638afd2a2c52419fe6b84249ac6e641d5c8c5e6e4795f004b9a45
>
> In addition, a staged maven repository is available here:
>
> https://repository.apache.org/content/repositories/orgapachetika-1044/org/apache/tika
>
> Please vote on releasing this package as Apache Tika 1.19.1.
>
> The vote is open for the next 72 hours and passes if a majority of at
> least three +1 Tika PMC votes are cast.
>
> [ ] +1 Release this package as Apache Tika 1.19.1
> [ ] -1 Do not release this package because...
>

+1 - Checksum OK, Signatures OK (although need to get you some trust, Tim)
and test results looked good.
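
For anyone wanting to repeat the checksum part of that check, here is a minimal sketch in Python. It is not the procedure I actually ran, just one way to compare a downloaded archive against the SHA-512 quoted in the vote mail. The function names and the archive file name in the usage note are my own placeholders.

```python
import hashlib


def sha512_of(path, chunk_size=1 << 20):
    """Return the hex SHA-512 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha512()
    with open(path, "rb") as f:
        # Read fixed-size chunks so a large source archive is never held in memory whole.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def checksum_matches(path, expected_hex):
    """Compare the file's digest to the published one, ignoring case and stray whitespace."""
    return sha512_of(path) == expected_hex.strip().lower()
```

Usage would be something like `checksum_matches("tika-src.zip", "88c79c10...")`, pasting the full digest from the vote mail; the file name there is an assumption, so use whatever the archive in dist/dev is actually called. This covers only the checksum, not the signature check.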

I noticed a minor issue on a clean Ubuntu 18.04 install: without python-tk
installed, the Python rotation script fails, and the build fails with it.
I've got a patch so the check looks for this, but I don't think it is worth
stopping this RC for, so I will fire it in JIRA.
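
To illustrate the kind of check I mean (this is a sketch, not the patch itself, and `tk_available` is a made-up name), the idea is to probe for the Tk binding up front and skip the rotation step when it is missing, rather than letting the script fail:

```python
import importlib


def tk_available():
    """Return True if a Tk binding can be imported by this interpreter.

    Python 3 names the module `tkinter`; Python 2 names it `Tkinter`.
    """
    for name in ("tkinter", "Tkinter"):
        try:
            importlib.import_module(name)
            return True
        except ImportError:
            # Module missing under this name; try the other spelling.
            continue
    return False
```

A caller could then treat `tk_available()` returning False the same way as a missing Python install, and skip rotation instead of failing the build.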

Thanks for rolling this RC.

Cheers,
Dave


Re: Branch_1x build broke?

2018-05-24 Thread loompa
Hey Chris,

This is happening to me with Tesseract enabled but only on my MacBook.

Are you running this on OSX?

Been trying to get some time to dig into it as it works perfectly on my
Windows and Linux setups.
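
In case it helps compare machines, here is a rough sketch (an assumption on my part about what is worth comparing, not something I have run against this failure) that reports whether a tesseract binary is on the PATH and what its first version line says. `tool_version` is just an illustrative helper name.

```python
import shutil
import subprocess


def tool_version(name="tesseract"):
    """Return the first line printed by `<name> --version`, or None if not on the PATH."""
    path = shutil.which(name)
    if path is None:
        return None
    # Merge stderr into stdout, since some tools print their version to stderr.
    result = subprocess.run(
        [path, "--version"],
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,
        universal_newlines=True,
    )
    lines = result.stdout.strip().splitlines()
    return lines[0] if lines else ""
```

Running `print(tool_version())` on the Mac, Windows and Linux boxes would at least show whether they are on the same Tesseract release.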

Cheers,
Dave



On Thu, 24 May 2018, 17:09 Chris Mattmann,  wrote:

> Tim,
>
>
>
> Are you seeing this?
>
>
>
> Results :
>
>
>
> Failed tests:
>
>
> PDFParserTest.testEmbeddedDocsWithOCROnly:1250->TikaTest.assertContains:103
> pdf_haystack not found in:
>
> http://www.w3.org/1999/xhtml";>
>
>  content="application/vnd.openxmlformats-officedocument.wordprocessingml.document"
> />
>
>  content="org.apache.tika.parser.microsoft.ooxml.OOXMLParser" />
>
> Outer_haystack
>
> Outer_haystack
>
> Outer_haystack
>
> Outer_haystack
>
> Outer_haystack
>
> attached.pdf
>
> dehayslack dehaystack dehayslack
> dehaystack dehaystack dehaystack pd'
>
> Haystack
>
> Needle
>
> Haystack
>
> Tests run: 1009, Failures: 1, Errors: 0, Skipped: 30
>
>
>
> [INFO]
>
> [INFO] Reactor Summary:
>
> [INFO]
>
> [INFO] Apache Tika parent . SUCCESS [1.565 s]
>
> [INFO] Apache Tika core ... SUCCESS [32.977 s]
>
> [INFO] Apache Tika parsers  FAILURE [05:52 min]
>
> [INFO] Apache Tika XMP  SKIPPED
>
> [INFO] Apache Tika serialization .. SKIPPED
>
> [INFO] Apache Tika batch .. SKIPPED
>
> [INFO] Apache Tika language detection . SKIPPED
>
> [INFO] Apache Tika application  SKIPPED
>
> [INFO] Apache Tika OSGi bundle  SKIPPED
>
> [INFO] Apache Tika translate .. SKIPPED
>
> [INFO] Apache Tika server . SKIPPED
>
> [INFO] Apache Tika examples ... SKIPPED
>
> [INFO] Apache Tika Java-7 Components .. SKIPPED
>
> [INFO] Apache Tika eval ... SKIPPED
>
> [INFO] Apache Tika Deep Learning (powered by DL4J)  SKIPPED
>
> [INFO] Apache Tika Natural Language Processing  SKIPPED
>
> [INFO] Apache Tika  SKIPPED
>
> [INFO]
>
> [INFO] BUILD FAILURE
>
> [INFO]
>
> [INFO] Total time: 06:27 min
>
> [INFO] Finished at: 2018-05-24T09:04:59-07:00
>
> [INFO] Final Memory: 72M/1029M
>
> [INFO]
>
> [ERROR] Failed to execute goal
> org.apache.maven.plugins:maven-surefire-plugin:2.18.1:test (default-test)
> on project tika-parsers: There are test failures.
>
> [ERROR]
>
> [ERROR] Please refer to
> /Users/mattmann/tmp/tika2.0.0/tika-parsers/target/surefire-reports for the
> individual test results.
>
> [ERROR] -> [Help 1]
>
> [ERROR]
>
> [ERROR] To see the full stack trace of the errors, re-run Maven with the
> -e switch.
>
> [ERROR] Re-run Maven using the -X switch to enable full debug logging.
>
> [ERROR]
>
> [ERROR] For more information about the errors and possible solutions,
> please read the following articles:
>
> [ERROR] [Help 1]
> http://cwiki.apache.org/confluence/display/MAVEN/MojoFailureException
>
> [ERROR]
>
> [ERROR] After correcting the problems, you can resume the build with the
> command
>
> [ERROR]   mvn  -rf :tika-parsers
>
>
>
> Keeps failing for me.
>
> nonas:tika2.0.0 mattmann$ java -version
>
> java version "1.8.0_144"
>
> Java(TM) SE Runtime Environment (build 1.8.0_144-b01)
>
> Java HotSpot(TM) 64-Bit Server VM (build 25.144-b01, mixed mode)
>
> nonas:tika2.0.0 mattmann$
>
>
>
> Any ideas?
>
>
>
> Cheers,
>
> Chris
>
>
>
>