[RESULT] WAS Re: [VOTE] Apache Nutch 1.20 Release

2024-04-24 Thread lewis john mcgibbney
Hi user@ & dev@,
I’m glad to conclude the Nutch 1.20 release candidate VOTE thread with the
following RESULT.

[5] +1 Release this package as Apache Nutch 1.20
snagel*
balakuntala*
blackice*
Joe Gilvary
lewismc*

[ ] -1 Do not release this package because…

* Nutch Project Management Committee (PMC) binding vote
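
For readers unfamiliar with how such a RESULT is counted, here is a minimal Python sketch of the tally. It assumes the rule quoted in the VOTE mail means "at least three binding +1 votes, and more binding +1 than binding -1 votes"; that reading, and the names Vote and tally, are illustrative assumptions and not part of any ASF tooling.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Vote:
    voter: str
    value: int     # +1 or -1
    binding: bool  # True when the voter is marked with * (PMC member)


def tally(votes, min_binding_plus_ones=3):
    """Count binding votes and apply the assumed pass rule."""
    plus = sum(1 for v in votes if v.binding and v.value > 0)
    minus = sum(1 for v in votes if v.binding and v.value < 0)
    # Assumption: pass needs >= 3 binding +1 and a majority over binding -1.
    passed = plus >= min_binding_plus_ones and plus > minus
    return plus, minus, passed


# Votes as listed in the RESULT mail above.
votes = [
    Vote("snagel", +1, True),
    Vote("balakuntala", +1, True),
    Vote("blackice", +1, True),
    Vote("Joe Gilvary", +1, False),
    Vote("lewismc", +1, True),
]

binding_plus, binding_minus, passed = tally(votes)
print(binding_plus, binding_minus, passed)
```

Non-binding votes are kept in the list but ignored by the pass rule, which mirrors how the RESULT above reports five +1 votes of which four are binding.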

The Nutch 1.20 release candidate has passed the community VOTE. I will
therefore promote this release candidate.

Thanks to everyone who voted and to everyone who contributed to the Apache
Nutch 1.20 release.

lewismc

On Tue, Apr 9, 2024 at 2:28 PM lewis john mcgibbney 
wrote:

> Hi Folks,
>
> A first candidate for the Nutch 1.20 release is available at [0] where
> accompanying SHA512 and ASC signatures can also be found.
> Information on verifying releases can be found at [1].
>
> The release candidate comprises a .zip and tar.gz archive of the sources
> at [2] and complementary binary distributions. In addition, a staged maven
> repository is available at [3].
>
> The Nutch 1.20 release report is available at [4].
>
> Please vote on releasing this package as Apache Nutch 1.20. The vote is
> open for at least the next 72 hours and passes if a majority of at least
> three +1 Nutch PMC votes are cast.
>
> [ ] +1 Release this package as Apache Nutch 1.20.
>
> [ ] -1 Do not release this package because…
>
> Cheers,
> lewismc
> P.S. Here is my +1.
>
> [0] https://dist.apache.org/repos/dist/dev/nutch/1.20
> [1] http://nutch.apache.org/downloads.html#verify
> [2] https://github.com/apache/nutch/tree/release-1.20
> [3]
> https://repository.apache.org/content/repositories/orgapachenutch-1021/
> [4] https://s.apache.org/ovjf3
>
> --
> http://home.apache.org/~lewismc/
> http://people.apache.org/keys/committer/lewismc
>


-- 
http://home.apache.org/~lewismc/
http://people.apache.org/keys/committer/lewismc


Re: [VOTE] Apache Nutch 1.20 Release

2024-04-22 Thread BlackIce
My apologies, I don't have a working environment to test with this time.

On Sun, Apr 21, 2024, 02:36 Joe Gilvary  wrote:

> I crawled and indexed (to Solr 9.2) a mix of PDF, RTF, EPUB, plain text,
> and more with no issues today. Please accept my +1 as well.
>
>  Thanks, stay safe, stay healthy,
>
>  Joe
>
> On 19/04/2024 01:07, Shashanka Balakuntala wrote:
>
> Hi Lewis,
> Thanks for preparing this.
>
> Here is my +1
>
> I have verified the build being successful and unit tests running,
> additionally ran a small crawl in local hadoop and verified the same.
>
> *Regards*
>   Shashanka Balakuntala Srinivasa
>
>
>
> On Fri, 19 Apr 2024 at 4:47 AM, Joe Gilvary 
> wrote:
>
>> Just catching up now after the eclipse road trip, kicking the tires on
>> the bin rc it's looking good to me. Maybe it's my imagination, or maybe
>> it's just old PDFs that I pointed Nutch at, but it seems that Tika
>> complains more often. I'll try to get a more thorough run through this
>> weekend, but overall, I'm looking forward to seeing the actual release
>> running a production workload.
>>
>>  Thanks, stay safe, stay healthy,
>>
>>  Joe
>>
>> On 16/04/2024 10:46, lewis john mcgibbney wrote:
>>
>> Hi user@, dev@,
>> Please consider reviewing the Nutch 1.20 release candidate. This is a
>> critical prerequisite for us making releases of software at TheASF.
>> Thank you
>> lewismc
>>
>> On Tue, Apr 9, 2024 at 2:28 PM lewis john mcgibbney wrote:
>>
>>
>> Hi Folks,
>>
>> A first candidate for the Nutch 1.20 release is available at [0] where
>> accompanying SHA512 and ASC signatures can also be found.
>> Information on verifying releases can be found at [1].
>>
>> The release candidate comprises a .zip and tar.gz archive of the sources
>> at [2] and complementary binary distributions. In addition, a staged maven
>> repository is available at [3].
>>
>> The Nutch 1.20 release report is available at [4].
>>
>> Please vote on releasing this package as Apache Nutch 1.20. The vote is
>> open for at least the next 72 hours and passes if a majority of at least
>> three +1 Nutch PMC votes are cast.
>>
>> [ ] +1 Release this package as Apache Nutch X.XX.
>>
>> [ ] -1 Do not release this package because…
>>
>> Cheers,
>> lewismc
>> P.S. Here is my +1.
>>
>> [0] https://dist.apache.org/repos/dist/dev/nutch/1.20
>> [1] http://nutch.apache.org/downloads.html#verify
>> [2] https://github.com/apache/nutch/tree/release-1.20
>> [3] https://repository.apache.org/content/repositories/orgapachenutch-1021/
>> [4] https://s.apache.org/ovjf3
>>
>> --
>> http://home.apache.org/~lewismc/
>> http://people.apache.org/keys/committer/lewismc
>>
>>
>>
>


Re: [VOTE] Apache Nutch 1.20 Release

2024-04-20 Thread Joe Gilvary
I crawled and indexed (to Solr 9.2) a mix of PDF, RTF, EPUB, plain text, 
and more with no issues today. Please accept my +1 as well.


 Thanks, stay safe, stay healthy,

 Joe

On 19/04/2024 01:07, Shashanka Balakuntala wrote:

Hi Lewis,
Thanks for preparing this.

Here is my +1

I have verified the build being successful and unit tests running, 
additionally ran a small crawl in local hadoop and verified the same.


_Regards_
Shashanka Balakuntala Srinivasa



On Fri, 19 Apr 2024 at 4:47 AM, Joe Gilvary 
 wrote:


Just catching up now after the eclipse road trip, kicking the
tires on the bin rc it's looking good to me. Maybe it's my
imagination, or maybe it's just old PDFs that I pointed Nutch at,
but it seems that Tika complains more often. I'll try to get a
more thorough run through this weekend, but overall, I'm looking
forward to seeing the actual release running a production workload.

 Thanks, stay safe, stay healthy,

 Joe

On 16/04/2024 10:46, lewis john mcgibbney wrote:

Hi user@, dev@,
Please consider reviewing the Nutch 1.20 release candidate. This is a
critical prerequisite for us making releases of software at TheASF.
Thank you
lewismc

On Tue, Apr 9, 2024 at 2:28 PM lewis john mcgibbney wrote:


Hi Folks,

A first candidate for the Nutch 1.20 release is available at [0] where
accompanying SHA512 and ASC signatures can also be found.
Information on verifying releases can be found at [1].

The release candidate comprises a .zip and tar.gz archive of the sources
at [2] and complementary binary distributions. In addition, a staged maven
repository is available at [3].

The Nutch 1.20 release report is available at [4].

Please vote on releasing this package as Apache Nutch 1.20. The vote is
open for at least the next 72 hours and passes if a majority of at least
three +1 Nutch PMC votes are cast.

[ ] +1 Release this package as Apache Nutch X.XX.

[ ] -1 Do not release this package because…

Cheers,
lewismc
P.S. Here is my +1.

[0] https://dist.apache.org/repos/dist/dev/nutch/1.20
[1] http://nutch.apache.org/downloads.html#verify
[2] https://github.com/apache/nutch/tree/release-1.20
[3] https://repository.apache.org/content/repositories/orgapachenutch-1021/
[4] https://s.apache.org/ovjf3

--
http://home.apache.org/~lewismc/
http://people.apache.org/keys/committer/lewismc





Re: [VOTE] Apache Nutch 1.20 Release

2024-04-18 Thread Shashanka Balakuntala
Hi Lewis,
Thanks for preparing this.

Here is my +1

I have verified that the build succeeds and the unit tests run. I
additionally ran a small crawl on local Hadoop and verified that as well.

*Regards*
  Shashanka Balakuntala Srinivasa



On Fri, 19 Apr 2024 at 4:47 AM, Joe Gilvary 
wrote:

> Just catching up now after the eclipse road trip, kicking the tires on the
> bin rc it's looking good to me. Maybe it's my imagination, or maybe it's
> just old PDFs that I pointed Nutch at, but it seems that Tika complains
> more often. I'll try to get a more thorough run through this weekend, but
> overall, I'm looking forward to seeing the actual release running a
> production workload.
>
>  Thanks, stay safe, stay healthy,
>
>  Joe
>
> On 16/04/2024 10:46, lewis john mcgibbney wrote:
>
> Hi user@, dev@,
> Please consider reviewing the Nutch 1.20 release candidate. This is a
> critical prerequisite for us making releases of software at TheASF.
> Thank you
> lewismc
>
> On Tue, Apr 9, 2024 at 2:28 PM lewis john mcgibbney wrote:
>
>
> Hi Folks,
>
> A first candidate for the Nutch 1.20 release is available at [0] where
> accompanying SHA512 and ASC signatures can also be found.
> Information on verifying releases can be found at [1].
>
> The release candidate comprises a .zip and tar.gz archive of the sources
> at [2] and complementary binary distributions. In addition, a staged maven
> repository is available at [3].
>
> The Nutch 1.20 release report is available at [4].
>
> Please vote on releasing this package as Apache Nutch 1.20. The vote is
> open for at least the next 72 hours and passes if a majority of at least
> three +1 Nutch PMC votes are cast.
>
> [ ] +1 Release this package as Apache Nutch X.XX.
>
> [ ] -1 Do not release this package because…
>
> Cheers,
> lewismc
> P.S. Here is my +1.
>
> [0] https://dist.apache.org/repos/dist/dev/nutch/1.20
> [1] http://nutch.apache.org/downloads.html#verify
> [2] https://github.com/apache/nutch/tree/release-1.20
> [3] https://repository.apache.org/content/repositories/orgapachenutch-1021/
> [4] https://s.apache.org/ovjf3
>
> --
> http://home.apache.org/~lewismc/
> http://people.apache.org/keys/committer/lewismc
>
>
>


Re: [VOTE] Apache Nutch 1.20 Release

2024-04-18 Thread Joe Gilvary
Just catching up now after the eclipse road trip. I've been kicking the
tires on the bin RC and it's looking good to me. Maybe it's my imagination,
or maybe it's just old PDFs that I pointed Nutch at, but it seems that Tika
complains more often. I'll try to get a more thorough run through this
weekend, but overall, I'm looking forward to seeing the actual release
running a production workload.


 Thanks, stay safe, stay healthy,

 Joe

On 16/04/2024 10:46, lewis john mcgibbney wrote:

Hi user@, dev@,
Please consider reviewing the Nutch 1.20 release candidate. This is a
critical prerequisite for us making releases of software at TheASF.
Thank you
lewismc

On Tue, Apr 9, 2024 at 2:28 PM lewis john mcgibbney
wrote:


Hi Folks,

A first candidate for the Nutch 1.20 release is available at [0] where
accompanying SHA512 and ASC signatures can also be found.
Information on verifying releases can be found at [1].

The release candidate comprises a .zip and tar.gz archive of the sources
at [2] and complementary binary distributions. In addition, a staged maven
repository is available at [3].

The Nutch 1.20 release report is available at [4].

Please vote on releasing this package as Apache Nutch 1.20. The vote is
open for at least the next 72 hours and passes if a majority of at least
three +1 Nutch PMC votes are cast.

[ ] +1 Release this package as Apache Nutch X.XX.

[ ] -1 Do not release this package because…

Cheers,
lewismc
P.S. Here is my +1.

[0] https://dist.apache.org/repos/dist/dev/nutch/1.20
[1] http://nutch.apache.org/downloads.html#verify
[2] https://github.com/apache/nutch/tree/release-1.20
[3] https://repository.apache.org/content/repositories/orgapachenutch-1021/
[4] https://s.apache.org/ovjf3

--
http://home.apache.org/~lewismc/
http://people.apache.org/keys/committer/lewismc





Re: [VOTE] Apache Nutch 1.20 Release

2024-04-16 Thread lewis john mcgibbney
Hi user@, dev@,
Please consider reviewing the Nutch 1.20 release candidate. This is a
critical prerequisite for us making releases of software at TheASF.
Thank you
lewismc

On Tue, Apr 9, 2024 at 2:28 PM lewis john mcgibbney 
wrote:

> Hi Folks,
>
> A first candidate for the Nutch 1.20 release is available at [0] where
> accompanying SHA512 and ASC signatures can also be found.
> Information on verifying releases can be found at [1].
>
> The release candidate comprises a .zip and tar.gz archive of the sources
> at [2] and complementary binary distributions. In addition, a staged maven
> repository is available at [3].
>
> The Nutch 1.20 release report is available at [4].
>
> Please vote on releasing this package as Apache Nutch 1.20. The vote is
> open for at least the next 72 hours and passes if a majority of at least
> three +1 Nutch PMC votes are cast.
>
> [ ] +1 Release this package as Apache Nutch X.XX.
>
> [ ] -1 Do not release this package because…
>
> Cheers,
> lewismc
> P.S. Here is my +1.
>
> [0] https://dist.apache.org/repos/dist/dev/nutch/1.20
> [1] http://nutch.apache.org/downloads.html#verify
> [2] https://github.com/apache/nutch/tree/release-1.20
> [3]
> https://repository.apache.org/content/repositories/orgapachenutch-1021/
> [4] https://s.apache.org/ovjf3
>
> --
> http://home.apache.org/~lewismc/
> http://people.apache.org/keys/committer/lewismc
>


-- 
http://home.apache.org/~lewismc/
http://people.apache.org/keys/committer/lewismc


Re: [VOTE] Apache Nutch 1.20 Release

2024-04-11 Thread Lewis John McGibbney
Hi Seb,

On 2024/04/11 13:30:53 Sebastian Nagel wrote:
> 
> https://github.com/sebastian-nagel/nutch-test-single-node-cluster/

I think we should make this into an integration test suite and run it as part 
of CI. I’ve been meaning and wanting to do this for the __longest__ time…!

> 
> One note about the CHANGES.md: it's now a mixture of HTML and plain text.
> It does not use the potential of markdown, e.g. sections / headlines for
> the releases to make the change log navigable via a table of contents.
> The embedded HTML makes it less readable if viewed in a text editor.
> The rendering on Github [5] is acceptable with only minor glitches,
> mostly the placement of multiple lines in a single paragraph:
>   https://github.com/apache/nutch/blob/branch-1.20/CHANGES.md
> We also have a change log on Jira:
>   https://s.apache.org/ovjf3
> That's why I wouldn't call the CHANGES.md a "blocker". We should
> update the formatting after the release to make it again easily
> readable in source code and improve the document structure utilizing
> the markdown markup.

Excellent suggestion. I was focusing on including the hyperlinks and clearly 
compromised other change log benefits. I will address this after the release. 
Thank you


Re: [VOTE] Apache Nutch 1.20 Release

2024-04-11 Thread Sebastian Nagel

Hi Lewis,

here's my +1

 * signatures of release packages are valid
 * build from the source package successful, unit tests pass
 * tested a few Nutch tools in the binary package (local mode)
 * ran a sample crawl and tested many Nutch tools on a single-node cluster
   running Hadoop 3.4.0, see
   https://github.com/sebastian-nagel/nutch-test-single-node-cluster/
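
The checksum half of the first check in the list above can be approximated with a short Python sketch. It is only an illustration under stated assumptions: the file names are hypothetical, the helper names sha512_of and checksum_matches are made up here, and the .sha512 file is assumed to contain the hex digest somewhere in its text (either "digest  filename" or a "filename: GROUPED DIGEST" layout). It does not replace verifying the ASC signature with gpg --verify as described at the project's verification page.

```python
import hashlib
import tempfile
from pathlib import Path


def sha512_of(path, chunk_size=1 << 20):
    """Return the lowercase hex SHA-512 digest of a file, read in chunks."""
    digest = hashlib.sha512()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def checksum_matches(artifact, checksum_file):
    """Check that the artifact's digest appears in the checksum file.

    Assumption: whitespace and letter case inside the checksum file are
    not significant, so both are normalised before comparing.
    """
    normalised = "".join(Path(checksum_file).read_text().split()).lower()
    return sha512_of(artifact) in normalised


# Self-contained demo on placeholder data (not a real release artifact).
with tempfile.TemporaryDirectory() as tmp:
    artifact = Path(tmp) / "example-src.tar.gz"  # hypothetical name
    artifact.write_bytes(b"placeholder bytes, not a real release")
    digest = sha512_of(artifact)

    plain = Path(tmp) / "plain.sha512"
    plain.write_text(f"{digest}  {artifact.name}\n")

    grouped = " ".join(
        digest.upper()[i:i + 8] for i in range(0, len(digest), 8)
    )
    spaced = Path(tmp) / "spaced.sha512"
    spaced.write_text(f"{artifact.name}: {grouped}\n")

    wrong = Path(tmp) / "wrong.sha512"
    wrong.write_text("0" * 128 + f"  {artifact.name}\n")

    ok_plain = checksum_matches(artifact, plain)
    ok_spaced = checksum_matches(artifact, spaced)
    ok_wrong = checksum_matches(artifact, wrong)
```

Reading the file in chunks keeps memory use flat for large archives, and normalising whitespace lets the same comparison handle both checksum layouts assumed above.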

One note about the CHANGES.md: it's now a mixture of HTML and plain text.
It does not use the potential of markdown, e.g. sections / headlines for
the releases to make the change log navigable via a table of contents.
The embedded HTML makes it less readable if viewed in a text editor.
The rendering on Github [5] is acceptable with only minor glitches,
mostly the placement of multiple lines in a single paragraph:
  https://github.com/apache/nutch/blob/branch-1.20/CHANGES.md
We also have a change log on Jira:
  https://s.apache.org/ovjf3
That's why I wouldn't call the CHANGES.md a "blocker". We should
update the formatting after the release to make it again easily
readable in source code and improve the document structure utilizing
the markdown markup.

~Sebastian

On 4/9/24 23:28, lewis john mcgibbney wrote:

Hi Folks,

A first candidate for the Nutch 1.20 release is available at [0] where 
accompanying SHA512 and ASC signatures can also be found.

Information on verifying releases can be found at [1].

The release candidate comprises a .zip and tar.gz archive of the sources at [2] 
and complementary binary distributions. In addition, a staged maven repository 
is available at [3].


The Nutch 1.20 release report is available at [4].

Please vote on releasing this package as Apache Nutch 1.20. The vote is open for 
at least the next 72 hours and passes if a majority of at least three +1 Nutch 
PMC votes are cast.


[ ] +1 Release this package as Apache Nutch X.XX.

[ ] -1 Do not release this package because…

Cheers,
lewismc
P.S. Here is my +1.

[0] https://dist.apache.org/repos/dist/dev/nutch/1.20 

[1] http://nutch.apache.org/downloads.html#verify 

[2] https://github.com/apache/nutch/tree/release-1.20 

[3] https://repository.apache.org/content/repositories/orgapachenutch-1021/ 


[4] https://s.apache.org/ovjf3 

--
http://home.apache.org/~lewismc/ 
http://people.apache.org/keys/committer/lewismc 



[VOTE] Apache Nutch 1.20 Release

2024-04-09 Thread lewis john mcgibbney
Hi Folks,

A first candidate for the Nutch 1.20 release is available at [0] where
accompanying SHA512 and ASC signatures can also be found.
Information on verifying releases can be found at [1].

The release candidate comprises a .zip and tar.gz archive of the sources at
[2] and complementary binary distributions. In addition, a staged maven
repository is available at [3].

The Nutch 1.20 release report is available at [4].

Please vote on releasing this package as Apache Nutch 1.20. The vote is
open for at least the next 72 hours and passes if a majority of at least
three +1 Nutch PMC votes are cast.

[ ] +1 Release this package as Apache Nutch X.XX.

[ ] -1 Do not release this package because…

Cheers,
lewismc
P.S. Here is my +1.

[0] https://dist.apache.org/repos/dist/dev/nutch/1.20
[1] http://nutch.apache.org/downloads.html#verify
[2] https://github.com/apache/nutch/tree/release-1.20
[3] https://repository.apache.org/content/repositories/orgapachenutch-1021/
[4] https://s.apache.org/ovjf3

--
http://home.apache.org/~lewismc/
http://people.apache.org/keys/committer/lewismc