[ANNOUNCE] Apache Nutch 1.20 Release

2024-04-30 Thread lewis john mcgibbney
The Apache Nutch Project Management Committee is pleased to announce the release of Apache Nutch v1.20. We strongly encourage users to upgrade to this release. Nutch is a well-matured, production-ready Web crawler. Nutch 1.x enables fine-grained configuration, relying on Apache Hadoop™ data

[jira] [Closed] (NUTCH-3054) Address deprecation of Node16 for all GitHub Actions

2024-04-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-3054. --- > Address deprecation of Node16 for all GitHub Acti

[jira] [Resolved] (NUTCH-3054) Address deprecation of Node16 for all GitHub Actions

2024-04-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-3054. - Resolution: Fixed > Address deprecation of Node16 for all GitHub Acti

[jira] [Updated] (NUTCH-3054) Address deprecation of Node16 for all GitHub Actions

2024-04-29 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-3054: Affects Version/s: 1.20 > Address deprecation of Node16 for all GitHub Acti

[jira] [Created] (NUTCH-3054) Address deprecation of Node16 for all GitHub Actions

2024-04-29 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-3054: --- Summary: Address deprecation of Node16 for all GitHub Actions Key: NUTCH-3054 URL: https://issues.apache.org/jira/browse/NUTCH-3054 Project: Nutch

[jira] [Work started] (NUTCH-3054) Address deprecation of Node16 for all GitHub Actions

2024-04-29 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-3054 started by Lewis John McGibbney. --- > Address deprecation of Node16 for all GitHub Acti

[jira] [Commented] (NUTCH-3049) Investigate using Records

2024-04-29 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17842208#comment-17842208 ] Lewis John McGibbney commented on NUTCH-3049: - I think that each of the Writable classes

Re: [DISCUSS] Consolidating Nutch Continuous Integration

2024-04-29 Thread Lewis John McGibbney
Hi Sebastian, Understood. If it ain’t broke don’t fix it. Thanks for the input. On 2024/04/28 12:08:27 Sebastian Nagel wrote: > > From my side: no. It may not harm to have both. > > Best, > Sebastian

[jira] [Created] (NUTCH-3053) Upgrade build and CI to JDK17

2024-04-29 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-3053: --- Summary: Upgrade build and CI to JDK17 Key: NUTCH-3053 URL: https://issues.apache.org/jira/browse/NUTCH-3053 Project: Nutch Issue Type: Sub

[jira] [Created] (NUTCH-3052) Investigate using sealed classes

2024-04-29 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-3052: --- Summary: Investigate using sealed classes Key: NUTCH-3052 URL: https://issues.apache.org/jira/browse/NUTCH-3052 Project: Nutch Issue Type: Sub

[jira] [Created] (NUTCH-3051) Investigate using new pattern matching syntax in switch expressions

2024-04-29 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-3051: --- Summary: Investigate using new pattern matching syntax in switch expressions Key: NUTCH-3051 URL: https://issues.apache.org/jira/browse/NUTCH-3051

[jira] [Created] (NUTCH-3050) Investigate use of the enhanced instanceof operator

2024-04-29 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-3050: --- Summary: Investigate use of the enhanced instanceof operator Key: NUTCH-3050 URL: https://issues.apache.org/jira/browse/NUTCH-3050 Project: Nutch

[jira] [Created] (NUTCH-3049) Investigate using Records

2024-04-29 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-3049: --- Summary: Investigate using Records Key: NUTCH-3049 URL: https://issues.apache.org/jira/browse/NUTCH-3049 Project: Nutch Issue Type: Sub-task

[jira] [Created] (NUTCH-3048) Investigate where/if new string utility methods could be used

2024-04-29 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-3048: --- Summary: Investigate where/if new string utility methods could be used Key: NUTCH-3048 URL: https://issues.apache.org/jira/browse/NUTCH-3048 Project

[jira] [Created] (NUTCH-3047) Use multi-line text blocks

2024-04-29 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-3047: --- Summary: Use multi-line text blocks Key: NUTCH-3047 URL: https://issues.apache.org/jira/browse/NUTCH-3047 Project: Nutch Issue Type: Sub-task

[jira] [Updated] (NUTCH-3046) Use compact strings

2024-04-29 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-3046: Description: Follow the guidance at [https://www.baeldung.com/java-migrate-8

[jira] [Created] (NUTCH-3046) Use compact strings

2024-04-28 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-3046: --- Summary: Use compact strings Key: NUTCH-3046 URL: https://issues.apache.org/jira/browse/NUTCH-3046 Project: Nutch Issue Type: Sub-task

[jira] [Created] (NUTCH-3045) Upgrade from Java 11 to 17

2024-04-28 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-3045: --- Summary: Upgrade from Java 11 to 17 Key: NUTCH-3045 URL: https://issues.apache.org/jira/browse/NUTCH-3045 Project: Nutch Issue Type: Task

[ANNOUNCE] Apache Nutch 1.20 Release

2024-04-28 Thread lewis john mcgibbney
The Apache Nutch Project https://nutch.apache.org/download/ Please verify signatures using the KEYS file https://raw.githubusercontent.com/apache/nutch/master/KEYS when downloading the release. This release includes more than 60 bug fixes and improvements; the full list of changes can be seen in

Re: [DISCUSS] Consolidating Nutch Continuous Integration

2024-04-25 Thread Lewis John McGibbney
A better reference for the GitHub Actions can be found at https://github.com/apache/nutch/actions lewismc On 2024/04/25 14:40:35 lewis john mcgibbney wrote: > Hi dev@, > > We currently maintain a combination of Jenkins [0] and GitHub Actions [1] > for CI. > > For the longe

[DISCUSS] Consolidating Nutch Continuous Integration

2024-04-25 Thread lewis john mcgibbney
Hi dev@, We currently maintain a combination of Jenkins [0] and GitHub Actions [1] for CI. For the longest time, we relied solely on Jenkins. This was really useful particularly when committers were pulling build artifacts from Jenkins nightly and relied on SVN trunk being stable. The Jenkins

[RESULT] WAS Re: [VOTE] Apache Nutch 1.20 Release

2024-04-24 Thread lewis john mcgibbney
ttee-binding The Nutch 1.20 release candidate has passed the community VOTE. I will therefore promote this release candidate. Thanks for VOTE’ing and to everyone who contributed to the Apache Nutch 1.20 release. lewismc On Tue, Apr 9, 2024 at 2:28 PM lewis john mcgibbney wrote: > H

Re: Help posting question

2024-04-24 Thread Lewis John McGibbney
Hi Sheham, On 2024/04/20 08:47:41 Sheham Izat wrote: > The Fetcher job was aborted, does that still mean that it went through the > entire list of seed urls? Yes it processed the entire generated segment but the fetcher… * hung on https://disneyland.disney.go.com/, https://api.onlyoffice.com/,

[jira] [Updated] (NUTCH-3042) Use GitHub cache action to improve CI execution time

2024-04-19 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-3042: Description: With the Ant+Ivy build architecture, the current GitHub actions

[jira] [Created] (NUTCH-3042) Use GitHub cache action to improve CI execution time

2024-04-19 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-3042: --- Summary: Use GitHub cache action to improve CI execution time Key: NUTCH-3042 URL: https://issues.apache.org/jira/browse/NUTCH-3042 Project: Nutch

[jira] [Work started] (NUTCH-3041) Address confusing logging in o.a.n.net.URLExemptionFilters

2024-04-19 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-3041 started by Lewis John McGibbney. --- > Address confusing logging in o.a.n.net.URLExemptionFilt

[jira] [Updated] (NUTCH-3041) Address confusing logging in o.a.n.net.URLExemptionFilters

2024-04-19 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-3041: Description: URLExemptionFilter implementations are used to allow exemptions

[jira] [Updated] (NUTCH-3041) Address confusing logging in o.a.n.net.URLExemptionFilters

2024-04-19 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-3041: Description: URLExemptionFilter implementations are used to allow exemptions

Re: Help posting question

2024-04-19 Thread Lewis John McGibbney
Hi Sheham, On 2024/04/19 15:18:01 Sheham Izat wrote: > > My questions are: > > 1) What do I need to do to get Nutch to continue working even if there are > hung threads? From what I can see in the log you provided, nothing is preventing Nutch from continuing to work. The Fetcher job

[jira] [Created] (NUTCH-3041) Address confusing logging in o.a.n.net.URLExemptionFilters

2024-04-19 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-3041: --- Summary: Address confusing logging in o.a.n.net.URLExemptionFilters Key: NUTCH-3041 URL: https://issues.apache.org/jira/browse/NUTCH-3041 Project

[jira] [Closed] (COMDEV-544) Improve comdev website navigation to GSoC mentor resources

2024-04-18 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/COMDEV-544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed COMDEV-544. --- > Improve comdev website navigation to GSoC mentor resour

[jira] [Resolved] (COMDEV-544) Improve comdev website navigation to GSoC mentor resources

2024-04-18 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/COMDEV-544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved COMDEV-544. - Resolution: Fixed Thanks [~rbowen] for merging. > Improve comdev webs

[jira] [Commented] (COMDEV-544) Improve comdev website navigation to GSoC mentor resources

2024-04-18 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/COMDEV-544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17838694#comment-17838694 ] Lewis John McGibbney commented on COMDEV-544: - [~sebb] thank you, I was on a mobile device

[jira] [Updated] (COMDEV-544) Improve comdev website navigation to GSoC mentor resources

2024-04-18 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/COMDEV-544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated COMDEV-544: Description: h1. Purpose Improve comdev website navigation to Google Summer

[jira] [Commented] (COMDEV-544) Improve comdev website navigation to GSoC mentor resources

2024-04-18 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/COMDEV-544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17838692#comment-17838692 ] Lewis John McGibbney commented on COMDEV-544: - Thank you both. > Improve comdev webs

[jira] [Created] (COMDEV-544) Improve comdev website navigation to GSoC mentor resources

2024-04-18 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created COMDEV-544: --- Summary: Improve comdev website navigation to GSoC mentor resources Key: COMDEV-544 URL: https://issues.apache.org/jira/browse/COMDEV-544 Project

Re: Where is GSoC communications taking place?

2024-04-18 Thread Lewis John Mcgibbney
nks, > Sanyam Goel > > On Thu, Apr 18, 2024 at 4:07 AM lewis john mcgibbney > wrote: > >> Hi dev@, >> Can someone please point me to the GSoC happenings? I’ve not heard >> anything >> since been approved on the Nutch mailing list >> https://list

Where is GSoC communications taking place?

2024-04-17 Thread lewis john mcgibbney
Hi dev@, Can someone please point me to the GSoC happenings? I’ve not heard anything since being approved on the Nutch mailing list https://lists.apache.org/thread/tk8x6sf2mt1lt0v10j30djqjk6vwpgb2 Thanks in advance. lewismc -- http://home.apache.org/~lewismc/

Re: [VOTE] Apache Nutch 1.20 Release

2024-04-16 Thread lewis john mcgibbney
Hi user@, dev@, Please consider reviewing the Nutch 1.20 release candidate. This is a critical prerequisite for us making releases of software at TheASF. Thank you lewismc On Tue, Apr 9, 2024 at 2:28 PM lewis john mcgibbney wrote: > Hi Folks, > > A first candidate for the Nutch 1.2

Re: [VOTE] Apache Nutch 1.20 Release

2024-04-11 Thread Lewis John McGibbney
Hi Seb, On 2024/04/11 13:30:53 Sebastian Nagel wrote: > > https://github.com/sebastian-nagel/nutch-test-single-node-cluster/ I think we should make this into an integration test suite and run it as part of CI. I’ve been meaning and wanting to do this for the __longest__ time…! > > One

Re: Mentor request for lewismc

2024-04-09 Thread Lewis John McGibbney
> > ACK! > > > > Kind regards, > > Furkan Kamaci > > > > On Sun, Apr 7, 2024 at 8:45 PM lewis john mcgibbney > > wrote: > > > > > Hi Nutch PMC, > > > Please acknowledge and approve my request to mentor this year's GSoC > > > program. > > > An ACK is sufficient. > > > Thank you > > > lewismc > > > > > >

[VOTE] Apache Nutch 1.20 Release

2024-04-09 Thread lewis john mcgibbney
Hi Folks, A first candidate for the Nutch 1.20 release is available at [0] where accompanying SHA512 and ASC signatures can also be found. Information on verifying releases can be found at [1]. The release candidate comprises a .zip and tar.gz archive of the sources at [2] and complementary

Re: [DISCUSS] Interest in rolling a release

2024-04-09 Thread lewis john mcgibbney
ad one test failure and > maybe we should fix that before the release. > > On Sat, Apr 6, 2024 at 2:39 AM lewis john mcgibbney > wrote: > >> Hi dev@, >> What is the current status of Gora with regards to rolling a 0.10? Or >> Maybe >> a 1.0? >> Thanks >

[jira] [Resolved] (NUTCH-3038) Address issues discovered during 1.20 release management dryrun

2024-04-08 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-3038. - Resolution: Fixed > Address issues discovered during 1.20 release managem

[jira] [Closed] (NUTCH-3038) Address issues discovered during 1.20 release management dryrun

2024-04-08 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-3038. --- Thanks [~snagel]  > Address issues discovered during 1.20 release management dry

[jira] [Work stopped] (NUTCH-3038) Address issues discovered during 1.20 release management dryrun

2024-04-08 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-3038 stopped by Lewis John McGibbney. --- > Address issues discovered during 1.20 release management dry

[jira] [Commented] (TIKA-4232) Create and execute unit tests for tika-helm

2024-04-08 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17835077#comment-17835077 ] Lewis John McGibbney commented on TIKA-4232: It turns out that the original GitHub action I

Mentor request for lewismc

2024-04-07 Thread lewis john mcgibbney
Hi Nutch PMC, Please acknowledge and approve my request to mentor this year's GSoC program. An ACK is sufficient. Thank you lewismc

[jira] [Work started] (NUTCH-3038) Address issues discovered during 1.20 release management dryrun

2024-04-05 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-3038 started by Lewis John McGibbney. --- > Address issues discovered during 1.20 release management dry

[jira] [Updated] (NUTCH-3038) Address issues discovered during 1.20 release management dryrun

2024-04-05 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-3038: Description: During the 1.20 release management dryrun I discovered the following

[jira] [Created] (NUTCH-3038) Address issues discovered during 1.20 release management dryrun

2024-04-05 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-3038: --- Summary: Address issues discovered during 1.20 release management dryrun Key: NUTCH-3038 URL: https://issues.apache.org/jira/browse/NUTCH-3038 Project

[DISCUSS] Interest in rolling a release

2024-04-05 Thread lewis john mcgibbney
Hi dev@, What is the current status of Gora with regards to rolling a 0.10? Or maybe a 1.0? Thanks lewismc -- http://home.apache.org/~lewismc/ http://people.apache.org/keys/committer/lewismc

[jira] [Closed] (NUTCH-3032) Indexing plugin as an adapter for end user's own POJO instances

2024-04-04 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-3032. --- Thanks [~jglvary] and congratulations on your first contribution to Apache Nutch

Re: [ANNOUNCE] Apache Tika 2.9.2 released

2024-04-02 Thread lewis john mcgibbney
luence/display/TIKA/Release+Process+for+tika-helm > > Help... > > On Tue, Apr 2, 2024 at 3:22 PM Tim Allison wrote: > > > > I did a global and thoughtless find/replace. Please review and merge > > if this makes sense: https://github.com/apache/tika-helm/pull/19 > > &

[jira] [Created] (TIKA-4233) Check tika-helm for deprecated k8s APIs

2024-03-30 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created TIKA-4233: -- Summary: Check tika-helm for deprecated k8s APIs Key: TIKA-4233 URL: https://issues.apache.org/jira/browse/TIKA-4233 Project: Tika Issue Type

[jira] [Created] (TIKA-4232) Create and execute unit tests for tika-helm

2024-03-30 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created TIKA-4232: -- Summary: Create and execute unit tests for tika-helm Key: TIKA-4232 URL: https://issues.apache.org/jira/browse/TIKA-4232 Project: Tika Issue

[jira] [Commented] (TIKA-4227) Register tika-helm Chart in artifacthub.io

2024-03-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17832505#comment-17832505 ] Lewis John McGibbney commented on TIKA-4227: Available at [https://artifacthub.io/packages

[jira] [Resolved] (TIKA-4227) Register tika-helm Chart in artifacthub.io

2024-03-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved TIKA-4227. Resolution: Fixed > Register tika-helm Chart in artifacthub

[jira] [Closed] (TIKA-4227) Register tika-helm Chart in artifacthub.io

2024-03-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed TIKA-4227. -- > Register tika-helm Chart in artifacthub

tika-helm now on artifacthub.io

2024-03-30 Thread lewis john mcgibbney
Hi user@, dev@, For those running Tika on Kubernetes, you can now conveniently find the Helm Chart via artifacthub.io https://artifacthub.io/packages/helm/apache-tika/tika I’ll build in a little more automation so that this thing just takes care of itself. Thanks to all contributors. lewismc

[jira] [Updated] (NUTCH-3032) Indexing plugin as an adapter for end user's own POJO instances

2024-03-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-3032: Fix Version/s: 1.20 > Indexing plugin as an adapter for end user's own P

[jira] [Assigned] (NUTCH-3032) Indexing plugin as an adapter for end user's own POJO instances

2024-03-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-3032: --- Assignee: Joe Gilvary > Indexing plugin as an adapter for end user's

[jira] [Work stopped] (NUTCH-2856) Implement a protocol-smb plugin based on hierynomus/smbj

2024-03-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2856 stopped by Lewis John McGibbney. --- > Implement a protocol-smb plugin based on hierynomus/s

[jira] [Work stopped] (NUTCH-2887) Migrate to JUnit 5 Jupiter

2024-03-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2887 stopped by Lewis John McGibbney. --- > Migrate to JUnit 5 Jupi

[jira] [Closed] (NUTCH-2832) Create tutorial on sending Nutch logs to Elasticsearch

2024-03-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-2832. --- > Create tutorial on sending Nutch logs to Elasticsea

[jira] [Resolved] (NUTCH-2832) Create tutorial on sending Nutch logs to Elasticsearch

2024-03-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2832. - Resolution: Won't Fix Given the license changes regarding the concerned backend

[jira] [Resolved] (NUTCH-3036) Upgrade org.seleniumhq.selenium:selenium-java dependency in lib-selenium

2024-03-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-3036. - Resolution: Fixed > Upgrade org.seleniumhq.selenium:selenium-java depende

[jira] [Closed] (NUTCH-3036) Upgrade org.seleniumhq.selenium:selenium-java dependency in lib-selenium

2024-03-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-3036. --- > Upgrade org.seleniumhq.selenium:selenium-java dependency in lib-selen

[jira] [Closed] (NUTCH-3035) Update license and notice file for release of 1.20

2024-03-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-3035. --- > Update license and notice file for release of 1

[jira] [Resolved] (NUTCH-3035) Update license and notice file for release of 1.20

2024-03-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-3035. - Resolution: Fixed > Update license and notice file for release of 1

[jira] [Resolved] (NUTCH-3037) Upgrade org.apache.kafka:kafka_2.12: to v3.7.0

2024-03-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-3037. - Resolution: Fixed > Upgrade org.apache.kafka:kafka_2.12: to v3.

[jira] [Closed] (NUTCH-3037) Upgrade org.apache.kafka:kafka_2.12: to v3.7.0

2024-03-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-3037. --- > Upgrade org.apache.kafka:kafka_2.12: to v3.

[jira] [Created] (TIKA-4227) Register tika-helm Chart in artifacthub.io

2024-03-26 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created TIKA-4227: -- Summary: Register tika-helm Chart in artifacthub.io Key: TIKA-4227 URL: https://issues.apache.org/jira/browse/TIKA-4227 Project: Tika Issue Type

Re: Tika chart cannot be reached

2024-03-26 Thread Lewis John McGibbney
Hi Pietro, On 2024/03/26 08:13:39 Pietro Susca wrote: > > Francesco's request is that repo url is not working > also tika is not searchable on the helm repo hub Do you mean here - https://artifacthub.io/ ? If you want it to be searchable via that platform then I can try to make an entry. If

Re: Tika chart cannot be reached

2024-03-26 Thread Lewis John McGibbney
Hi Francesco, Thanks for letting us know that the repository was unreachable… I can only conclude that this was intermittent. I can easily fetch and deploy the Chart as follows helm repo add tika https://apache.jfrog.io/artifactory/tika helm install tika tika/tika --set image.tag=latest-full -n

[jira] [Work stopped] (NUTCH-3037) Upgrade org.apache.kafka:kafka_2.12: to v3.7.0

2024-03-21 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-3037 stopped by Lewis John McGibbney. --- > Upgrade org.apache.kafka:kafka_2.12: to v3.

[jira] [Updated] (NUTCH-3037) Upgrade org.apache.kafka:kafka_2.12: to v3.7.0

2024-03-21 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-3037: Flags: Patch > Upgrade org.apache.kafka:kafka_2.12: to v3.

[jira] [Work started] (NUTCH-3037) Upgrade org.apache.kafka:kafka_2.12: to v3.7.0

2024-03-21 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-3037 started by Lewis John McGibbney. --- > Upgrade org.apache.kafka:kafka_2.12: to v3.

[jira] [Created] (NUTCH-3037) Upgrade org.apache.kafka:kafka_2.12: to v3.7.0

2024-03-21 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-3037: --- Summary: Upgrade org.apache.kafka:kafka_2.12: to v3.7.0 Key: NUTCH-3037 URL: https://issues.apache.org/jira/browse/NUTCH-3037 Project: Nutch

[jira] [Work stopped] (NUTCH-3036) Upgrade org.seleniumhq.selenium:selenium-java dependency in lib-selenium

2024-03-14 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-3036 stopped by Lewis John McGibbney. --- > Upgrade org.seleniumhq.selenium:selenium-java dependency in

[jira] [Work started] (NUTCH-3036) Upgrade org.seleniumhq.selenium:selenium-java dependency in lib-selenium

2024-03-14 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-3036 started by Lewis John McGibbney. --- > Upgrade org.seleniumhq.selenium:selenium-java dependency in

[jira] [Created] (NUTCH-3036) Upgrade org.seleniumhq.selenium:selenium-java dependency in lib-selenium

2024-03-14 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-3036: --- Summary: Upgrade org.seleniumhq.selenium:selenium-java dependency in lib-selenium Key: NUTCH-3036 URL: https://issues.apache.org/jira/browse/NUTCH-3036

[jira] [Commented] (IVY-1651) Augment 'Child elements' section of 'File System Resolver' documentation

2024-03-13 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/IVY-1651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17826781#comment-17826781 ] Lewis John McGibbney commented on IVY-1651: --- PR available at [https://github.com/apache/ant-ivy

[jira] [Created] (IVY-1651) Augment 'Child elements' section of 'File System Resolver' documentation

2024-03-13 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created IVY-1651: - Summary: Augment 'Child elements' section of 'File System Resolver' documentation Key: IVY-1651 URL: https://issues.apache.org/jira/browse/IVY-1651 Project

[jira] [Commented] (NUTCH-3029) Host specific max. and min. intervals in adaptive scheduler

2024-03-13 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17826776#comment-17826776 ] Lewis John McGibbney commented on NUTCH-3029: - Hi [~martin.dj] [~markus17] it looks like we

Re: Self Introduction - Xuanwo

2024-03-13 Thread Lewis John McGibbney
Nice, welcome Xuanwo, thanks for introducing yourself. lewismc On 2024/03/10 05:20:20 Xuanwo wrote: > Hello, everyone > > I'm Xuanwo, and I'm following the "Contribute" guide in > comdev-working-groups[1] to introduce myself and kickstart my contributions :) > > My personal vision is "Empowering

Re: [QUESTION] What should community do in GSoC timeline?

2024-03-13 Thread Lewis John McGibbney
Hi Xuanwo, It’s been a few years since I participated in GSoC as a mentor… but this year I intend to. Let me see if I can provide answers to some of your questions. On 2024/03/11 03:07:29 Xuanwo wrote: > > 2024-02-22: Potential GSoC contributors discuss application ideas with > mentoring

[jira] [Closed] (NUTCH-3033) Upgrade Ivy to v2.5.2

2024-03-13 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-3033. --- > Upgrade Ivy to v2.5.2 > - > > Key

[jira] [Resolved] (NUTCH-3033) Upgrade Ivy to v2.5.2

2024-03-13 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-3033. - Resolution: Fixed > Upgrade Ivy to v2.

Re: [DISCUSS] Release Nutch 1.20

2024-03-12 Thread Lewis John McGibbney
I submitted a patch for the Ivy 2.5.2 upgrade. If folks could have a look at that it would be ideal. https://github.com/apache/nutch/pull/803 I am free to roll a release candidate towards the end of this week. lewismc On 2024/03/10 15:08:36 Lewis John McGibbney wrote: > Nice  > I wee t

[jira] [Updated] (NUTCH-3033) Upgrade Ivy to v2.5.2

2024-03-12 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-3033: Due Date: 12/Mar/24 (was: 11/Mar/24) > Upgrade Ivy to v2.

[jira] [Work stopped] (NUTCH-3033) Upgrade Ivy to v2.5.2

2024-03-12 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-3033 stopped by Lewis John McGibbney. --- > Upgrade Ivy to v2.

Re: Differences in retrieve pattern between Ivy 2.5.0/2.5.1 & 2.5.2?

2024-03-12 Thread Lewis John McGibbney
Thanks for this guidance Stefan :) I was able to get a patch together at https://github.com/apache/nutch/pull/803 Hopefully this helps others who may be confused as I was. Thank you lewismc On 2024/03/12 18:57:51 Stefan Bodewig wrote: > On 2024-03-11, lewis john mcgibbney wrote: > &g

[GSoC 2024 PROPOSAL] Overhaul the legacy Nutch plugin framework and replace it with PF4J

2024-03-12 Thread lewis john mcgibbney
Hi user@ & dev@, I decided to write up a GSoC’24 proposal and encourage interested applicants to register your interest in the JIRA issue or else reach out to the Nutch PMC over on dev@nutch.apache.org (please CC lewi...@apache.org). Title: Overhaul the legacy Nutch plugin framework and replace
