[jira] [Resolved] (ANY23-461) Upgrade Any23 to JDK11

2021-03-27 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-461. Resolution: Fixed > Upgrade Any23 to JD

[jira] [Resolved] (ANY23-294) Create extractor plugin for IFC files

2021-03-24 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-294. Resolution: Won't Fix > Create extractor plugin for IFC fi

[jira] [Resolved] (ANY23-216) Any23 Firefox Extension

2021-03-24 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-216. Resolution: Won't Fix > Any23 Firefox Extens

[jira] [Resolved] (ANY23-10) Integrate Javascript engine to extract dynamic data

2021-03-24 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-10?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-10. --- Resolution: Won't Fix > Integrate Javascript engine to extract dynamic d

[jira] [Resolved] (ANY23-239) Any23 Chrome Extension

2021-03-24 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-239. Resolution: Won't Fix > Any23 Chrome Extens

[jira] [Resolved] (ANY23-371) Any23 cannot start in CMD in Windows 10

2021-03-24 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-371. Resolution: Cannot Reproduce I can't reproduce this issue and no-one else has tried

[jira] [Created] (ANY23-461) Upgrade Any23 to JDK11

2021-03-22 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created ANY23-461: -- Summary: Upgrade Any23 to JDK11 Key: ANY23-461 URL: https://issues.apache.org/jira/browse/ANY23-461 Project: Apache Any23 Issue Type

[jira] [Resolved] (TIKA-3311) Add github workflows to Tika

2021-03-22 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved TIKA-3311. Resolution: Fixed > Add github workflows to T

Re: [Gora] builds.apache.org

2021-03-22 Thread Lewis John McGibbney
Hi Alfonso, The Gora builds now reside on the Cloudbees Jenkins server (which I think you already figured out) https://ci-builds.apache.org/job/Gora/job/gora-pipeline/job/master/ I don't have time to look into configuring the job right now but if you look at the Any23 build we did a while back,

Re: [DISCUSS] What is holding back Release and graduation?

2021-03-22 Thread lewis john mcgibbney
I would also really encourage a release. It would be useful to consult https://cwiki.apache.org/confluence/display/incubator/ReleaseChecklist I don't think there is any blocking reason... as Frank mentioned... it's just not been prioritized. lewismc On Mon, Mar 22, 2021 at 10:10 AM wrote: > >

Re: [Discuss] Apache Gora Next Release

2021-03-21 Thread lewis john mcgibbney
Excellent work folks. I would suggest activating dependabot on the Gora repository. It may make our lives a bit easier to keep on top of dependency management. lewismc On Sun, Mar 21, 2021 at 6:02 AM wrote: > > dev Digest 21 Mar 2021 13:02:16 - Issue 1401 > > Topics (messages 13057 through

[jira] [Resolved] (NUTCH-2512) Nutch does not build under JDK9

2021-03-21 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2512. - Resolution: Won't Fix Fixed in NUTCH-2857 > Nutch does not build under J

[jira] [Work stopped] (NUTCH-2857) Upgrade from JDK1.8 --> JDK11

2021-03-21 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2857 stopped by Lewis John McGibbney. --- > Upgrade from JDK1.8 -->

[jira] [Resolved] (NUTCH-2857) Upgrade from JDK1.8 --> JDK11

2021-03-21 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2857. - Resolution: Fixed > Upgrade from JDK1.8 -->

[jira] [Work started] (NUTCH-2857) Upgrade from JDK1.8 --> JDK11

2021-03-16 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2857 started by Lewis John McGibbney. --- > Upgrade from JDK1.8 -->

Re: Moving past JDK 1.8

2021-03-12 Thread Lewis John McGibbney
I went ahead and created https://github.com/apache/nutch/pull/573 in case anyone is interested in pursuing this. lewismc On 2021/03/13 02:34:27, Lewis John McGibbney wrote: > Hi dev@, > Does anyone have any opinions about moving past JDK1.8? > Maybe JDK1.8 --> JDK11 LTS? >

[jira] [Created] (NUTCH-2857) Upgrade from JDK1.8 --> JDK11

2021-03-12 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-2857: --- Summary: Upgrade from JDK1.8 --> JDK11 Key: NUTCH-2857 URL: https://issues.apache.org/jira/browse/NUTCH-2857 Project: Nutch Issue T

Moving past JDK 1.8

2021-03-12 Thread Lewis John McGibbney
Hi dev@, Does anyone have any opinions about moving past JDK1.8? Maybe JDK1.8 --> JDK11 LTS? From what I can see the following files need to be updated: modified: .github/workflows/master-build.yml modified: default.properties modified: ivy/mvn.template We would also

[jira] [Updated] (NUTCH-2854) Address ALL security vulnerabilities indicated by report-vulnerabilities ant target

2021-03-12 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2854: Description: NUTCH-2840 uncovered lots of security issues for us to work

Re: Specifying a different container registry in Airflow Helm Chart

2021-03-11 Thread Lewis John McGibbney
airflow/blob/master/chart/values.yaml#L30-L34. > > On Wed, Mar 10, 2021 at 1:54 PM Lewis John McGibbney > wrote: > > > I found the `registry` key in values.yaml. > > https://github.com/apache/airflow/blob/master/chart/values.yaml#L626-L637 > > We are experimenting

Re: Specifying a different container registry in Airflow Helm Chart

2021-03-10 Thread Lewis John McGibbney
I found the `registry` key in values.yaml. https://github.com/apache/airflow/blob/master/chart/values.yaml#L626-L637 We are experimenting with this and I will update here if we get it to work. I'll then pull a documentation patch together. lewismc On 2021/03/10 18:34:17, Lewis John McGibbney

Specifying a different container registry in Airflow Helm Chart

2021-03-10 Thread Lewis John McGibbney
Hello users@, We wanted to turn on Airflow Sentry integration and are deploying into K8s using Helm. It is not clear how one would define the 'new/custom' Airflow container which resides in our internal enterprise container registry i.e. JFrog's Artifactory. If someone can guide me on this then I

Re: Adding packages to the airflow image in helm chart

2021-03-10 Thread Lewis John McGibbney
tion-deployment.html#customizing-or-extending-the-production-image > > Or watch my talk from Airflow Summit last year: > > https://youtu.be/wDr3Y7q2XoI where I explained the various options you > > have. > > > > J. > > > > > > On Thu, Feb 25, 2021

[jira] [Commented] (TIKA-3311) Add github workflows to Tika

2021-03-05 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17296430#comment-17296430 ] Lewis John McGibbney commented on TIKA-3311: bq. Is it because PRs are not run through ci

[jira] [Updated] (NUTCH-2855) Update org.elasticsearch.client

2021-03-05 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2855: Fix Version/s: 1.19 > Update org.elasticsearch.cli

[jira] [Updated] (NUTCH-2855) Update org.elasticsearch.client

2021-03-05 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2855: Component/s: build > Update org.elasticsearch.cli

[jira] [Assigned] (NUTCH-2855) Update org.elasticsearch.client

2021-03-05 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-2855: --- Assignee: Randall Williams > Update org.elasticsearch.cli

[jira] [Updated] (NUTCH-2855) Update org.elasticsearch.client

2021-03-05 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2855: Affects Version/s: 1.18 > Update org.elasticsearch.cli

[jira] [Updated] (NUTCH-2854) Address ALL security vulnerabilities indicated by report-vulnerabilities ant target

2021-03-05 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2854: Description: NUTCH-2840 uncovered lots of issues for us to work on. This is simply

[jira] [Created] (NUTCH-2854) Address ALL security vulnerabilities indicated by report-vulnerabilities ant target

2021-03-05 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-2854: --- Summary: Address ALL security vulnerabilities indicated by report-vulnerabilities ant target Key: NUTCH-2854 URL: https://issues.apache.org/jira/browse/NUTCH-2854

Re: [VOTE] Release Apache Flagon UserALE.js (Incubating) 2.1.1

2021-03-04 Thread Lewis John McGibbney
Hi Josh, I sincerely apologize for the late review. You guys did a great job pulling this one together. > [ x ] Build and Unit Tests Pass > [ x ] Integration Tests Pass > [ x ] "Incubating" in References to Project and Distribution File Names > [ x ] Signatures and Hashes Match Keys > [ x ]

[jira] [Commented] (TIKA-94) Speech-to-text transcription

2021-03-03 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-94?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17294795#comment-17294795 ] Lewis John McGibbney commented on TIKA-94: -- This makes perfect sense to me. At the end of the day

[jira] [Updated] (TIKA-94) Speech-to-text transcription

2021-03-02 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-94?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-94: - Summary: Speech-to-text transcription (was: Speech recognition) > Speech-to-t

[jira] [Comment Edited] (TIKA-94) Speech recognition

2021-03-02 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-94?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17293291#comment-17293291 ] Lewis John McGibbney edited comment on TIKA-94 at 3/3/21, 4:07 AM

[jira] [Created] (TIKA-3311) Add github workflows to Tika

2021-03-02 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created TIKA-3311: -- Summary: Add github workflows to Tika Key: TIKA-3311 URL: https://issues.apache.org/jira/browse/TIKA-3311 Project: Tika Issue Type: Improvement

[jira] [Comment Edited] (TIKA-94) Speech recognition

2021-03-01 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-94?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17293291#comment-17293291 ] Lewis John McGibbney edited comment on TIKA-94 at 3/2/21, 2:06 AM

[jira] [Commented] (TIKA-94) Speech recognition

2021-03-01 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-94?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17293291#comment-17293291 ] Lewis John McGibbney commented on TIKA-94: -- [~tallison] I'm looking at [TesseractOCRParser|https

Re: [VOTE] Release Apache Flagon UserALE.js (Incubating) 2.1.1

2021-02-25 Thread lewis john mcgibbney
Hi Josh, Dropping private@ simply because nothing is private. Thank you very much for pushing the release candidate. Excellent work recently. I'll review the release candidate and provide a VOTE shortly. Thanks On Wed, Feb 24, 2021 at 5:53 PM Joshua Poore wrote: > Hi Folks, > > Doing parallel VOTEs on

Adding packages to the airflow image in helm chart

2021-02-24 Thread Lewis John McGibbney
Hi users@, I'm trying to configure the Helm Chart [0] with LDAP authentication. Does anyone know how I can add the ldap packages equivalent to executing the pip command below? pip install 'apache-airflow[ldap]' Do I need to build my own docker image FROM apache/airflow:${tag} and then

Re: Use WebUI username and password in webserver_config.py for LDAP authentication

2021-02-24 Thread Lewis John McGibbney
e bind connection, the Airflow LDAP backend will > then confirm if the user from the webform is authorised. > > Leo > > > On 23 Feb 2021, at 21:53, Lewis John McGibbney wrote: > > > > Hi Folks, > > Has anyone been able to successfully pass the usernam

Re: Use WebUI username and password in webserver_config.py for LDAP authentication

2021-02-23 Thread Lewis John McGibbney
17:21:30, Lewis John McGibbney wrote: > Hi users@, > > # > # Context # > # > With the following webserver_config.py code, when I provide the environment > variables $USERNAME and $PASSWORD, from the WebUI I can authenticate and > login to Airflow just fin

[jira] [Commented] (TIKA-94) Speech recognition

2021-02-19 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-94?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17287313#comment-17287313 ] Lewis John McGibbney commented on TIKA-94: -- I totally understand the use case. I will work

Use WebUI username and password in webserver_config.py for LDAP authentication

2021-02-19 Thread Lewis John McGibbney
Hi users@, # # Context # # With the following webserver_config.py code, when I provide the environment variables $USERNAME and $PASSWORD, from the WebUI I can authenticate and login to Airflow just fine. import os from flask_appbuilder.security.manager import AUTH_LDAP basedir

[jira] [Comment Edited] (TIKA-94) Speech recognition

2021-02-18 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-94?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17286834#comment-17286834 ] Lewis John McGibbney edited comment on TIKA-94 at 2/19/21, 3:49 AM

[jira] [Commented] (TIKA-94) Speech recognition

2021-02-18 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-94?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17286834#comment-17286834 ] Lewis John McGibbney commented on TIKA-94: -- [~peterkronenberg] took a cursory look at vosk today. I

[jira] [Created] (NUTCH-2852) Method invokes System.exit(...) 9 bugs

2021-02-18 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-2852: --- Summary: Method invokes System.exit(...) 9 bugs Key: NUTCH-2852 URL: https://issues.apache.org/jira/browse/NUTCH-2852 Project: Nutch Issue

[jira] [Work started] (NUTCH-2852) Method invokes System.exit(...) 9 bugs

2021-02-18 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2852 started by Lewis John McGibbney. --- > Method invokes System.exit(...) 9 b

[jira] [Resolved] (NUTCH-2851) Random object created and used only once

2021-02-18 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2851. - Resolution: Fixed > Random object created and used only o

[jira] [Resolved] (NUTCH-2850) Method ignores exceptional return value

2021-02-18 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2850. - Resolution: Fixed > Method ignores exceptional return va

[jira] [Created] (NUTCH-2851) Random object created and used only once

2021-02-17 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-2851: --- Summary: Random object created and used only once Key: NUTCH-2851 URL: https://issues.apache.org/jira/browse/NUTCH-2851 Project: Nutch Issue

[jira] [Work started] (NUTCH-2851) Random object created and used only once

2021-02-17 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2851 started by Lewis John McGibbney. --- > Random object created and used only o

[jira] [Created] (NUTCH-2850) Method ignores exceptional return value

2021-02-17 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-2850: --- Summary: Method ignores exceptional return value Key: NUTCH-2850 URL: https://issues.apache.org/jira/browse/NUTCH-2850 Project: Nutch Issue

[jira] [Work started] (NUTCH-2850) Method ignores exceptional return value

2021-02-17 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2850 started by Lewis John McGibbney. --- > Method ignores exceptional return va

[jira] [Work stopped] (NUTCH-1860) Protocol IMAPS Support

2021-02-17 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-1860 stopped by Lewis John McGibbney. --- > Protocol IMAPS Supp

Advice on importing and archiving from mailman/pipermail mailing lists

2021-02-16 Thread Lewis John McGibbney
Hi user@, I'm trying to establish Ponymail for a 501(c)(3) to improve searching around 50 mailing lists. Thanks to Humbedooh, I recently discovered that you guys refactored Ponymail --> Foal. It looks much better folks... excellent job. For example, I would like to archive mail from the

[jira] [Updated] (NUTCH-1860) Protocol IMAPS Support

2021-02-16 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1860: Description: Implementing the Internet Messaging Access Protocol within Nutch

[jira] [Commented] (NUTCH-1860) Protocol IMAPS Support

2021-02-16 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17285418#comment-17285418 ] Lewis John McGibbney commented on NUTCH-1860: - I'm back to work on this issue folks

[jira] [Work stopped] (NUTCH-2849) Replace remaining package.html files with package-info.java

2021-02-16 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2849 stopped by Lewis John McGibbney. --- > Replace remaining package.html files with package-info.j

[jira] [Resolved] (NUTCH-2849) Replace remaining package.html files with package-info.java

2021-02-16 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2849. - Resolution: Fixed Thanks for review [~snagel] > Replace remaining package.h

[jira] [Commented] (TIKA-94) Speech recognition

2021-02-14 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-94?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17284526#comment-17284526 ] Lewis John McGibbney commented on TIKA-94: -- Hi Peter, thanks for the comment and reference to Vosk

[jira] [Commented] (TIKA-94) Speech recognition

2021-02-12 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-94?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17284036#comment-17284036 ] Lewis John McGibbney commented on TIKA-94: -- [~chrismattmann] I'm taking this issue on with a team

[jira] [Assigned] (TIKA-94) Speech recognition

2021-02-12 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-94?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned TIKA-94: Assignee: Lewis John McGibbney (was: Chris A. Mattmann) > Speech recognit

[jira] [Work started] (NUTCH-2849) Replace remaining package.html files with package-info.java

2021-02-11 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2849 started by Lewis John McGibbney. --- > Replace remaining package.html files with package-info.j

[jira] [Resolved] (NUTCH-2842) Fix Javadoc warnings, errors and add Javadoc check to Github Action and Jenkins

2021-02-11 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2842. - Resolution: Fixed > Fix Javadoc warnings, errors and add Javadoc check to Git

[jira] [Work stopped] (NUTCH-2842) Fix Javadoc warnings, errors and add Javadoc check to Github Action and Jenkins

2021-02-11 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2842 stopped by Lewis John McGibbney. --- > Fix Javadoc warnings, errors and add Javadoc check to Git

[jira] [Created] (NUTCH-2849) Replace remaining package.html files with package-info.java

2021-02-11 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-2849: --- Summary: Replace remaining package.html files with package-info.java Key: NUTCH-2849 URL: https://issues.apache.org/jira/browse/NUTCH-2849 Project

Re: Donating code from Nutch to Commons - commons-url?

2021-02-10 Thread Lewis John McGibbney
s also likely that one of the HttpComponent project's > component like HttpCore or HttpClient already has some of this > functionality. > > Gary > > On Mon, Feb 8, 2021, 21:48 Lewis John McGibbney wrote: > > > Hi dev@, > > My name is Lewis, I'm a dev over in the Nutc

[jira] [Updated] (NUTCH-2842) Fix Javadoc warnings, errors and add Javadoc check to Github Action and Jenkins

2021-02-09 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2842: Summary: Fix Javadoc warnings, errors and add Javadoc check to Github Action

[jira] [Commented] (NUTCH-2842) Fix Javadoc warnings and add Javadoc check to Github Action and Jenkins

2021-02-08 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17281521#comment-17281521 ] Lewis John McGibbney commented on NUTCH-2842: - Yeah this task is pretty heavy lifting. So far

Donating code from Nutch to Commons - commons-url?

2021-02-08 Thread Lewis John McGibbney
Hi dev@, My name is Lewis, I'm a dev over in the Nutch project - http://nutch.apache.org. It occurred to me that Nutch has some rather useful code related to URL processing. It can do things like * extract domain name from an input URL or String * get domain suffix * compare domains * get domain

[jira] [Created] (NUTCH-2848) Consider usefulness of StringUtil#isEmpty

2021-02-06 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-2848: --- Summary: Consider usefulness of StringUtil#isEmpty Key: NUTCH-2848 URL: https://issues.apache.org/jira/browse/NUTCH-2848 Project: Nutch Issue

[jira] [Updated] (NUTCH-2848) Consider use of StringUtil#isEmpty

2021-02-06 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2848: Summary: Consider use of StringUtil#isEmpty (was: Consider usefulness

Re: [REPORT] Submitted

2021-02-03 Thread lewis john mcgibbney
Good job Frank. On Wed, Feb 3, 2021 at 4:54 PM Frank Greguska wrote: > > Final report has been submitted. View it here: > > https://cwiki.apache.org/confluence/display/INCUBATOR/February2021#sdap > > - Frank -- http://home.apache.org/~lewismc/ http://people.apache.org/keys/committer/lewismc

[jira] [Commented] (NUTCH-2842) Fix Javadoc warnings and add Javadoc check to Github Action and Jenkins

2021-02-02 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277668#comment-17277668 ] Lewis John McGibbney commented on NUTCH-2842: - Hi folks, I am nearly finished this behemoth

[jira] [Commented] (NUTCH-2842) Fix Javadoc warnings and add Javadoc check to Github Action and Jenkins

2021-02-02 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17277669#comment-17277669 ] Lewis John McGibbney commented on NUTCH-2842: - I should be finished some time this week

[jira] [Assigned] (NUTCH-2842) Fix Javadoc warnings and add Javadoc check to Github Action and Jenkins

2021-01-31 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-2842: --- Assignee: Lewis John McGibbney > Fix Javadoc warnings and add Javadoc ch

[jira] [Work started] (NUTCH-2843) Duplicate declaration of dependencies in ivy.xml

2021-01-31 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2843 started by Lewis John McGibbney. --- > Duplicate declaration of dependencies in ivy.

[jira] [Work started] (NUTCH-2842) Fix Javadoc warnings and add Javadoc check to Github Action and Jenkins

2021-01-31 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2842 started by Lewis John McGibbney. --- > Fix Javadoc warnings and add Javadoc check to Github Act

[jira] [Work stopped] (NUTCH-2840) Fix 'report-vulnerabilities' ant target in build.xml

2021-01-31 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2840 stopped by Lewis John McGibbney. --- > Fix 'report-vulnerabilities' ant target in build.

[jira] [Resolved] (NUTCH-2840) Fix 'report-vulnerabilities' ant target in build.xml

2021-01-31 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2840. - Resolution: Fixed > Fix 'report-vulnerabilities' ant target in build.

[jira] [Resolved] (NUTCH-2819) Move spotbugs "installation" directory to avoid that spotbugs is shipped in Nutch runtime

2021-01-31 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2819. - Resolution: Fixed > Move spotbugs "installation" direc

Security vulnerability reduction for Nutch

2021-01-27 Thread Lewis John McGibbney
Hi dev@, This is a heads up that I have created a project titled "Security vulnerability reduction for the Apache Nutch Web crawler project" which will be taken on within USC's CSCI 401 senior computer science capstone program. A very brief description is below for anyone interested. This

[ANNOUNCE] CVE-2021-23901: An XML external entity (XXE) injection vulnerability exists in the Nutch DmozParser

2021-01-25 Thread lewis john mcgibbney
Description: An XML external entity (XXE) injection vulnerability was discovered in the Nutch DmozParser and is known to affect Nutch versions < 1.18. XML external entity injection (also known as XXE) is a web security vulnerability that allows an attacker to interfere with an application's

[ANNOUNCE] Apache Nutch 1.18

2021-01-24 Thread lewis john mcgibbney
What? The Apache Nutch team is pleased to announce the release of Apache Nutch v1.18. Nutch is a well matured, production ready Web crawler. Nutch 1.x enables fine grained configuration, relying on Apache Hadoop™ data structures. Where? Source and binary distributions are available for download

CVE-2021-23901: An XML external entity (XXE) injection vulnerability exists in the Nutch DmozParser

2021-01-24 Thread lewis john mcgibbney
Description: An XML external entity (XXE) injection vulnerability was discovered in the Nutch DmozParser and is known to affect Nutch versions < 1.18. XML external entity injection (also known as XXE) is a web security vulnerability that allows an attacker to interfere with an application's

[ANNOUNCE] Apache Nutch 1.18 Release

2021-01-24 Thread lewis john mcgibbney
*What?* The Apache Nutch team is pleased to announce the release of Apache Nutch v1.18. Nutch is a well matured, production ready Web crawler. Nutch 1.x enables fine grained configuration, relying on Apache Hadoop™ data structures. *Where?* Source and binary distributions are available for

[RESULT] WAS Re: [VOTE] Release Apache Nutch 1.18 RC1

2021-01-24 Thread lewis john mcgibbney
user@, dev@, The 72hr VOTE'ing period has elapsed. The RESULT's are as follows [5] +1 Release this package as Apache Nutch 1.18. Lewis John McGibbney* Ralf Kotowski* Jorge Luis Betancourt Gonzalez* Sebastian Nagel* Shashanka Balakuntala Srinivasa* [0] -1 Do not release this package because

Re: [VOTE] Release Apache Nutch 1.18 RC1

2021-01-24 Thread Lewis John McGibbney
-and-sums >"SHOULD NOT supply a MD5 or SHA-1 checksum file because these are > deprecated" > > > Best, > Sebastian > > On 1/21/21 2:22 AM, lewis john mcgibbney wrote: > > Hi Folks, > > ssh://g...@gitlab.padim.fim.uni-passau.de:13003/os

Re: [VOTE] Release Apache Nutch 1.18 RC1

2021-01-24 Thread Lewis John McGibbney
r. > at > org.apache.ivy.core.retrieve.RetrieveEngine.determineArtifactsToCopy(RetrieveEngine.java:413) > at > org.apache.ivy.core.retrieve.RetrieveEngine.retrieve(RetrieveEngine.java:122) > ... 43 more > > On Thu, Jan 21, 2021 at 2:22 AM lewis jo

[jira] [Closed] (NUTCH-2844) Link Alternatif Joker123

2021-01-24 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-2844. --- > Link Alternatif Joker123 > > >

[jira] [Resolved] (NUTCH-2844) Link Alternatif Joker123

2021-01-24 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2844. - Resolution: Not A Problem > Link Alternatif Joker

Re: [VOTE] Release Apache Tika 2.0.0-ALPHA Candidate #1

2021-01-21 Thread Lewis John McGibbney
d at: 2021-01-21T10:02:45-05:00 > [INFO] > ---- > > On Wed, Jan 20, 2021 at 7:29 PM Lewis John McGibbney > wrote: > > > > Hi Tim, > > FWIW here's my review > > > > SIGS BOTH LOOK GOO

Fwd: WebDataCommons releases 86.3 billion quads Microdata, Embedded JSON-LD, RDFa, and Microformat data originating from 15.3 million websites

2021-01-21 Thread lewis john mcgibbney
FYI folks -- Forwarded message - From: Lewis John Mcgibbney Date: Thu, Jan 21, 2021 at 1:04 PM Subject: Re: WebDataCommons releases 86.3 billion quads Microdata, Embedded JSON-LD, RDFa, and Microformat data originating from 15.3 million websites To: Web Data Commons

[jira] [Commented] (NUTCH-2826) Migrate Nutch Site from Apache CMS to Hugo

2021-01-20 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2826?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17269026#comment-17269026 ] Lewis John McGibbney commented on NUTCH-2826: - This is pretty cool I never heard of Hugo

[VOTE] Release Apache Nutch 1.18 RC1

2021-01-20 Thread lewis john mcgibbney
Hi Folks, A first candidate for the Nutch 1.18 release is available at [0] where accompanying SHA512, ASC and MD5 signatures can also be found. Information on verifying releases can be found at [1]. The release candidate is a .zip and tar.gz archive of the sources in [2] In addition, a staged

Re: [VOTE] Release Apache Tika 2.0.0-ALPHA Candidate #1

2021-01-20 Thread Lewis John McGibbney
Hi Tim, FWIW here's my review SIGS BOTH LOOK GOOD gpg --verify tika-2.0.0-ALPHA-src.zip.asc tika-2.0.0-ALPHA-src.zip gpg: Signature made Wed Jan 13 15:26:10 2021 PST gpg:using RSA key 184454FAD8697760F3E00D2E4A51A45B944FFD51 gpg: Good signature from "Tim Allison (ASF signing key)
