Re: Possibility of using ci-hadoop.a.o for Nutch integration tests

2022-01-05 Thread lewis john mcgibbney
the Nutch project, > neither bandwidth to fix stuff if it gets broken. > > Just my thoughts. Looped in the dev lists, if others have any feedback. As > for the process, this would require a consensus from the Hadoop PMC > > -Ayush > > > On 06-Jan-2022, at 7:02

Re: Possibility of using ci-hadoop.a.o for Nutch integration tests

2022-01-05 Thread Lewis John Mcgibbney
andwidth to fix stuff if it gets broken. > > Just my thoughts. Looped in the dev lists, if others have any feedback. As > for the process, this would require a consensus from the Hadoop PMC > > -Ayush > > > On 06-Jan-2022, at 7:02 AM, lewis john mcgibbney > wrote: > >

Re: Possibility of using ci-hadoop.a.o for Nutch integration tests

2022-01-05 Thread Lewis John Mcgibbney
andwidth to fix stuff if it gets broken. > > Just my thoughts. Looped in the dev lists, if others have any feedback. As > for the process, this would require a consensus from the Hadoop PMC > > -Ayush > > > On 06-Jan-2022, at 7:02 AM, lewis john mcgibbney > wrote: > >

Re: Possibility of using ci-hadoop.a.o for Nutch integration tests

2022-01-05 Thread Lewis John Mcgibbney
andwidth to fix stuff if it gets broken. > > Just my thoughts. Looped in the dev lists, if others have any feedback. As > for the process, this would require a consensus from the Hadoop PMC > > -Ayush > > > On 06-Jan-2022, at 7:02 AM, lewis john mcgibbney > wrote: > >

Re: Possibility of using ci-hadoop.a.o for Nutch integration tests

2022-01-05 Thread Lewis John Mcgibbney
andwidth to fix stuff if it gets broken. > > Just my thoughts. Looped in the dev lists, if others have any feedback. As > for the process, this would require a consensus from the Hadoop PMC > > -Ayush > > > On 06-Jan-2022, at 7:02 AM, lewis john mcgibbney > wrote: > >

Possibility of using ci-hadoop.a.o for Nutch integration tests

2022-01-05 Thread lewis john mcgibbney
Hi general@, Not sure if this is the correct mailing list. Please redirect me if there is a more suitable location. Thank you I am PMC over on the Nutch project (https://nutch.apache.org). I would like to investigate whether we can build an integration testing capability for the project. This

Re: How to determine if customer XCom backend is being loaded

2022-01-05 Thread Lewis John McGibbney
in the DAG? Is it necessary to call this function? If someone can give me points then i will provide a pull request to augment the documentation as there doesn't appear to be any. Thanks lewismc On 2022/01/05 22:41:22 lewis john mcgibbney wrote: > Hi users@, > > We wish to use a cu

[jira] [Resolved] (ANY23-454) log4j:WARN No appenders could be found for logger (org.apache.any23.extractor.ExtractorRegistryImpl)

2022-01-05 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-454. Assignee: Lewis John McGibbney Resolution: Fixed Resolved in the upgrade

[jira] [Updated] (ANY23-454) log4j:WARN No appenders could be found for logger (org.apache.any23.extractor.ExtractorRegistryImpl)

2022-01-05 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated ANY23-454: --- Fix Version/s: 2.6 (was: 2.7) > log4j:WARN No appenders co

[jira] [Updated] (ANY23-535) Bump spotbugs-maven-plugin from 4.3.0 to 4.5.0.0

2022-01-05 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated ANY23-535: --- Fix Version/s: 2.6 (was: 2.7) > Bump spotbugs-maven-plu

[jira] [Updated] (ANY23-535) Bump spotbugs-maven-plugin from 4.3.0 to 4.5.0.0

2022-01-05 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated ANY23-535: --- Issue Type: Improvement (was: Bug) > Bump spotbugs-maven-plugin from 4.

[jira] [Updated] (ANY23-535) Bump spotbugs-maven-plugin from 4.3.0 to 4.5.0.0

2022-01-05 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated ANY23-535: --- Affects Version/s: 2.6 (was: 2.7) > Bump spotbugs-ma

[jira] [Resolved] (ANY23-535) Bump spotbugs-maven-plugin from 4.3.0 to 4.5.0.0

2022-01-05 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-535. Assignee: Lewis John McGibbney Resolution: Fixed > Bump spotbugs-maven-plu

[jira] [Resolved] (ANY23-555) Bump buildnumber-maven-plugin from 1.4 to 3.0.0

2022-01-05 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-555. Resolution: Fixed > Bump buildnumber-maven-plugin from 1.4 to 3.

[jira] [Resolved] (ANY23-553) Document MathUtils#md5 to warn that the weak hash algorithm is not to be used in a sensitive context

2022-01-05 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-553. Resolution: Fixed > Document MathUtils#md5 to warn that the weak hash algori

[jira] [Created] (ANY23-555) Bump buildnumber-maven-plugin from 1.4 to 3.0.0

2022-01-05 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created ANY23-555: -- Summary: Bump buildnumber-maven-plugin from 1.4 to 3.0.0 Key: ANY23-555 URL: https://issues.apache.org/jira/browse/ANY23-555 Project: Apache Any23

Re: Addressing Nutch use of CMS WAS: [IMPORTANT] - ci.apache.org and CMS Shutdown end of January 2022

2022-01-04 Thread Lewis John McGibbney
Hi Gavin, On 2022/01/03 07:19:58 Gavin McDonald wrote: > Hi Lewis, > > Having checked again, all looks good from this end. > > One last place is: > https://svn.apache.org/repos/infra/websites/production/nutch/content/ > I assume you no longer use that area and I can safely remove? Correct we

Re: [VOTE] Release Apache Any23 2.6 RC#2

2022-01-04 Thread Lewis John McGibbney
Hi Peter, On 2022/01/04 23:38:41 Peter Ansell wrote: > Tracing the error in a debugger shows that the correct result, > "ISO-8859-1" is found by the meta tag detection method, but it is then > overridden with "windows-1252" because a carriage return character is > detected by the following code

Re: [VOTE] Release Apache Any23 2.6 RC#2

2022-01-04 Thread Lewis John McGibbney
Hi Peter, On 2022/01/04 22:41:56 Peter Ansell wrote: > > Hi Lewis, > > It is failing to build using Java-8 for me due to the following > exception when running the formatter-maven-plugin: I just downloaded JDK8 and can reproduce this. Also please note that Any23 has been using JDK11 for build

Re: [VOTE] Release Apache Any23 2.6 RC#2

2022-01-04 Thread lewis john mcgibbney
Thanks On Tue, Jan 4, 2022 at 9:49 AM Hans Brende wrote: > +1 from me! > > - Hans > > > On Tue, Jan 4, 2022 at 5:19 PM lewis john mcgibbney > wrote: > >> Hi Any23 PMC, >> >> Please VOTE on the 2nd release candidate for Apache Any23 2.6. Most >

[jira] [Updated] (NUTCH-2926) Implement persistent storage for Nutch Webserver resources

2022-01-04 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2926: Description: The Nutch webserver caches resources (seed lists, configuration, jobs

[jira] [Created] (NUTCH-2926) Implement persistent storage for Nutch Webserver resources

2022-01-04 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-2926: --- Summary: Implement persistent storage for Nutch Webserver resources Key: NUTCH-2926 URL: https://issues.apache.org/jira/browse/NUTCH-2926 Project: Nutch

[jira] [Commented] (NUTCH-2925) Secure the Nutch REST API using Apache Shiro

2022-01-04 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17468743#comment-17468743 ] Lewis John McGibbney commented on NUTCH-2925: - [~markus17] didn't really like the idea

[jira] [Created] (NUTCH-2925) Secure the Nutch REST API using Apache Shiro

2022-01-04 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-2925: --- Summary: Secure the Nutch REST API using Apache Shiro Key: NUTCH-2925 URL: https://issues.apache.org/jira/browse/NUTCH-2925 Project: Nutch

[VOTE] Release Apache Any23 2.6 RC#2

2022-01-03 Thread lewis john mcgibbney
Hi user@ and dev@, Please VOTE on the 2nd release candidate for Apache Any23 2.6. Most notably this RC addresses several security vulnerabilities by upgrading every single Any23 dependency. We solved 62 issues: https://issues.apache.org/jira/projects/ANY23/versions/12350556 Git source tag

[VOTE] Release Apache Any23 2.6 RC#2

2022-01-03 Thread lewis john mcgibbney
Hi user@ and dev@, Please VOTE on the 2nd release candidate for Apache Any23 2.6. Most notably this RC addresses several security vulnerabilities by upgrading every single Any23 dependency. We solved 62 issues: https://issues.apache.org/jira/projects/ANY23/versions/12350556 Git source tag

[jira] [Commented] (NUTCH-2923) Add Job Id in Job Failure messages

2022-01-03 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17468361#comment-17468361 ] Lewis John McGibbney commented on NUTCH-2923: - Yes it absolutely would. I didn't see

[jira] [Comment Edited] (NUTCH-2923) Add Job Id in Job Failure messages

2022-01-03 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17468361#comment-17468361 ] Lewis John McGibbney edited comment on NUTCH-2923 at 1/4/22, 5:11 AM

[RESULT] WAS Re: [VOTE] Release Apache Any23 2.6

2022-01-03 Thread lewis john mcgibbney
Hi user@, dev@, I'm going to bring this VOTE thread to a close with the following results. [3] +1, release as Any23 2.6 Andy Seaborne* Lewis John McGibbney* David Cockbill *Any23 PMC binding [0] +/-0, fine, but consider to fix few issues before... [0] -1, nope, because... (and please explain

[RESULT] WAS Re: [VOTE] Release Apache Any23 2.6

2022-01-03 Thread lewis john mcgibbney
Hi user@, dev@, I'm going to bring this VOTE thread to a close with the following results. [3] +1, release as Any23 2.6 Andy Seaborne* Lewis John McGibbney* David Cockbill *Any23 PMC binding [0] +/-0, fine, but consider to fix few issues before... [0] -1, nope, because... (and please explain

[jira] [Updated] (ANY23-553) Document MathUtils#md5 to warn that the weak hash algorithm is not to be used in a sensitive context

2022-01-03 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated ANY23-553: --- Description: Sonarcloud.io analysis has [identified a potential security

[jira] [Created] (ANY23-553) Document MathUtils#md5 to warn that the weak hash algorithm is not to be used in a sensitive context

2022-01-03 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created ANY23-553: -- Summary: Document MathUtils#md5 to warn that the weak hash algorithm is not to be used in a sensitive context Key: ANY23-553 URL: https://issues.apache.org/jira

[jira] [Commented] (NUTCH-2923) Add Job Id in Job Failure messages

2022-01-02 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17467745#comment-17467745 ] Lewis John McGibbney commented on NUTCH-2923: - We can easily obtain it via {{job.getStatus

Addressing Nutch use of CMS WAS: [IMPORTANT] - ci.apache.org and CMS Shutdown end of January 2022

2022-01-02 Thread lewis john mcgibbney
Hi Gavin, Thanks for the email below. It was my understanding that the Nutch project no longer relied on the legacy CMS framework. I wrote a new website and published it at https://github.com/apache/nutch-site with the static content being served on the asf-site branch. The old CMS website

Addressing Nutch use of CMS WAS: [IMPORTANT] - ci.apache.org and CMS Shutdown end of January 2022

2022-01-02 Thread lewis john mcgibbney
Hi Gavin, Thanks for the email below. It was my understanding that the Nutch project no longer relied on the legacy CMS framework. I wrote a new website and published it at https://github.com/apache/nutch-site with the static content being served on the asf-site branch. The old CMS website

[jira] [Created] (ANY23-552) Change scope of Tika dependency to provided

2022-01-02 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created ANY23-552: -- Summary: Change scope of Tika dependency to provided Key: ANY23-552 URL: https://issues.apache.org/jira/browse/ANY23-552 Project: Apache Any23

[jira] [Resolved] (ANY23-551) Bump tika.version from 2.2.0 to 2.2.1

2022-01-02 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-551. Resolution: Fixed > Bump tika.version from 2.2.0 to 2.

[jira] [Created] (ANY23-550) Bump maven-deploy-plugin from 3.0.0-M1 to 3.0.0-M2

2022-01-02 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created ANY23-550: -- Summary: Bump maven-deploy-plugin from 3.0.0-M1 to 3.0.0-M2 Key: ANY23-550 URL: https://issues.apache.org/jira/browse/ANY23-550 Project: Apache Any23

[jira] [Commented] (NUTCH-2278) Handle alpha-2 language codes consistently

2022-01-02 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17467688#comment-17467688 ] Lewis John McGibbney commented on NUTCH-2278: - No problems Fengtan… a test case would

[jira] [Commented] (NUTCH-2923) Add Job Id in Job Failure messages

2022-01-02 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17467687#comment-17467687 ] Lewis John McGibbney commented on NUTCH-2923: - Hi Prakhar, I agree with you. Are you able

Re: AGE Tutorial?

2022-01-02 Thread Lewis John McGibbney
ou can also find documentation at > https://age.apache.org/docs/master/index.html > > Are you already familiar with PostgreSQL and/or OpenCypher? > > How much data do you wish to ingest? What format is the data currently in? > How many nodes? > > A contributor Muhammad Sho

[jira] [Updated] (NUTCH-2856) Implement a protocol-smb plugin based on hierynomus/smbj

2021-12-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2856: Summary: Implement a protocol-smb plugin based on hierynomus/smbj (was: Implement

[jira] [Commented] (NUTCH-2856) Implement an appropriately licensed protocol-smb plugin

2021-12-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17467111#comment-17467111 ] Lewis John McGibbney commented on NUTCH-2856: - Adding some notes from my research

[jira] [Updated] (NUTCH-2856) Implement a protocol-smb plugin based on

2021-12-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2856: Summary: Implement a protocol-smb plugin based on (was: Implement

[jira] [Work started] (NUTCH-2856) Implement an appropriately licensed protocol-smb plugin

2021-12-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-2856 started by Lewis John McGibbney. --- > Implement an appropriately licensed protocol-smb plu

[jira] [Updated] (NUTCH-2856) Implement an appropriately licensed protocol-smb plugin

2021-12-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2856: Issue Type: New Feature (was: Bug) > Implement an appropriately licensed proto

[jira] [Updated] (NUTCH-2856) Implement an appropriately licensed protocol-smb plugin

2021-12-30 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-2856: Summary: Implement an appropriately licensed protocol-smb plugin (was: protocol

[jira] [Commented] (NUTCH-2856) protocol-smb plugin is outdated

2021-12-29 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17466704#comment-17466704 ] Lewis John McGibbney commented on NUTCH-2856: - I'll take this one on. I intend to use https

[jira] [Assigned] (NUTCH-2856) protocol-smb plugin is outdated

2021-12-29 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-2856: --- Assignee: Lewis John McGibbney > protocol-smb plugin is outda

[jira] [Commented] (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implm

2021-12-29 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17466703#comment-17466703 ] Lewis John McGibbney commented on NUTCH-427: An old thread but I found an alternative SMB

[jira] [Resolved] (ANY23-549) Bump log4j2.version from 2.17.0 to 2.17.1

2021-12-29 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-549. Resolution: Fixed > Bump log4j2.version from 2.17.0 to 2.1

[jira] [Created] (ANY23-549) Bump log4j2.version from 2.17.0 to 2.17.1

2021-12-29 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created ANY23-549: -- Summary: Bump log4j2.version from 2.17.0 to 2.17.1 Key: ANY23-549 URL: https://issues.apache.org/jira/browse/ANY23-549 Project: Apache Any23

[jira] [Resolved] (TIKA-3539) jdom 2.0.6 dependency in tika-parser-news-module has unfixed CVE

2021-12-29 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved TIKA-3539. Resolution: Fixed > jdom 2.0.6 dependency in tika-parser-news-module has unfi

[jira] [Assigned] (TIKA-3539) jdom 2.0.6 dependency in tika-parser-news-module has unfixed CVE

2021-12-29 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned TIKA-3539: -- Assignee: Lewis John McGibbney > jdom 2.0.6 dependency in tika-parser-n

[jira] [Commented] (TIKA-3539) jdom 2.0.6 dependency in tika-parser-news-module has unfixed CVE

2021-12-29 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17466682#comment-17466682 ] Lewis John McGibbney commented on TIKA-3539: This issue was fixed for 2.X in https

[jira] [Updated] (TIKA-3539) jdom 2.0.6 dependency in tika-parser-news-module has unfixed CVE

2021-12-29 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-3539: --- Fix Version/s: 2.2.1 > jdom 2.0.6 dependency in tika-parser-news-module has unfi

[jira] [Resolved] (TIKA-3635) Upgrade to rome 1.18.0

2021-12-29 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved TIKA-3635. Resolution: Fixed > Upgrade to rome 1.1

[jira] [Resolved] (TIKA-3488) Security issue XXE in TIKA due to JDOM

2021-12-29 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved TIKA-3488. Resolution: Fixed > Security issue XXE in TIKA due to J

[jira] [Updated] (TIKA-3488) Security issue XXE in TIKA due to JDOM

2021-12-29 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-3488: --- Fix Version/s: 2.2.1 > Security issue XXE in TIKA due to J

[jira] [Assigned] (TIKA-3488) Security issue XXE in TIKA due to JDOM

2021-12-29 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned TIKA-3488: -- Assignee: Lewis John McGibbney > Security issue XXE in TIKA due to J

[jira] [Commented] (TIKA-3488) Security issue XXE in TIKA due to JDOM

2021-12-29 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17466681#comment-17466681 ] Lewis John McGibbney commented on TIKA-3488: This was fixed in https://github.com/apache/tika

[jira] [Updated] (TIKA-3635) Upgrade to rome 1.18.0

2021-12-29 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-3635: --- Fix Version/s: 2.2.1 > Upgrade to rome 1.1

[jira] [Created] (TIKA-3635) Upgrade to rome 1.18.0

2021-12-29 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created TIKA-3635: -- Summary: Upgrade to rome 1.18.0 Key: TIKA-3635 URL: https://issues.apache.org/jira/browse/TIKA-3635 Project: Tika Issue Type: Improvement

Nutch metrics documentation request for review/feedback

2021-12-29 Thread lewis john mcgibbney
Hi dev@, *What?* I've been chipping away at some documentation which would provide a one-stop-shop for understanding Nutch metrics. My first pass is available at https://cwiki.apache.org/confluence/display/NUTCH/Metrics This relates to the recent JIRA issue I filed about establishing a Nutch

!! Join the #nutch Slack channel !!

2021-12-29 Thread lewis john mcgibbney
Hi user@, dev@, I took the liberty of setting up a #nutch channel for our community to communicate in a lower latency manner. First join the-asf.slack.com Slack workspace https://infra.apache.org/slack.html Then simply join the #nutch channel. See you there :) Thanks lewismc --

!! Join the #nutch Slack channel !!

2021-12-29 Thread lewis john mcgibbney
Hi user@, dev@, I took the liberty of setting up a #nutch channel for our community to communicate in a lower latency manner. First join the-asf.slack.com Slack workspace https://infra.apache.org/slack.html Then simply join the #nutch channel. See you there :) Thanks lewismc --

[jira] [Resolved] (ANY23-548) Bump mockito-core from 3.3.3 to 4.2.0

2021-12-29 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-548. Resolution: Fixed > Bump mockito-core from 3.3.3 to 4.

[jira] [Created] (ANY23-548) Bump mockito-core from 3.3.3 to 4.2.0

2021-12-29 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created ANY23-548: -- Summary: Bump mockito-core from 3.3.3 to 4.2.0 Key: ANY23-548 URL: https://issues.apache.org/jira/browse/ANY23-548 Project: Apache Any23 Issue

[jira] [Resolved] (ANY23-547) Bump httpcore from 4.4.14 to 4.4.15

2021-12-29 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-547. Assignee: Lewis John McGibbney Resolution: Fixed > Bump httpcore from 4.4

[jira] [Created] (ANY23-547) Bump httpcore from 4.4.14 to 4.4.15

2021-12-29 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created ANY23-547: -- Summary: Bump httpcore from 4.4.14 to 4.4.15 Key: ANY23-547 URL: https://issues.apache.org/jira/browse/ANY23-547 Project: Apache Any23 Issue

Re: Break out individual functions from IndexerJob -deleteGone flag?

2021-12-29 Thread Lewis John McGibbney
I also should note that the -deleteGone setting cannot be overriden via nutch-site.xml whereas similar settings do have equivalent configuration properties in nutch-default.xml https://github.com/apache/nutch/blob/master/conf/nutch-default.xml#L1361-L1373 On 2021/12/29 17:08:20 lewis john

Break out individual functions from IndexerJob -deleteGone flag?

2021-12-29 Thread lewis john mcgibbney
Hi dev@, Reading the code for the IndexerJob -deleteGone flag [0] you can clearly see that we bundle deletion requests for 404s, redirects and duplicates into one option. This of course has pros and cons. Does anyone wish to share their opinion on how this is implemented? My opinion is that 1. The

[jira] [Created] (ANY23-546) Implement sonarcloud.io in Any23 continuous integration

2021-12-28 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created ANY23-546: -- Summary: Implement sonarcloud.io in Any23 continuous integration Key: ANY23-546 URL: https://issues.apache.org/jira/browse/ANY23-546 Project: Apache Any23

[jira] [Created] (ANY23-545) Integrate JenaTripleHandler.java into Any23 codebase.

2021-12-28 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created ANY23-545: -- Summary: Integrate JenaTripleHandler.java into Any23 codebase. Key: ANY23-545 URL: https://issues.apache.org/jira/browse/ANY23-545 Project: Apache Any23

[jira] [Resolved] (ANY23-536) Upgrade to tika 2.2.0

2021-12-28 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-536. Resolution: Fixed > Upgrade to tika 2.

[jira] [Resolved] (ANY23-538) Replace existing logging with Slf4j over log4j2

2021-12-28 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-538. Resolution: Fixed > Replace existing logging with Slf4j over log

[jira] [Resolved] (ANY23-539) Introduce ossindex-maven-plugin support

2021-12-28 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-539. Resolution: Fixed > Introduce ossindex-maven-plugin supp

[jira] [Created] (ANY23-544) Bump snakeyaml from 1.29 to 1.30

2021-12-28 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created ANY23-544: -- Summary: Bump snakeyaml from 1.29 to 1.30 Key: ANY23-544 URL: https://issues.apache.org/jira/browse/ANY23-544 Project: Apache Any23 Issue Type

[jira] [Resolved] (ANY23-544) Bump snakeyaml from 1.29 to 1.30

2021-12-28 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-544. Resolution: Fixed > Bump snakeyaml from 1.29 to 1

[jira] [Resolved] (ANY23-542) Bump jackson.version from 2.13.0 to 2.13.1

2021-12-28 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-542. Resolution: Fixed > Bump jackson.version from 2.13.0 to 2.1

[jira] [Resolved] (ANY23-543) Bump tika.version from 2.1.0 to 2.2.0

2021-12-28 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-543. Resolution: Fixed > Bump tika.version from 2.1.0 to 2.

[jira] [Created] (ANY23-543) Bump tika.version from 2.1.0 to 2.2.0

2021-12-28 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created ANY23-543: -- Summary: Bump tika.version from 2.1.0 to 2.2.0 Key: ANY23-543 URL: https://issues.apache.org/jira/browse/ANY23-543 Project: Apache Any23 Issue

[jira] [Resolved] (ANY23-541) Bump rdf4j.version from 3.7.3 to 3.7.4

2021-12-28 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-541. Resolution: Fixed > Bump rdf4j.version from 3.7.3 to 3.

[jira] [Created] (ANY23-542) Bump jackson.version from 2.13.0 to 2.13.1

2021-12-28 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created ANY23-542: -- Summary: Bump jackson.version from 2.13.0 to 2.13.1 Key: ANY23-542 URL: https://issues.apache.org/jira/browse/ANY23-542 Project: Apache Any23

[jira] [Resolved] (ANY23-540) Bump spotbugs-maven-plugin from 4.5.0.0 to 4.5.2.0

2021-12-28 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-540?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved ANY23-540. Resolution: Fixed > Bump spotbugs-maven-plugin from 4.5.0.0 to 4.5.

[jira] [Created] (ANY23-541) Bump rdf4j.version from 3.7.3 to 3.7.4

2021-12-28 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created ANY23-541: -- Summary: Bump rdf4j.version from 3.7.3 to 3.7.4 Key: ANY23-541 URL: https://issues.apache.org/jira/browse/ANY23-541 Project: Apache Any23 Issue

[jira] [Created] (ANY23-540) Bump spotbugs-maven-plugin from 4.5.0.0 to 4.5.2.0

2021-12-28 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created ANY23-540: -- Summary: Bump spotbugs-maven-plugin from 4.5.0.0 to 4.5.2.0 Key: ANY23-540 URL: https://issues.apache.org/jira/browse/ANY23-540 Project: Apache Any23

AGE Tutorial?

2021-12-23 Thread lewis john mcgibbney
Hi users@, Does anyone know if a getting started AGE tutorial exists? What’s not clear to me is whether data is ingested directly into PostgreSQL or via AGE…? I’ve read the documentation regarding graph creation but I’ve not found documentation related to populating the graph or data ingestion

[jira] [Created] (NUTCH-2920) Implement a indexer-opensearch plugin

2021-12-17 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-2920: --- Summary: Implement a indexer-opensearch plugin Key: NUTCH-2920 URL: https://issues.apache.org/jira/browse/NUTCH-2920 Project: Nutch Issue Type

[jira] [Resolved] (NUTCH-2449) Usage of Tika LanguageIdentifier in language-identifier plugin

2021-12-17 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-2449. - Resolution: Fixed > Usage of Tika LanguageIdentifier in language-identif

[jira] [Commented] (ANY23-536) Upgrade to tika 2.2.0

2021-12-17 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461759#comment-17461759 ] Lewis John McGibbney commented on ANY23-536: https://github.com/apache/any23/pull/230

[jira] [Commented] (ANY23-539) Introduce ossindex-maven-plugin support

2021-12-17 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/ANY23-539?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461758#comment-17461758 ] Lewis John McGibbney commented on ANY23-539: https://github.com/apache/any23/pull/230

Re: 2.2.0 JARs not pushed to Maven Central

2021-12-17 Thread Lewis John McGibbney
59 PM, lewis john mcgibbney > > wrote: > > > > I’ve been waiting on the M2 central Repository being updated with the 2.2.0 > > jars… > > I checked repository.Apache.org and they are NOT staged which I assume > > means that the staging repository has been cl

2.2.0 JARs not pushed to Maven Central

2021-12-17 Thread lewis john mcgibbney
I’ve been waiting on the M2 central Repository being updated with the 2.2.0 jars… I checked repository.Apache.org and they are NOT staged which I assume means that the staging repository has been closed which should have triggered the release to maven central. Anyone know what’s going on? lewismc

[jira] [Commented] (NUTCH-2278) Handle alpha-2 language codes consistently

2021-12-17 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461637#comment-17461637 ] Lewis John McGibbney commented on NUTCH-2278: - Out of curiosity [~Fengtan] are you still

[jira] [Comment Edited] (NUTCH-2278) Handle alpha-2 language codes consistently

2021-12-17 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461635#comment-17461635 ] Lewis John McGibbney edited comment on NUTCH-2278 at 12/17/21, 7:48 PM

[jira] [Commented] (NUTCH-2278) Handle alpha-2 language codes consistently

2021-12-17 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461635#comment-17461635 ] Lewis John McGibbney commented on NUTCH-2278: - [~snagel] wdyt about this? > Handle alph

[jira] [Commented] (NUTCH-2919) Upgrade to Tika 2.2.0

2021-12-17 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17461620#comment-17461620 ] Lewis John McGibbney commented on NUTCH-2919: - The artifacts have not yet made maven central

[jira] [Updated] (TIKA-3620) Language detection documentation needs attention

2021-12-17 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated TIKA-3620: --- Fix Version/s: 2.2.0 > Language detection documentation needs attent

[jira] [Resolved] (TIKA-3620) Language detection documentation needs attention

2021-12-17 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved TIKA-3620. Resolution: Fixed https://tika.apache.org/2.2.0/detection.html#Language_Detection

<    1   2   3   4   5   6   7   8   9   10   >