[jira] [Commented] (NUTCH-2920) Implement a indexer-opensearch plugin

2023-02-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17693335#comment-17693335 ] ASF GitHub Bot commented on NUTCH-2920: --- tballison commented on PR #761: URL: https

[GitHub] [nutch] tballison commented on pull request #761: NUTCH-2920 -- first working attempt at an OpenSearchIndexWriter

2023-02-24 Thread via GitHub
tballison commented on PR #761: URL: https://github.com/apache/nutch/pull/761#issuecomment-1444379112 I'm less than entirely thrilled with using stored strings for credentials, but that's where we were with Elasticsearch. Again, if there's a better way, please let me know. -- This is an

[jira] [Commented] (NUTCH-2920) Implement a indexer-opensearch plugin

2023-02-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1769#comment-1769 ] ASF GitHub Bot commented on NUTCH-2920: --- tballison commented on PR #761: URL: https

[GitHub] [nutch] tballison commented on pull request #761: NUTCH-2920 -- first working attempt at an OpenSearchIndexWriter

2023-02-24 Thread via GitHub
tballison commented on PR #761: URL: https://github.com/apache/nutch/pull/761#issuecomment-1444377846 The fiddly part (for me) was setting up the rest client to deal with a trust store. I followed https://opensearch.org/blog/connecting-java-high-level-rest-client-with-opensearch-over

[jira] [Commented] (NUTCH-2920) Implement a indexer-opensearch plugin

2023-02-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17693330#comment-17693330 ] ASF GitHub Bot commented on NUTCH-2920: --- tballison opened a new pull request, #761:

[GitHub] [nutch] tballison opened a new pull request, #761: NUTCH-2920 -- first working attempt at an OpenSearchIndexWriter

2023-02-24 Thread via GitHub
tballison opened a new pull request, #761: URL: https://github.com/apache/nutch/pull/761 …iter to OpenSearch

[jira] [Commented] (NUTCH-2985) Disable plugin urlfilter-validator by default

2023-02-24 Thread Markus Jelsma (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17693291#comment-17693291 ] Markus Jelsma commented on NUTCH-2985: -- +1 > Disable plugin urlfilter-validator by

[jira] [Assigned] (NUTCH-2973) Single domain names (eg https://localnet) can't be crawled - filtering fails

2023-02-24 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel reassigned NUTCH-2973: -- Assignee: Sebastian Nagel > Single domain names (eg https://localnet) can't be crawled

[jira] [Assigned] (NUTCH-2983) nutch-default.xml improvements

2023-02-24 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel reassigned NUTCH-2983: -- Assignee: Sebastian Nagel > nutch-default.xml improvements > -

[jira] [Assigned] (NUTCH-2985) Disable plugin urlfilter-validator by default

2023-02-24 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel reassigned NUTCH-2985: -- Assignee: Sebastian Nagel > Disable plugin urlfilter-validator by default > --

[jira] [Assigned] (NUTCH-2972) Javadoc build fails using JDK 17

2023-02-24 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel reassigned NUTCH-2972: -- Assignee: Sebastian Nagel > Javadoc build fails using JDK 17 > ---

[jira] [Commented] (NUTCH-2972) Javadoc build fails using JDK 17

2023-02-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17693260#comment-17693260 ] ASF GitHub Bot commented on NUTCH-2972: --- sebastian-nagel opened a new pull request,

[GitHub] [nutch] sebastian-nagel opened a new pull request, #760: NUTCH-2972 Javadoc build fails using JDK 17

2023-02-24 Thread via GitHub
sebastian-nagel opened a new pull request, #760: URL: https://github.com/apache/nutch/pull/760 - fix Javadoc issues when building with JDK 17 - note: the remaining 100 warnings are all about missing Javadocs for methods and variables -- This is an automated message from the Apache Git

[jira] [Assigned] (NUTCH-2984) Drop test proxy server and benchmark tool

2023-02-24 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel reassigned NUTCH-2984: -- Assignee: Sebastian Nagel > Drop test proxy server and benchmark tool > --

[jira] [Created] (NUTCH-2986) Depend urlfilter-validator on commons-validator routine UrlValidator

2023-02-24 Thread Sebastian Nagel (Jira)
Sebastian Nagel created NUTCH-2986: -- Summary: Depend urlfilter-validator on commons-validator routine UrlValidator Key: NUTCH-2986 URL: https://issues.apache.org/jira/browse/NUTCH-2986 Project: Nutch

[jira] [Created] (NUTCH-2985) Disable plugin urlfilter-validator by default

2023-02-24 Thread Sebastian Nagel (Jira)
Sebastian Nagel created NUTCH-2985: -- Summary: Disable plugin urlfilter-validator by default Key: NUTCH-2985 URL: https://issues.apache.org/jira/browse/NUTCH-2985 Project: Nutch Issue Type: B

[jira] [Commented] (NUTCH-2596) Upgrade from org.mortbay.jetty to org.eclipse.jetty

2023-02-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17693227#comment-17693227 ] ASF GitHub Bot commented on NUTCH-2596: --- sebastian-nagel opened a new pull request,

[GitHub] [nutch] sebastian-nagel opened a new pull request, #758: NUTCH-2596 Upgrade from org.mortbay.jetty to org.eclipse.jetty

2023-02-24 Thread via GitHub
sebastian-nagel opened a new pull request, #758: URL: https://github.com/apache/nutch/pull/758 This is the second part (after #574) to finally replace the `org.mortbay.jetty` packages by `org.eclipse.jetty`. This PR is based on and includes NUTCH-2984 / #757.

[jira] [Created] (NUTCH-2984) Drop test proxy server and benchmark tool

2023-02-24 Thread Sebastian Nagel (Jira)
Sebastian Nagel created NUTCH-2984: -- Summary: Drop test proxy server and benchmark tool Key: NUTCH-2984 URL: https://issues.apache.org/jira/browse/NUTCH-2984 Project: Nutch Issue Type: Task