[jira] [Commented] (NUTCH-3073) Address Java compiler warnings

2024-10-06 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17887186#comment-17887186 ] Hudson commented on NUTCH-3073: --- SUCCESS: Integrated in Jenkins build Nutch » Nutch-t

[jira] [Resolved] (NUTCH-3073) Address Java compiler warnings

2024-10-06 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-3073. Resolution: Fixed > Address Java compiler warni

[jira] [Created] (NUTCH-3074) Augment Javadoc for org/apache/nutch/protocol/Content.java

2024-10-04 Thread Lewis John McGibbney (Jira)
Lewis John McGibbney created NUTCH-3074: --- Summary: Augment Javadoc for org/apache/nutch/protocol/Content.java Key: NUTCH-3074 URL: https://issues.apache.org/jira/browse/NUTCH-3074 Project: Nutch

[jira] [Updated] (NUTCH-3074) Augment Javadoc for org/apache/nutch/protocol/Content.java

2024-10-04 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-3074: Description: [~hiranchaudhuri]'s [question on user@|https://lists.apach

[jira] [Created] (NUTCH-3073) Address Java compiler warnings

2024-10-04 Thread Sebastian Nagel (Jira)
Sebastian Nagel created NUTCH-3073: -- Summary: Address Java compiler warnings Key: NUTCH-3073 URL: https://issues.apache.org/jira/browse/NUTCH-3073 Project: Nutch Issue Type: Improvement

[jira] [Created] (NUTCH-3072) Fetcher to stop QueueFeeder if aborting with "hung threads"

2024-10-04 Thread Sebastian Nagel (Jira)
Sebastian Nagel created NUTCH-3072: -- Summary: Fetcher to stop QueueFeeder if aborting with "hung threads" Key: NUTCH-3072 URL: https://issues.apache.org/jira/browse/NUTCH-3072 Proj

[jira] [Commented] (NUTCH-2856) Implement a protocol-smb plugin based on hierynomus/smbj

2024-10-02 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17886549#comment-17886549 ] Lewis John McGibbney commented on NUTCH-2856: - Sorry for late respons

[jira] [Assigned] (NUTCH-2856) Implement a protocol-smb plugin based on hierynomus/smbj

2024-10-02 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-2856: --- Assignee: Hiran Chaudhuri (was: Hiran Chaudhuri) > Implement a proto

[jira] [Assigned] (NUTCH-2856) Implement a protocol-smb plugin based on hierynomus/smbj

2024-10-02 Thread Lewis John McGibbney (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney reassigned NUTCH-2856: --- Assignee: Hiran Chaudhuri (was: Lewis John McGibbney) > Implemen

[jira] [Comment Edited] (NUTCH-2856) Implement a protocol-smb plugin based on hierynomus/smbj

2024-10-02 Thread Hiran Chaudhuri (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17886498#comment-17886498 ] Hiran Chaudhuri edited comment on NUTCH-2856 at 10/2/24 7:5

[jira] [Commented] (NUTCH-2856) Implement a protocol-smb plugin based on hierynomus/smbj

2024-10-02 Thread Hiran Chaudhuri (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17886498#comment-17886498 ] Hiran Chaudhuri commented on NUTCH-2856: Opened [https://github.com/apache/n

[jira] [Assigned] (NUTCH-3068) Documentation on Nutch Homepage

2024-10-02 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel reassigned NUTCH-3068: -- Assignee: Sebastian Nagel > Documentation on Nutch Homep

[jira] [Resolved] (NUTCH-3070) Documentation has outdated links

2024-10-02 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-3070. Resolution: Fixed Thanks for reporting, [~hiranchaudhuri]! > Documentation has outda

[jira] [Assigned] (NUTCH-3070) Documentation has outdated links

2024-10-02 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel reassigned NUTCH-3070: -- Assignee: Sebastian Nagel > Documentation has outdated li

[jira] [Updated] (NUTCH-3070) Documentation has outdated links

2024-10-02 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-3070: --- Component/s: wiki > Documentation has outdated li

[jira] [Updated] (NUTCH-3069) Update protocol-smb reference

2024-10-02 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-3069: --- Component/s: wiki > Update protocol-smb refere

[jira] [Updated] (NUTCH-3071) Tutorial for Intranet Document Search outdated

2024-10-02 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-3071: --- Component/s: wiki > Tutorial for Intranet Document Search outda

[jira] [Updated] (NUTCH-3056) Injector to support resolving seed URLs

2024-10-02 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel updated NUTCH-3056: --- Component/s: injector > Injector to support resolving seed U

[jira] [Commented] (NUTCH-3071) Tutorial for Intranet Document Search outdated

2024-10-02 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17886400#comment-17886400 ] Sebastian Nagel commented on NUTCH-3071: Hi [~hiranchaudhuri], thanks

[jira] [Commented] (NUTCH-2856) Implement a protocol-smb plugin based on hierynomus/smbj

2024-10-02 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17886393#comment-17886393 ] Sebastian Nagel commented on NUTCH-2856: Hi [~hiranchaudhuri], yes and of co

[jira] [Commented] (NUTCH-3056) Injector to support resolving seed URLs

2024-10-01 Thread Hiran Chaudhuri (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3056?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17886248#comment-17886248 ] Hiran Chaudhuri commented on NUTCH-3056: _The price is the double fetchin

[jira] [Comment Edited] (NUTCH-2856) Implement a protocol-smb plugin based on hierynomus/smbj

2024-09-30 Thread Hiran Chaudhuri (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17885956#comment-17885956 ] Hiran Chaudhuri edited comment on NUTCH-2856 at 9/30/24 3:4

[jira] [Commented] (NUTCH-2856) Implement a protocol-smb plugin based on hierynomus/smbj

2024-09-30 Thread Hiran Chaudhuri (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17885956#comment-17885956 ] Hiran Chaudhuri commented on NUTCH-2856: With NUTCH-2429 being resolved i

[jira] [Updated] (NUTCH-3071) Tutorial for Intranet Document Search outdated

2024-09-30 Thread Hiran Chaudhuri (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hiran Chaudhuri updated NUTCH-3071: --- Description: On the page [https://cwiki.apache.org/confluence/display/NUTCH

[jira] [Updated] (NUTCH-3071) Tutorial for Intranet Document Search outdated

2024-09-30 Thread Hiran Chaudhuri (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hiran Chaudhuri updated NUTCH-3071: --- Description: On the page [https://cwiki.apache.org/confluence/display/NUTCH

[jira] [Updated] (NUTCH-3071) Tutorial for Intranet Document Search outdated

2024-09-30 Thread Hiran Chaudhuri (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hiran Chaudhuri updated NUTCH-3071: --- Description: On the page [https://cwiki.apache.org/confluence/display/NUTCH

[jira] [Created] (NUTCH-3071) Tutorial for Intranet Document Search outdated

2024-09-30 Thread Hiran Chaudhuri (Jira)
Hiran Chaudhuri created NUTCH-3071: -- Summary: Tutorial for Intranet Document Search outdated Key: NUTCH-3071 URL: https://issues.apache.org/jira/browse/NUTCH-3071 Project: Nutch Issue Type

[jira] [Created] (NUTCH-3070) Documentation has outdated links

2024-09-28 Thread Hiran Chaudhuri (Jira)
Hiran Chaudhuri created NUTCH-3070: -- Summary: Documentation has outdated links Key: NUTCH-3070 URL: https://issues.apache.org/jira/browse/NUTCH-3070 Project: Nutch Issue Type: Improvement

[jira] [Updated] (NUTCH-3069) Update protocol-smb reference

2024-09-28 Thread Hiran Chaudhuri (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hiran Chaudhuri updated NUTCH-3069: --- Component/s: documentation > Update protocol-smb refere

[jira] [Created] (NUTCH-3069) Update protocol-smb reference

2024-09-28 Thread Hiran Chaudhuri (Jira)
Hiran Chaudhuri created NUTCH-3069: -- Summary: Update protocol-smb reference Key: NUTCH-3069 URL: https://issues.apache.org/jira/browse/NUTCH-3069 Project: Nutch Issue Type: Improvement

[jira] [Created] (NUTCH-3068) Documentation on Nutch Homepage

2024-09-28 Thread Hiran Chaudhuri (Jira)
Hiran Chaudhuri created NUTCH-3068: -- Summary: Documentation on Nutch Homepage Key: NUTCH-3068 URL: https://issues.apache.org/jira/browse/NUTCH-3068 Project: Nutch Issue Type: Improvement

[jira] [Commented] (NUTCH-2812) Methods returning array may expose internal representation

2024-09-17 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17882455#comment-17882455 ] Hudson commented on NUTCH-2812: --- SUCCESS: Integrated in Jenkins build Nutch » Nutch-t

[jira] [Commented] (NUTCH-1806) Delegate processing of URL domains to crawler commons

2024-09-17 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-1806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17882446#comment-17882446 ] Hudson commented on NUTCH-1806: --- SUCCESS: Integrated in Jenkins build Nutch » Nutch-t

[jira] [Commented] (NUTCH-2812) Methods returning array may expose internal representation

2024-09-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17882442#comment-17882442 ] ASF GitHub Bot commented on NUTCH-2812: --- sebastian-nagel commented on PR #798:

Re: [PR] fix for NUTCH-2812 contributed by GabeHaegele [nutch]

2024-09-17 Thread via GitHub
sebastian-nagel merged PR #798: URL: https://github.com/apache/nutch/pull/798 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr

[jira] [Resolved] (NUTCH-2812) Methods returning array may expose internal representation

2024-09-17 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2812?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-2812. Resolution: Fixed > Methods returning array may expose internal representat

Re: [PR] fix for NUTCH-2812 contributed by GabeHaegele [nutch]

2024-09-17 Thread via GitHub
sebastian-nagel commented on PR #798: URL: https://github.com/apache/nutch/pull/798#issuecomment-2356188861 Thanks, @GabeHaegele! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[jira] [Commented] (NUTCH-2812) Methods returning array may expose internal representation

2024-09-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-2812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17882441#comment-17882441 ] ASF GitHub Bot commented on NUTCH-2812: --- sebastian-nagel merged PR #798:

[jira] [Resolved] (NUTCH-1942) Remove TopLevelDomain

2024-09-17 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-1942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-1942. Resolution: Done > Remove TopLevelDomain > -- > >

[jira] [Resolved] (NUTCH-1806) Delegate processing of URL domains to crawler commons

2024-09-17 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-1806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-1806. Resolution: Implemented Thanks, everybody! > Delegate processing of URL domains

Re: [PR] NUTCH-1806 Delegate processing of URL domains to crawler-commons [nutch]

2024-09-17 Thread via GitHub
sebastian-nagel merged PR #816: URL: https://github.com/apache/nutch/pull/816 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr

[jira] [Commented] (NUTCH-1806) Delegate processing of URL domains to crawler commons

2024-09-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-1806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17882439#comment-17882439 ] ASF GitHub Bot commented on NUTCH-1806: --- sebastian-nagel merged PR #816:

[jira] [Commented] (NUTCH-3058) Fetcher: counter for hung threads

2024-09-16 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17882160#comment-17882160 ] Hudson commented on NUTCH-3058: --- FAILURE: Integrated in Jenkins build Nutch » Nutch-t

Build failed in Jenkins: Nutch » Nutch-trunk #167

2024-09-16 Thread Apache Jenkins Server
See <https://ci-builds.apache.org/job/Nutch/job/Nutch-trunk/167/display/redirect?page=changes> Changes: [github] NUTCH-3058 Fetcher: counter for hung threads (#820) -- [...truncated 775.03 KB...] resolve-default: [ivy:resolve] :: loading se

[jira] [Resolved] (NUTCH-3058) Fetcher: counter for hung threads

2024-09-16 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-3058. Resolution: Implemented > Fetcher: counter for hung thre

[jira] [Commented] (NUTCH-3058) Fetcher: counter for hung threads

2024-09-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17882137#comment-17882137 ] ASF GitHub Bot commented on NUTCH-3058: --- sebastian-nagel commented on PR #820:

Re: [PR] NUTCH-3058 Fetcher: counter for hung threads [nutch]

2024-09-16 Thread via GitHub
sebastian-nagel commented on PR #820: URL: https://github.com/apache/nutch/pull/820#issuecomment-2353542871 Thanks for the discussion, @lewismc! The new counters were added to https://cwiki.apache.org/confluence/display/NUTCH/Metrics -- This is an automated message from the Apache Git

[jira] [Commented] (NUTCH-3058) Fetcher: counter for hung threads

2024-09-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17882133#comment-17882133 ] ASF GitHub Bot commented on NUTCH-3058: --- sebastian-nagel merged PR #820:

Re: [PR] NUTCH-3058 Fetcher: counter for hung threads [nutch]

2024-09-16 Thread via GitHub
sebastian-nagel merged PR #820: URL: https://github.com/apache/nutch/pull/820 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr

[jira] [Commented] (NUTCH-3058) Fetcher: counter for hung threads

2024-09-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881818#comment-17881818 ] ASF GitHub Bot commented on NUTCH-3058: --- lewismc commented on code in PR #820:

Re: [PR] NUTCH-3058 Fetcher: counter for hung threads [nutch]

2024-09-14 Thread via GitHub
lewismc commented on code in PR #820: URL: https://github.com/apache/nutch/pull/820#discussion_r1759821839 ## src/java/org/apache/nutch/fetcher/Fetcher.java: ## @@ -419,27 +419,43 @@ else if (bandwidthTargetCheckCounter == bandwidthTargetCheckEveryNSecs

[jira] [Commented] (NUTCH-3059) Generator: selector job does not count reduce output records

2024-09-14 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881792#comment-17881792 ] Sebastian Nagel commented on NUTCH-3059: Note: the above test was run in ps

[jira] [Commented] (NUTCH-3059) Generator: selector job does not count reduce output records

2024-09-14 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881791#comment-17881791 ] Sebastian Nagel commented on NUTCH-3059: Ok, found the reason: it's b

[jira] [Commented] (NUTCH-3058) Fetcher: counter for hung threads

2024-09-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881777#comment-17881777 ] ASF GitHub Bot commented on NUTCH-3058: --- sebastian-nagel commented on code i

Re: [PR] NUTCH-3058 Fetcher: counter for hung threads [nutch]

2024-09-14 Thread via GitHub
sebastian-nagel commented on code in PR #820: URL: https://github.com/apache/nutch/pull/820#discussion_r1759730682 ## src/java/org/apache/nutch/fetcher/Fetcher.java: ## @@ -419,27 +419,43 @@ else if (bandwidthTargetCheckCounter == bandwidthTargetCheckEveryNSecs

[jira] [Commented] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception

2024-09-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881642#comment-17881642 ] ASF GitHub Bot commented on NUTCH-3057: --- CatChullain commented on PR #819:

Re: [PR] NUTCH-3057 - Fix for index-arbitrary plugin improper retention and us… [nutch]

2024-09-13 Thread via GitHub
CatChullain commented on PR #819: URL: https://github.com/apache/nutch/pull/819#issuecomment-2349623677 Thanks! Yeah, I see now that it says it's on my repo. I was thinking it was in the project's. -- This is an automated message from the Apache Git Service. To respond to t

[jira] [Commented] (NUTCH-3065) Format changelog as Markdown

2024-09-13 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881623#comment-17881623 ] Hudson commented on NUTCH-3065: --- SUCCESS: Integrated in Jenkins build Nutch » Nutch-t

[jira] [Commented] (NUTCH-3062) protocol-okhttp: optionally record HTTP and SSL/TLS versions

2024-09-13 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881626#comment-17881626 ] Hudson commented on NUTCH-3062: --- SUCCESS: Integrated in Jenkins build Nutch » Nutch-t

[jira] [Commented] (NUTCH-3061) URL filters to log name of the rule file rules are read from

2024-09-13 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881625#comment-17881625 ] Hudson commented on NUTCH-3061: --- SUCCESS: Integrated in Jenkins build Nutch » Nutch-t

[jira] [Commented] (NUTCH-3066) Protocol plugin unit tests fail randomly

2024-09-13 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881624#comment-17881624 ] Hudson commented on NUTCH-3066: --- SUCCESS: Integrated in Jenkins build Nutch » Nutch-t

[jira] [Resolved] (NUTCH-3061) URL filters to log name of the rule file rules are read from

2024-09-13 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-3061. Resolution: Implemented > URL filters to log name of the rule file rules are read f

[jira] [Commented] (NUTCH-3061) URL filters to log name of the rule file rules are read from

2024-09-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881606#comment-17881606 ] ASF GitHub Bot commented on NUTCH-3061: --- sebastian-nagel merged PR #821:

Re: [PR] NUTCH-3061 URL filters to log name of the rules file [nutch]

2024-09-13 Thread via GitHub
sebastian-nagel merged PR #821: URL: https://github.com/apache/nutch/pull/821 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr

[jira] [Resolved] (NUTCH-3062) protocol-okhttp: optionally record HTTP and SSL/TLS versions

2024-09-13 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-3062. Resolution: Implemented > protocol-okhttp: optionally record HTTP and SSL/TLS versi

[jira] [Commented] (NUTCH-3062) protocol-okhttp: optionally record HTTP and SSL/TLS versions

2024-09-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3062?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881602#comment-17881602 ] ASF GitHub Bot commented on NUTCH-3062: --- sebastian-nagel merged PR #822:

Re: [PR] NUTCH-3062 protocol-okhttp: optionally record HTTP and SSL/TLS versions [nutch]

2024-09-13 Thread via GitHub
sebastian-nagel merged PR #822: URL: https://github.com/apache/nutch/pull/822 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr

[jira] [Resolved] (NUTCH-3065) Format changelog as Markdown

2024-09-13 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-3065. Resolution: Implemented > Format changelog as Markd

[jira] [Commented] (NUTCH-3065) Format changelog as Markdown

2024-09-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881599#comment-17881599 ] ASF GitHub Bot commented on NUTCH-3065: --- sebastian-nagel commented on PR #823:

[jira] [Commented] (NUTCH-3065) Format changelog as Markdown

2024-09-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881600#comment-17881600 ] ASF GitHub Bot commented on NUTCH-3065: --- sebastian-nagel merged PR #823:

Re: [PR] NUTCH-3065 Format changelog as markdown [nutch]

2024-09-13 Thread via GitHub
sebastian-nagel merged PR #823: URL: https://github.com/apache/nutch/pull/823 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr

Re: [PR] NUTCH-3065 Format changelog as markdown [nutch]

2024-09-13 Thread via GitHub
sebastian-nagel commented on PR #823: URL: https://github.com/apache/nutch/pull/823#issuecomment-2349190026 Thanks for the review, @lewismc ! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[jira] [Resolved] (NUTCH-3066) Protocol plugin unit tests fail randomly

2024-09-13 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Nagel resolved NUTCH-3066. Resolution: Fixed > Protocol plugin unit tests fail rando

[jira] [Commented] (NUTCH-3066) Protocol plugin unit tests fail randomly

2024-09-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881598#comment-17881598 ] ASF GitHub Bot commented on NUTCH-3066: --- sebastian-nagel merged PR #824:

Re: [PR] NUTCH-3066 Protocol plugin unit tests fail randomly [nutch]

2024-09-13 Thread via GitHub
sebastian-nagel merged PR #824: URL: https://github.com/apache/nutch/pull/824 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr

[jira] [Commented] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception

2024-09-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881592#comment-17881592 ] ASF GitHub Bot commented on NUTCH-3057: --- sebastian-nagel commented on PR #819:

Re: [PR] NUTCH-3057 - Fix for index-arbitrary plugin improper retention and us… [nutch]

2024-09-13 Thread via GitHub
sebastian-nagel commented on PR #819: URL: https://github.com/apache/nutch/pull/819#issuecomment-2349139153 > Delete the fix's branch after the merge? Or wait till after the next public release? Even if you delete the underlying branch, all changes and commit in the PR a

[jira] [Commented] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception

2024-09-13 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881575#comment-17881575 ] Hudson commented on NUTCH-3057: --- SUCCESS: Integrated in Jenkins build Nutch » Nutch-t

[jira] [Commented] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception

2024-09-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881551#comment-17881551 ] ASF GitHub Bot commented on NUTCH-3057: --- CatChullain commented on PR #819:

Re: [PR] NUTCH-3057 - Fix for index-arbitrary plugin improper retention and us… [nutch]

2024-09-13 Thread via GitHub
CatChullain commented on PR #819: URL: https://github.com/apache/nutch/pull/819#issuecomment-2348896110 Thanks! Merged now and resolved the Jira. Delete the fix's branch after the merge? Or wait till after the next public release? -- This is an automated message from the A

[jira] [Resolved] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception

2024-09-13 Thread Joe Gilvary (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joe Gilvary resolved NUTCH-3057. Fix Version/s: 1.21 Assignee: Joe Gilvary Resolution: Fixed > Arbitrary inde

[jira] [Commented] (NUTCH-3057) Arbitrary indexer "leaks" previous value into a field processed after an exception

2024-09-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881547#comment-17881547 ] ASF GitHub Bot commented on NUTCH-3057: --- CatChullain merged PR #819: URL: h

Re: [PR] NUTCH-3057 - Fix for index-arbitrary plugin improper retention and us… [nutch]

2024-09-13 Thread via GitHub
CatChullain merged PR #819: URL: https://github.com/apache/nutch/pull/819 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr

[jira] [Commented] (NUTCH-3058) Fetcher: counter for hung threads

2024-09-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881433#comment-17881433 ] ASF GitHub Bot commented on NUTCH-3058: --- lewismc commented on PR #820: URL: h

[jira] [Commented] (NUTCH-3058) Fetcher: counter for hung threads

2024-09-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881432#comment-17881432 ] ASF GitHub Bot commented on NUTCH-3058: --- lewismc commented on code in PR #820:

Re: [PR] NUTCH-3058 Fetcher: counter for hung threads [nutch]

2024-09-12 Thread via GitHub
lewismc commented on PR #820: URL: https://github.com/apache/nutch/pull/820#issuecomment-2347442238 Once this has had more thorough peer review it would be great to add it to the [Nutch Metrics documentation](https://cwiki.apache.org/confluence/display/NUTCH/Metrics). I am +1 for the

Re: [PR] NUTCH-3058 Fetcher: counter for hung threads [nutch]

2024-09-12 Thread via GitHub
lewismc commented on code in PR #820: URL: https://github.com/apache/nutch/pull/820#discussion_r1757740053 ## src/java/org/apache/nutch/fetcher/Fetcher.java: ## @@ -419,27 +419,43 @@ else if (bandwidthTargetCheckCounter == bandwidthTargetCheckEveryNSecs

[jira] [Commented] (NUTCH-3061) URL filters to log name of the rule file rules are read from

2024-09-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881430#comment-17881430 ] ASF GitHub Bot commented on NUTCH-3061: --- lewismc commented on PR #821: URL: h

Re: [PR] NUTCH-3061 URL filters to log name of the rules file [nutch]

2024-09-12 Thread via GitHub
lewismc commented on PR #821: URL: https://github.com/apache/nutch/pull/821#issuecomment-2347435213 +1 @sebastian-nagel applied cleanly against `master`. I think this logging is certainly useful. -- This is an automated message from the Apache Git Service. To respond to the message

[jira] [Commented] (NUTCH-3064) Upgrade com.maxmind.geoip2:geoip2 dependency in geoip-index to v4.2.0

2024-09-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881427#comment-17881427 ] ASF GitHub Bot commented on NUTCH-3064: --- lewismc opened a new pull request,

[PR] WIP NUTCH-3064 Upgrade com.maxmind.geoip2:geoip2 dependency in geoip-index to v4.2.0 [nutch]

2024-09-12 Thread via GitHub
lewismc opened a new pull request, #825: URL: https://github.com/apache/nutch/pull/825 **Work in Progress** This PR begins to address [NUTCH-3064](https://issues.apache.org/jira/browse/NUTCH-3064) by performing the upgrade of the com.maxmind.geoip2:geoip2 dependency to v4.2.0. It

[jira] [Commented] (NUTCH-3066) Protocol plugin unit tests fail randomly

2024-09-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881423#comment-17881423 ] ASF GitHub Bot commented on NUTCH-3066: --- lewismc commented on PR #824: URL: h

Re: [PR] NUTCH-3065 Format changelog as markdown [nutch]

2024-09-12 Thread via GitHub
lewismc commented on PR #823: URL: https://github.com/apache/nutch/pull/823#issuecomment-2347396771 +1 @sebastian-nagel LGTM applied against `master`. The `FAILED` test was against ``` [junit] Tests run: 14, Failures: 1, Errors: 0, Skipped: 4, Time elapsed: 4.735 sec

[jira] [Commented] (NUTCH-3065) Format changelog as Markdown

2024-09-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17881425#comment-17881425 ] ASF GitHub Bot commented on NUTCH-3065: --- lewismc commented on PR #823: URL: h

Re: [PR] NUTCH-3066 Protocol plugin unit tests fail randomly [nutch]

2024-09-12 Thread via GitHub
lewismc commented on PR #824: URL: https://github.com/apache/nutch/pull/824#issuecomment-2347394534 +1 @sebastian-nagel applied and tested against `master`. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[jira] [Commented] (NUTCH-1806) Delegate processing of URL domains to crawler commons

2024-09-11 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-1806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17880958#comment-17880958 ] Sebastian Nagel commented on NUTCH-1806: > it seems odd to return a

[jira] [Commented] (NUTCH-1806) Delegate processing of URL domains to crawler commons

2024-09-09 Thread Markus Jelsma (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-1806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17880239#comment-17880239 ] Markus Jelsma commented on NUTCH-1806: -- Yes, this seems fine when glancing a

[jira] [Created] (NUTCH-3067) Improve performance of FetchItemQueues if error state is preserved

2024-09-07 Thread Sebastian Nagel (Jira)
Sebastian Nagel created NUTCH-3067: -- Summary: Improve performance of FetchItemQueues if error state is preserved Key: NUTCH-3067 URL: https://issues.apache.org/jira/browse/NUTCH-3067 Project: Nutch

[jira] [Commented] (NUTCH-1806) Delegate processing of URL domains to crawler commons

2024-09-07 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-1806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17880036#comment-17880036 ] Sebastian Nagel commented on NUTCH-1806: Any comments on this? It's an

[jira] [Commented] (NUTCH-3063) Support for "addBinaryContent" from REST API

2024-09-06 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/NUTCH-3063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17879972#comment-17879972 ] Hudson commented on NUTCH-3063: --- SUCCESS: Integrated in Jenkins build Nutch » Nutch-t

  1   2   3   4   5   6   7   8   9   10   >