[jira] [Commented] (NUTCH-3041) Address confusing logging in o.a.n.net.URLExemptionFilters
[ https://issues.apache.org/jira/browse/NUTCH-3041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17846795#comment-17846795 ] Hudson commented on NUTCH-3041: --- SUCCESS: Integrated in Jenkins build Nutch ยป Nutch-trunk #162 (See [https://ci-builds.apache.org/job/Nutch/job/Nutch-trunk/162/]) NUTCH-3041 Address confusing logging in o.a.n.net.URLExemptionFilters (#813) (github: [https://github.com/apache/nutch/commit/8abc78a653eb7970def10031d732fb4c7aa0fb6f]) * (edit) src/plugin/urlfilter-ignoreexempt/src/java/org/apache/nutch/urlfilter/ignoreexempt/ExemptionUrlFilter.java * (edit) src/java/org/apache/nutch/net/URLExemptionFilters.java * (edit) src/plugin/urlfilter-ignoreexempt/README.md > Address confusing logging in o.a.n.net.URLExemptionFilters > --- > > Key: NUTCH-3041 > URL: https://issues.apache.org/jira/browse/NUTCH-3041 > Project: Nutch > Issue Type: Task > Components: net >Affects Versions: 1.19, 1.20 >Reporter: Lewis John McGibbney >Assignee: Lewis John McGibbney >Priority: Minor > Fix For: 1.21 > > > URLExemptionFilter impementations are used to allow exemptions to external > domain resources by overriding the {{db.ignore.external.links}} configuration > setting. This is useful when the crawl is focused to a domain but resources > like images are hosted on CDN. > Currently [URLExemptionFilters|#L47-L48]] provides the following logging > {quote}INFO o.a.n.n.URLExemptionFilters [LocalJobRunner Map Task Executor > #0|#0] Found 0 extensions at point:'org.apache.nutch.net.URLExemptionFilter' > {quote} > I find this confusing. It would be better to log *only* if an > URLExemptionFilter implementation is actually configured to be used at > runtime. > I will provide a patch for this. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Closed] (NUTCH-3041) Address confusing logging in o.a.n.net.URLExemptionFilters
[ https://issues.apache.org/jira/browse/NUTCH-3041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney closed NUTCH-3041. --- > Address confusing logging in o.a.n.net.URLExemptionFilters > --- > > Key: NUTCH-3041 > URL: https://issues.apache.org/jira/browse/NUTCH-3041 > Project: Nutch > Issue Type: Task > Components: net >Affects Versions: 1.19, 1.20 >Reporter: Lewis John McGibbney >Assignee: Lewis John McGibbney >Priority: Minor > Fix For: 1.21 > > > URLExemptionFilter impementations are used to allow exemptions to external > domain resources by overriding the {{db.ignore.external.links}} configuration > setting. This is useful when the crawl is focused to a domain but resources > like images are hosted on CDN. > Currently [URLExemptionFilters|#L47-L48]] provides the following logging > {quote}INFO o.a.n.n.URLExemptionFilters [LocalJobRunner Map Task Executor > #0|#0] Found 0 extensions at point:'org.apache.nutch.net.URLExemptionFilter' > {quote} > I find this confusing. It would be better to log *only* if an > URLExemptionFilter implementation is actually configured to be used at > runtime. > I will provide a patch for this. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Work stopped] (NUTCH-3041) Address confusing logging in o.a.n.net.URLExemptionFilters
[ https://issues.apache.org/jira/browse/NUTCH-3041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on NUTCH-3041 stopped by Lewis John McGibbney. --- > Address confusing logging in o.a.n.net.URLExemptionFilters > --- > > Key: NUTCH-3041 > URL: https://issues.apache.org/jira/browse/NUTCH-3041 > Project: Nutch > Issue Type: Task > Components: net >Affects Versions: 1.19, 1.20 >Reporter: Lewis John McGibbney >Assignee: Lewis John McGibbney >Priority: Minor > Fix For: 1.21 > > > URLExemptionFilter impementations are used to allow exemptions to external > domain resources by overriding the {{db.ignore.external.links}} configuration > setting. This is useful when the crawl is focused to a domain but resources > like images are hosted on CDN. > Currently [URLExemptionFilters|#L47-L48]] provides the following logging > {quote}INFO o.a.n.n.URLExemptionFilters [LocalJobRunner Map Task Executor > #0|#0] Found 0 extensions at point:'org.apache.nutch.net.URLExemptionFilter' > {quote} > I find this confusing. It would be better to log *only* if an > URLExemptionFilter implementation is actually configured to be used at > runtime. > I will provide a patch for this. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Resolved] (NUTCH-3041) Address confusing logging in o.a.n.net.URLExemptionFilters
[ https://issues.apache.org/jira/browse/NUTCH-3041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-3041. - Resolution: Fixed > Address confusing logging in o.a.n.net.URLExemptionFilters > --- > > Key: NUTCH-3041 > URL: https://issues.apache.org/jira/browse/NUTCH-3041 > Project: Nutch > Issue Type: Task > Components: net >Affects Versions: 1.19, 1.20 >Reporter: Lewis John McGibbney >Assignee: Lewis John McGibbney >Priority: Minor > Fix For: 1.21 > > > URLExemptionFilter impementations are used to allow exemptions to external > domain resources by overriding the {{db.ignore.external.links}} configuration > setting. This is useful when the crawl is focused to a domain but resources > like images are hosted on CDN. > Currently [URLExemptionFilters|#L47-L48]] provides the following logging > {quote}INFO o.a.n.n.URLExemptionFilters [LocalJobRunner Map Task Executor > #0|#0] Found 0 extensions at point:'org.apache.nutch.net.URLExemptionFilter' > {quote} > I find this confusing. It would be better to log *only* if an > URLExemptionFilter implementation is actually configured to be used at > runtime. > I will provide a patch for this. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (NUTCH-3041) Address confusing logging in o.a.n.net.URLExemptionFilters
[ https://issues.apache.org/jira/browse/NUTCH-3041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17846788#comment-17846788 ] ASF GitHub Bot commented on NUTCH-3041: --- lewismc merged PR #813: URL: https://github.com/apache/nutch/pull/813 > Address confusing logging in o.a.n.net.URLExemptionFilters > --- > > Key: NUTCH-3041 > URL: https://issues.apache.org/jira/browse/NUTCH-3041 > Project: Nutch > Issue Type: Task > Components: net >Affects Versions: 1.19, 1.20 >Reporter: Lewis John McGibbney >Assignee: Lewis John McGibbney >Priority: Minor > Fix For: 1.21 > > > URLExemptionFilter impementations are used to allow exemptions to external > domain resources by overriding the {{db.ignore.external.links}} configuration > setting. This is useful when the crawl is focused to a domain but resources > like images are hosted on CDN. > Currently [URLExemptionFilters|#L47-L48]] provides the following logging > {quote}INFO o.a.n.n.URLExemptionFilters [LocalJobRunner Map Task Executor > #0|#0] Found 0 extensions at point:'org.apache.nutch.net.URLExemptionFilter' > {quote} > I find this confusing. It would be better to log *only* if an > URLExemptionFilter implementation is actually configured to be used at > runtime. > I will provide a patch for this. -- This message was sent by Atlassian Jira (v8.20.10#820010)
Re: [PR] NUTCH-3041 Address confusing logging in o.a.n.net.URLExemptionFilters [nutch]
lewismc merged PR #813: URL: https://github.com/apache/nutch/pull/813 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr...@nutch.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org
Re: [PR] Revert incorrect change [nutch-site]
lewismc commented on PR #2: URL: https://github.com/apache/nutch-site/pull/2#issuecomment-2112989006 Yes thank you @sebbASF -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr...@nutch.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org