[ 
https://issues.apache.org/jira/browse/NUTCH-2539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lewis John McGibbney resolved NUTCH-2539.
-----------------------------------------
    Resolution: Fixed

> Not correct naming of db.url.filters and db.url.normalizers in 
> nutch-default.xml
> --------------------------------------------------------------------------------
>
>                 Key: NUTCH-2539
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2539
>             Project: Nutch
>          Issue Type: Improvement
>    Affects Versions: 1.15
>            Reporter: Semyon Semyonov
>            Priority: Major
>             Fix For: 1.15
>
>
> There is a mismatch between config and code.
> In code, 
>  In CrawlDbFilter line 41:43
> > public static final String URL_FILTERING = "crawldb.url.filters";
> > public static final String URL_NORMALIZING = "crawldb.url.normalizers";
> > public static final String URL_NORMALIZING_SCOPE = 
> > "crawldb.url.normalizers.scope";
>  
> In nutch-default.xml
> > <property>
> > <name>db.url.normalizers</name>
> > <value>false</value>
> > <description>Normalize urls when updating crawldb</description>
> > </property>
> >
> > <property>
> > <name>db.url.filters</name>
> > <value>false</value>
> > <description>Filter urls when updating crawldb</description>
> > </property>
> These properties should be in line with code.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to