[ https://issues.apache.org/jira/browse/NUTCH-668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12658165#action_12658165 ]
Dennis Kubes commented on NUTCH-668: ------------------------------------ It uses two different StringUtils classes, one from commons lang, one from org.apache.hadoop.util.StringUtils. I just chose commons as I thought I would use that one more times. As it happens I only use it once in this patch. > Domain URL Filter > ----------------- > > Key: NUTCH-668 > URL: https://issues.apache.org/jira/browse/NUTCH-668 > Project: Nutch > Issue Type: Improvement > Affects Versions: 1.0.0 > Environment: All > Reporter: Dennis Kubes > Assignee: Dennis Kubes > Fix For: 1.0.0 > > Attachments: NUTCH-668-1-20081202.patch, NUTCH-668-2-20081204.patch, > NUTCH-668-3-20081213.patch > > > A URLFilter that adds the ability to filter out URLs by top level domain or > by hostname. A configuration file with a listing of URLs is used to denote > accepted urls. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.