[ https://issues.apache.org/jira/browse/NUTCH-668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653881#action_12653881 ]
Dennis Kubes commented on NUTCH-668:
------------------------------------

I agree. Being able to search for TLDs like .com would make it much more flexible. Let me work up the changes and I will post a new patch (without my local path :)). Although I do want to get this in quickly, I think the new functionality is worth the wait.

> Domain URL Filter
> -----------------
>
>                 Key: NUTCH-668
>                 URL: https://issues.apache.org/jira/browse/NUTCH-668
>             Project: Nutch
>          Issue Type: Improvement
>    Affects Versions: 1.0.0
>         Environment: All
>            Reporter: Dennis Kubes
>            Assignee: Dennis Kubes
>             Fix For: 1.0.0
>
>         Attachments: NUTCH-668-1-20081202.patch, NUTCH-668-2-20081204.patch
>
>
> A URLFilter that adds the ability to filter out URLs by top level domain or
> by hostname. A configuration file with a listing of URLs is used to denote
> accepted URLs.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
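
For readers who have not opened the attached patches, here is a rough sketch of the kind of filter the issue description talks about: a list of accepted entries, where an entry may be a full hostname, a domain, or a bare TLD such as .com. This is not the code from the NUTCH-668 patches. The class name DomainFilterSketch, the one-entry-per-line format, the "#" comment handling, and the suffix-matching rule are all assumptions made for illustration. The only thing borrowed from Nutch is the URLFilter convention of returning the URL when it is accepted and null when it is rejected.

```java
import java.net.URI;
import java.util.Arrays;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration only; not the class shipped in the NUTCH-668 patch.
class DomainFilterSketch {

    // Accepted entries, lower-cased, without a leading dot ("com", "example.com").
    private final Set<String> accepted = new HashSet<>();

    // Assumed format: one entry per line, "#" starts a comment line.
    DomainFilterSketch(Iterable<String> entries) {
        for (String entry : entries) {
            String e = entry.trim().toLowerCase(Locale.ROOT);
            if (e.startsWith("#")) {
                continue;
            }
            if (e.startsWith(".")) {
                e = e.substring(1); // treat ".com" and "com" the same way
            }
            if (!e.isEmpty()) {
                accepted.add(e);
            }
        }
    }

    // Returns the URL if its host, one of its parent domains, or its TLD is
    // listed; returns null otherwise (null means "filtered out").
    String filter(String url) {
        String host;
        try {
            host = URI.create(url).getHost();
        } catch (IllegalArgumentException ex) {
            return null; // unparseable URL: reject
        }
        if (host == null) {
            return null; // e.g. mailto: URIs have no host
        }
        String candidate = host.toLowerCase(Locale.ROOT);
        // Try www.example.com, then example.com, then com.
        while (true) {
            if (accepted.contains(candidate)) {
                return url;
            }
            int dot = candidate.indexOf('.');
            if (dot < 0) {
                return null;
            }
            candidate = candidate.substring(dot + 1);
        }
    }

    public static void main(String[] args) {
        DomainFilterSketch f = new DomainFilterSketch(
                Arrays.asList("# accepted entries", ".org", "example.com"));
        System.out.println(f.filter("http://www.apache.org/nutch/"));
        System.out.println(f.filter("http://example.net/"));
    }
}
```

Walking the host from most specific to least specific keeps each lookup to a few set probes, and it lets a single list mix TLD, domain, and hostname entries. Whether the real patch matches suffixes this way is not stated in this message.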