[ 
https://issues.apache.org/jira/browse/NUTCH-1942?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14320479#comment-14320479
 ] 

Julien Nioche commented on NUTCH-1942:
--------------------------------------

We already rely on it for robots parsing, so this is not much of a change. As 
for joining the ASF under another project (apache-commons?), there has been a 
recent discussion about that over there, which hasn't yet reached a 
conclusion. I personally think that CC is fine where it is. Feel free to join 
the discussion on the CC mailing list.


> Remove TopLevelDomain 
> ----------------------
>
>                 Key: NUTCH-1942
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1942
>             Project: Nutch
>          Issue Type: Task
>            Reporter: Julien Nioche
>            Priority: Minor
>              Labels: newbie
>             Fix For: 1.11
>
>
> We should leverage the domain-related utilities from crawler-commons instead 
> of duplicating them in the `org.apache.nutch.util.domain` package. For 
> instance, we could deprecate TopLevelDomain and call the corresponding class 
> in CC instead. The resources in CC are more up-to-date and it is less code to 
> maintain.
> This would be a good task for someone willing to get to know the Nutch 
> codebase better and impress us all with the extent of his/her skills.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
