[ 
https://issues.apache.org/jira/browse/NUTCH-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13447797#comment-13447797
 ] 

Ken Krugler commented on NUTCH-1465:
------------------------------------

Hi Lewis - I could start a thread, but I also don't want to flog a dead horse :)

I'm spending occasional small amounts of time trying to move code from Bixo 
over to CC, and the plan is for the 0.9 release of Bixo to switch over to using 
CC where possible.

But the lack of excitement among Droids, Heretrix, Common Crawl, Nutch, etc. 
has made it pretty clear getting wide-spread adoption would be an uphill 
battle, one that I don't have the time currently to fight.

-- Ken
                
> Support sitemaps in Nutch
> -------------------------
>
>                 Key: NUTCH-1465
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1465
>             Project: Nutch
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Lewis John McGibbney
>             Fix For: 1.6, 2.1
>
>
> I recently came across this rather stagnant codebase[0] which is ASL v2.0 
> licensed and appears to have been used successfully to parse sitemaps as per 
> the discussion here[1].
> [0] http://sourceforge.net/projects/sitemap-parser/
> [1] 
> http://lucene.472066.n3.nabble.com/Support-for-Sitemap-Protocol-and-Canonical-URLs-td630060.html

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to