Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The "GoogleSummerOfCode/SitemapCrawler/weeklyreport" page has been changed by 
CihadGuzel:
https://wiki.apache.org/nutch/GoogleSummerOfCode/SitemapCrawler/weeklyreport?action=diff&rev1=7&rev2=8

Comment:
reproduce report1

  
  '''Title :''' Sitemap url injection is done.
  
- Sitemap list injection is provided on this week working. The Url path of 
sitemap files wanted to be injected can be defined from seed file. 
+ Sitemap list injection is provided on this week working. The url path of 
sitemap files wanted to be injected can be defined from seed file. 
  
  In addition, Some preparatory work related to following works is being done. 
  
  ----
  Example:
  
- If you have two sitemap files, you can define them in the seed file as follow:
+ If you have two sitemap files for "http://www.example.com/"; , you can define 
them in the seed file as follow:
  
   * ''http://www.example.com/     sitemaps: sitemap1.xml sitemap2.xml'' 
   * ''http://www.example2.com/
   * ''http://www.example3.com/''
+ 
+ Then you can run InjecterJob. So the sitemaps urls are injected to the db. 
The urls injected are signed as sitemap.
  
  || '''Week :''' 2 (1 June 2015 - 7 June 2015) ||
  

Reply via email to