sebastian-nagel commented on PR #808:
URL: https://github.com/apache/nutch/pull/808#issuecomment-2000233258

   Hi Lewis, it's done in three steps:
   1. run `ant report-licenses` (Rat task) for core and all plugins
   2. process all reports: list all combinations of <project, license>, try to 
extract the organization and project description from the ivy cache, normalize 
license names, etc.
   3. manually verify the output of steps 1 and 2 and merge it with the existing 
license and notice files
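
To illustrate the "normalize license names" and "list all combinations" part of step 2, here is a minimal sketch. It is not the code from the notebook attached to NUTCH-2290: the alias table, the function names, and the sample pairs are hypothetical, and it assumes the <project, license> pairs have already been extracted from the reports.

```python
from collections import defaultdict

# Hypothetical alias table (an assumption, not taken from the notebook):
# maps lower-cased license name variants to one canonical name.
LICENSE_ALIASES = {
    "apache license, version 2.0": "Apache-2.0",
    "the apache software license, version 2.0": "Apache-2.0",
    "apache 2": "Apache-2.0",
    "mit license": "MIT",
    "the mit license": "MIT",
}


def normalize_license(name):
    """Collapse whitespace and map known variants to a canonical name."""
    cleaned = " ".join(name.split())
    # Unknown names are returned unchanged so they stand out in manual review.
    return LICENSE_ALIASES.get(cleaned.lower(), cleaned)


def group_by_license(pairs):
    """Group (project, license) pairs by normalized license name."""
    grouped = defaultdict(set)
    for project, license_name in pairs:
        grouped[normalize_license(license_name)].add(project)
    # Sort licenses and projects so the output is stable between runs.
    return {lic: sorted(projects) for lic, projects in sorted(grouped.items())}


# Hypothetical sample input, for illustration only.
sample_pairs = [
    ("commons-lang3", "Apache License, Version 2.0"),
    ("guava", "The Apache Software License,  Version 2.0"),
    ("slf4j-api", "MIT License"),
    ("foo", "Some Custom License"),
]

for license_name, projects in group_by_license(sample_pairs).items():
    print(license_name + ": " + ", ".join(projects))
```

Names that are not in the alias table pass through unchanged, which is one reason the generated output still needs the manual check in step 3.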
   
   Step 1 and 2 are done by the Jupyter notebook attached to NUTCH-2290. 
Because the output (NOTICE-binary and LICENSE-binary) is somewhat noisy, manual 
verification is necessary.
   
   See also NUTCH-2981: Storm has some scripts to automatically generate the 
license reports.

