[ http://issues.apache.org/jira/browse/NUTCH-59?page=comments#action_12365012 ]
James Jonas commented on NUTCH-59:
----------------------------------

Thanks, I have been tracking NUTCH-139 and NUTCH-192 and look forward to these patches being committed into the 0.8 trunk.

James

> meta data support in webdb
> --------------------------
>
>          Key: NUTCH-59
>          URL: http://issues.apache.org/jira/browse/NUTCH-59
>      Project: Nutch
>         Type: New Feature
>     Reporter: Stefan Groschupf
>     Priority: Minor
>  Attachments: webDBMetaDataPatch.txt
>
> Meta data support in the web db would be very useful for a new set of Nutch
> features that need long-lived meta data.
> Currently, page meta data needs to be regenerated or looked up every 30 days,
> each time a page is re-fetched. Over the long term, web db meta data would
> bring a dramatic performance improvement for such tasks.
> Furthermore, storing meta data in the webdb would make a new generation of
> linklist generation filters possible.

_______________________________________________
Nutch-developers mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-developers
