[ https://issues.apache.org/jira/browse/NUTCH-2750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sebastian Nagel updated NUTCH-2750: ----------------------------------- Summary: Improve CrawlDbReader & LinkDbReader reader handling (was: improve CrawlDbReader & LinkDbReader reader handling) > Improve CrawlDbReader & LinkDbReader reader handling > ---------------------------------------------------- > > Key: NUTCH-2750 > URL: https://issues.apache.org/jira/browse/NUTCH-2750 > Project: Nutch > Issue Type: Improvement > Components: crawldb, linkdb > Affects Versions: 1.16 > Reporter: Jurian Broertjes > Priority: Minor > > The current implementation in the CrawlDbReader re-opens readers for every > URL. This is not very efficient. I've implemented a modification time check > that only re-opens readers on updated crawlDB. > PR: https://github.com/apache/nutch/pull/483 -- This message was sent by Atlassian Jira (v8.3.4#803005)