Andrzej Bialecki wrote:
> Hmm.. I understand his point. But it means that I have to always put "if
(datum.getMetaData() == null)" check, which pollutes the code in all places that deal with metadata. Currently this is just CrawlDbReducer (but it already looks ugly there), but it will be like that in any place that wants to use metadata.

One thing to consider might be to add some methods to CrawlDatum like:

   public Writable getMeta(Writable key);

to minimize the null checks.

Or we can simply abandon this probably premature optimization. The MapReduce code now reuses keys and values (unless you're using a combiner...) so the allocation should be less of an issue.

Doug


-------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to