Hi Alex, On Tue, Jun 17, 2014 at 2:06 PM, <user-digest-h...@nutch.apache.org> wrote:
> > I am using nutch-2.x with GORA_97. You mean GORA-94, the Avro upgrade? With which gora- backend please? > Further investigation shows that DbUpdateReducer > calls > inlinkedScoreData.clear(); > I see this on line ~72 of DbUpdateReducer > > and it calls this function > > public void readFields(DataInput in) throws IOException { > Can you please point me to where ScoreDatum#readFields is called? > > And metaData.clear(); line clears all metadata. > Yes this should result in an empty HashMap data structure. > > Why metaData.clear(); line is needed in this function? > > It is poorly documented and this Class has not be altered for some time so off the top of my head I need to say that I do not know why. Based on the Javadoc for Writable, @Override readFields should "...should attempt to re-use storage in the existing object where possible." so I am not sure why we clear the metadata from the HashMap structure. I would need to debug this to understand. If you can provide more context on where ScoreDatum#readFields is called then I can set break point up until then. Thanks Alex Lewis