[ https://issues.apache.org/jira/browse/HADOOP-2110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sanjay Radia updated HADOOP-2110:
---------------------------------

    Attachment: brAsLongsPatch2.txt

The attached patch uses an array of longs instead of an array of Block objects for the block report (BR).

A block is described by 2 longs (block id and block length). Java's overhead for an object is 16 bytes, and in an array of objects each array entry is a separately allocated object, so an array of Block instances pays that overhead once per block.

Comparison of the block report processing times. All times are in milliseconds and are measured at the data node, from sending the BR to receiving an acknowledgement from the name node (NN) that the BR was processed. Using longs gives an approximately 50% improvement in BR processing time.

            BRs using Class Block      ||  BRs using Longs
Num         Time      Time             ||  Time      Time
Blocks      Total     per 1K blocks    ||  Total     per 1K blocks
---------------------------------------||-------------------------
 40000       651ms    16ms             ||   230ms    5.75ms
100000      1258ms    12ms             ||   573ms    5.73ms
300000      3521ms    12ms             ||  1800ms    6.00ms
500000      5930ms    12ms             ||  2916ms    5.83ms

> Block Report Optimization: Replace class instance
> -------------------------------------------------
>
>                 Key: HADOOP-2110
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2110
>             Project: Hadoop
>          Issue Type: Sub-task
>          Components: dfs
>    Affects Versions: 0.14.0
>            Reporter: Robert Chansler
>            Assignee: Sanjay Radia
>             Fix For: 0.16.0
>
>         Attachments: brAsLongsPatch2.txt
>
>
> Block report processing can be more efficient if instances of the block ID
> class are replaced with a sequence of longs.
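
For illustration only, here is a minimal sketch of what replacing an array of block objects with a flat array of longs could look like. It is not taken from brAsLongsPatch2.txt: the class name BlockReportSketch, the nested Block stand-in, and the helper methods toLongs, blockCount, blockId and blockLength are all hypothetical, and the [id, length, id, length, ...] ordering is an assumption based on the "2 longs per block" description.

```java
// Hypothetical sketch: every name here is illustrative, not from the patch.
public class BlockReportSketch {

    /** Stand-in for the object form of a report entry: one heap object per block. */
    static final class Block {
        final long id;
        final long length;

        Block(long id, long length) {
            this.id = id;
            this.length = length;
        }
    }

    /** Longs per block in the flat form: block id, then block length (assumed order). */
    static final int LONGS_PER_BLOCK = 2;

    /** Flattens the blocks into a single long[]: [id0, len0, id1, len1, ...]. */
    static long[] toLongs(Block[] blocks) {
        long[] report = new long[blocks.length * LONGS_PER_BLOCK];
        for (int i = 0; i < blocks.length; i++) {
            report[i * LONGS_PER_BLOCK] = blocks[i].id;
            report[i * LONGS_PER_BLOCK + 1] = blocks[i].length;
        }
        return report;
    }

    /** Number of blocks encoded in a flat report. */
    static int blockCount(long[] report) {
        return report.length / LONGS_PER_BLOCK;
    }

    /** Block id of the i-th block in a flat report. */
    static long blockId(long[] report, int i) {
        return report[i * LONGS_PER_BLOCK];
    }

    /** Block length of the i-th block in a flat report. */
    static long blockLength(long[] report, int i) {
        return report[i * LONGS_PER_BLOCK + 1];
    }

    public static void main(String[] args) {
        Block[] blocks = { new Block(1001L, 64L), new Block(1002L, 128L) };
        long[] report = toLongs(blocks);
        // Read the report back without creating any per-block objects.
        for (int i = 0; i < blockCount(report); i++) {
            System.out.println("block " + blockId(report, i)
                    + " length " + blockLength(report, i));
        }
    }
}
```

In this layout the receiver can walk the report by index arithmetic alone, so a report of N blocks is one array allocation of 2N longs rather than N separately allocated objects plus an array of references to them.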