[ https://issues.apache.org/jira/browse/HDFS-6865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
James Thomas updated HDFS-6865:
-------------------------------
    Attachment: HDFS-6865.6.patch

[~tlipcon], thanks for the review. Made all of your suggested changes except the one about calling {{writeChunk}} with larger buffers -- if you look at {{DFSOutputStream#writeChunk}}, you'll see that it has some assertions that check that only a single chunk has been passed in. It also calls {{Packet#writeData}} and {{Packet#writeChecksum}}, which expect that the written data will not overflow the packet. So we would need some logic that checks whether the write will overflow the packet and, if so, writes only enough to fill the packet, queues it, and then allocates a new packet and writes the remainder to that. Sounds like this is best left for later.

> Byte array native checksumming on client side (HDFS changes)
> ------------------------------------------------------------
>
>                 Key: HDFS-6865
>                 URL: https://issues.apache.org/jira/browse/HDFS-6865
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: hdfs-client, performance
>            Reporter: James Thomas
>            Assignee: James Thomas
>         Attachments: HDFS-6865.2.patch, HDFS-6865.3.patch, HDFS-6865.4.patch,
>                      HDFS-6865.5.patch, HDFS-6865.6.patch, HDFS-6865.patch
>
>
> Refactor FSOutputSummer to buffer data and use the native checksum
> calculation functionality introduced in HADOOP-10975.

--
This message was sent by Atlassian JIRA
(v6.2#6252)
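
For illustration, the deferred overflow handling described in the comment (fill the current packet, queue it, allocate a new packet, write the remainder) might look roughly like the sketch below. It is not taken from the patch or from {{DFSOutputStream}}: {{SketchPacket}}, {{PacketSplitSketch}}, {{queuedPackets}} and {{bytesInCurrent}} are hypothetical names, and checksums and chunk alignment are deliberately left out.

```java
import java.util.ArrayDeque;
import java.util.Queue;

/** Hypothetical sketch of splitting one large write across packets. */
final class PacketSplitSketch {

    /** Simplified stand-in for a packet: a fixed-capacity data buffer. */
    static final class SketchPacket {
        final byte[] data;
        int used;

        SketchPacket(int capacity) {
            data = new byte[capacity];
        }

        int remaining() {
            return data.length - used;
        }

        /** Like the real writeData, this refuses a write that would overflow. */
        void writeData(byte[] src, int off, int len) {
            if (len > remaining()) {
                throw new IllegalArgumentException("write would overflow packet");
            }
            System.arraycopy(src, off, data, used, len);
            used += len;
        }
    }

    private final int packetCapacity;
    private final Queue<SketchPacket> dataQueue = new ArrayDeque<>();
    private SketchPacket current;

    PacketSplitSketch(int packetCapacity) {
        this.packetCapacity = packetCapacity;
    }

    /** Writes len bytes, splitting them wherever a packet fills up. */
    void write(byte[] src, int off, int len) {
        while (len > 0) {
            if (current == null) {
                current = new SketchPacket(packetCapacity); // allocate a new packet
            }
            // Write only as much as fits in the current packet.
            int n = Math.min(len, current.remaining());
            current.writeData(src, off, n);
            off += n;
            len -= n;
            if (current.remaining() == 0) {
                dataQueue.add(current); // packet is full: queue it
                current = null;
            }
        }
    }

    int queuedPackets() {
        return dataQueue.size();
    }

    int bytesInCurrent() {
        return current == null ? 0 : current.used;
    }
}
```

With a capacity of 4 bytes, a single 10-byte write would fill and queue two packets and leave 2 bytes in a third, partially filled packet. A real implementation would also need to keep packet boundaries aligned to checksum chunks, which this sketch ignores.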