Hi,
Many places it has been written that to avoid huge no of disk seeks , we
store big blocks in HDFS , so that once we seek to the location , then
there is only data transfer rate which would be predominant , no more
seeks. I am not sure if I have understood this correctly.
My question is , no
From: Rahul Bhattacharjee [mailto:rahul.rec@gmail.com]
Subject: Why big block size for HDFS.
Many places it has been written that to avoid huge no of disk seeks , we store
big blocks in HDFS , so that once we seek to the location , then there is only
data transfer rate which would
When you seek to a position within a HDFS file, you are not seek from the
start of the first block and then one by one.
Actually DFSClient can skip some blocks until find one block, which offset
and block length includes your seek position.
On Mon, Apr 1, 2013 at 12:55 AM, Rahul
[mailto:rahul.rec@gmail.com]
*Subject:* Why big block size for HDFS.
** **
Many places it has been written that to avoid huge no of disk seeks , we
store big blocks in HDFS , so that once we seek to the location , then
there is only data transfer rate which would be predominant , no more