HDFS replication is transparent to Accumulo, although the number of replicas is configurable in Accumulo on a per-table basis. Its primary purpose is failure tolerance, but it *may* have an impact on read performance. I'm not certain how significant that is, though.
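For the per-table setting, here is a minimal sketch in the Accumulo shell. The table name `mytable` and the value 3 are placeholders of mine. The property is `table.file.replication`; a value of 0 or less means the HDFS default is used. As far as I know, the setting only applies to files written after the change, so existing files keep their old replication until a compaction rewrites them.

```shell
# Show the current replication setting for the (hypothetical) table "mytable"
config -t mytable -f table.file.replication

# Ask for 3 HDFS replicas for files this table writes from now on
config -t mytable -s table.file.replication=3

# Optional: rewrite the existing files so they pick up the new setting
compact -t mytable
```

These are run from inside the Accumulo shell, not from bash.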
There are no separate read-only and write-only copies of data on HDFS. HDFS replication is at the block level, and files are updated by appending new blocks to the files. All blocks are readable, and only new blocks are written.

On Thu, Nov 10, 2016 at 11:28 AM Yamini Joshi <yamini.1...@gmail.com> wrote:

> Hello all
>
> Does the HDFS replication improve performance of queries on Accumulo or is
> it transparent to the Accumulo system? If it does improve the performance
> by some notion of load balancing, is there is a Read Only or Write Only
> copy of data on HDFS for Accumulo?
>
> Best regards,
> Yamini Joshi