I have a two-rack cluster. All of my files have a replication factor of 2. How does HDFS determine which node to use when serving the data? Does it always use the first rack, or is there an algorithm for this?