Hi
I have a cluster spanning two datacenters:
                 CLUSTER
                    |
           +--------+---------+
           |                  |
      datacenter1        datacenter2
           |                  |
         rack1          +-----+-----+
           |            |           |
           +-a1       rack1       rack2
           |            |           |
           +-a2         +-b1        +-b3
                        |
                        +-b2
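For reference, here is a minimal sketch of how a topology like this might be described to Hadoop through a rack-awareness script (the kind of script pointed to by net.topology.script.file.name). The host-to-location table is an assumption read off the diagram and the host list in this post (in particular, placing b2 in datacenter2/rack1 is a guess), and the fallback location is hypothetical. It is not the actual configuration of the cluster.

```python
#!/usr/bin/env python3
"""Sketch of a Hadoop rack-awareness script for the topology in this post.

Hadoop calls such a script with one or more host names or IP addresses as
arguments and expects one network location per argument on stdout.
"""
import sys

# Assumed host-to-location table, read off the diagram (not real config).
TOPOLOGY = {
    "a1": "/datacenter1/rack1",
    "a2": "/datacenter1/rack1",
    "b1": "/datacenter2/rack1",
    "b2": "/datacenter2/rack1",  # assumption: b2 shares rack1 with b1
    "b3": "/datacenter2/rack2",
}

# Hypothetical fallback, kept at the same depth as the known locations.
DEFAULT_LOCATION = "/default/rack"


def resolve(host):
    """Return the network location for a host name (short or fully qualified)."""
    short_name = host.split(".")[0]  # "b3.example.com" -> "b3"
    return TOPOLOGY.get(short_name, DEFAULT_LOCATION)


if __name__ == "__main__":
    for arg in sys.argv[1:]:
        print(resolve(arg))
```

Anything not in the table, including a raw IP address, falls through to the fallback location in this sketch, so a real script would also need entries for the nodes' IP addresses.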
The cluster holds a file with a replication factor of 5,
so every block of the file resides on every server in the cluster.
When I run a job with standard MapReduce (MRv1), submitted from b1, the
Map and Reduce tasks run on all servers: b1, b2, b3, a1, a2.
When I run it with YARN (MRv2), also submitted from b1, the Map and
Reduce tasks run only on b1, b2, b3.
Is there a way to make YARN run Map tasks on all servers?
--
Consultant, 1st category
Kostarev A.F.