Hi Kenrick,

Thanks for your input.

Regards,
Ravikant

On Tue, Nov 28, 2017 at 1:26 AM, Kenrick Fernandes <kenrick....@gmail.com>
wrote:

> Ravikant,
>
> As far as I know, there is no way to do this (at least not without changing
> the network names of the machines, which is outside the scope of the Giraph
> code).
> However, a simple solution might be to build a mapping data structure that
> stores the underlying node IDs, so that you can access them contiguously.
> Alternatively, exclude nodes only from the start or the end so that
> contiguity is preserved.
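
A minimal sketch of the mapping idea above, assuming the sparse task IDs are
available as plain integers. TaskIdIndex and its methods are hypothetical
names for illustration, not part of Giraph, and how the IDs are read from each
WorkerInfo is left out.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/**
 * Hypothetical helper (not a Giraph class): maps sparse task IDs
 * such as 1, 2, 4, 7, 43 to contiguous indices 0..n-1 and back.
 */
final class TaskIdIndex {
    // Position i holds the i-th smallest task ID.
    private final List<Integer> indexToTaskId;
    // Reverse lookup: task ID -> contiguous index.
    private final Map<Integer, Integer> taskIdToIndex = new HashMap<>();

    TaskIdIndex(Iterable<Integer> taskIds) {
        List<Integer> sorted = new ArrayList<>();
        for (Integer id : taskIds) {
            sorted.add(id);
        }
        // Sorting makes the index assignment deterministic across runs.
        Collections.sort(sorted);
        for (int i = 0; i < sorted.size(); i++) {
            Integer previous = taskIdToIndex.put(sorted.get(i), i);
            if (previous != null) {
                throw new IllegalArgumentException("Duplicate task ID: " + sorted.get(i));
            }
        }
        indexToTaskId = Collections.unmodifiableList(sorted);
    }

    /** Number of distinct task IDs. */
    int size() {
        return indexToTaskId.size();
    }

    /** Contiguous index (0-based) for a task ID. */
    int indexOf(int taskId) {
        Integer index = taskIdToIndex.get(taskId);
        if (index == null) {
            throw new IllegalArgumentException("Unknown task ID: " + taskId);
        }
        return index;
    }

    /** Original task ID stored at a contiguous index. */
    int taskIdAt(int index) {
        return indexToTaskId.get(index);
    }
}
```

With such a helper, code that expects consecutive IDs could work on
indexOf(taskId) and translate back with taskIdAt(index) only when it needs to
address the actual worker.
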
>
> Regards,
> Kenrick
>
> On Mon, Nov 27, 2017 at 12:54 AM, Ravikant Dindokar <
> ravikant.i...@gmail.com> wrote:
>
>> Hi,
>>
>> I am trying to change the mapping of partitions to workers in
>>
>> createInitialPartitionOwners(Collection<WorkerInfo> availableWorkerInfos,
>> int maxWorkers), defined in
>> org.apache.giraph.partition.MasterGraphPartitionerImpl.java.
>>
>> The first argument to this method is the list of available workers and
>> each worker has a task id associated with it.
>>
>> When I specify the number of workers as 40, the task IDs are 1 to 40.
>>
>> I have removed one node from the YARN cluster by following this answer:
>> https://stackoverflow.com/questions/41820759/how-to-separately-specify-a-set-of-nodes-for-hdfs-and-others-for-mapreduce-jobs
>>
>> After this change, the task IDs assigned to the workers are no longer
>> consecutive (e.g. for 40 workers the task IDs range from 1 to 43). Some
>> task IDs are skipped.
>>
>> Is there any way to make these task IDs strictly consecutive?
>>
>> Thanks
>> Ravikant
>>
>
>