mingmwang commented on code in PR #261:
URL: https://github.com/apache/arrow-ballista/pull/261#discussion_r984160816


##########
ballista/rust/core/proto/ballista.proto:
##########
@@ -521,12 +554,12 @@ message FetchPartition {
   uint32 port = 6;
 }
 
-// Mapping from partition id to executor id
 message PartitionLocation {
-  PartitionId partition_id = 1;
-  ExecutorMetadata executor_meta = 2;
-  PartitionStats partition_stats = 3;
-  string path = 4;
+  uint32 map_partition_id = 1;
+  PartitionId partition_id = 2;

Review Comment:
   Sure, will add more comments in the code. Here the new added 
`map_partition_id` is the partition_id of the map stage who produce this 
shuffle data.  But the original `PartitionId partition_id` has different 
meanings in different places.
   Sometimes it stands for a task_id, and here it stands for a shuffle 
partition id(composition of map_stage_id + reduce partition id + job_id), a 
mixed up of map and reduce infos together.
   So if we do not consider the backward compatibility, I would suggest to 
unnest this and make the PartitionLocation a plan struct like blow:
   
   ````
   message PartitionLocation {
     string job_id = 1;
     uint32 map_stage_id = 2;
     uint32 map_partition_id = 3;
     uint32 partition_id = 4;
     ExecutorMetadata executor_meta = 5;
     PartitionStats partition_stats = 6;
     string path = 7;
   }
   ````



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to