walterddr opened a new issue #7433:
URL: https://github.com/apache/pinot/issues/7433


   In many use cases such as Spark / Flink ingestion, multiple instances of 
workers will be instantiated to upload segments in parallel 
   
   Currently the default SegmentWriter and SegmentUploader cannot support this 
easily: One needs to generate slightly different TableConfig and 
BatchIngestionConfig in order to modify the tmp directory name, segment name 
and other usages. Otherwise, spawning multiples instances within the same host 
will cause File write conflict.
   
   Propose to: create a method to easily set parallelism index for 
SegmentWriter and Uploader; such as adding APIs to modify the parallelism index 
directly instead of via configuration changes.
   
   CC @npawar, @xiangfu0 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@pinot.apache.org
For additional commands, e-mail: commits-h...@pinot.apache.org

Reply via email to