Angel Conde  created HUDI-3994:
----------------------------------

             Summary: HoodieDeltaStreamer - Spark master shouldn't have a 
default
                 Key: HUDI-3994
                 URL: https://issues.apache.org/jira/browse/HUDI-3994
             Project: Apache Hudi
          Issue Type: Improvement
          Components: deltastreamer
            Reporter: Angel Conde 
             Fix For: 0.11.0


When trying to run HoodieDeltaStreamer on AWS Glue I found that the Spark 
master has no option to inherit from the environment as it defaults to 
{{{}local[2]{}}}. In these kind of Serverless environments where you do not 
have access to the master this configuration should be inherited

This can be seen on line 329 on 
[HoodieDeltaStreamer|https://github.com/apache/hudi/blob/master/hudi-utilities/src/main/java/org/apache/hudi/utilities/deltastreamer/HoodieDeltaStreamer.java].

{{public String sparkMaster = "local[2]";}}

This should be changed for supporting this kind of scenarios, a 
JavaSparkContext option where no Spark master is defined should be there.

*Expected behavior*

The Spark master shouldn't have a default as there are some environments 
(usually serverless such as AWS Glue) where it will be inherited.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

Reply via email to