TheNeuralBit commented on a change in pull request #14520:
URL: https://github.com/apache/beam/pull/14520#discussion_r617968582



##########
File path: 
runners/spark/src/main/java/org/apache/beam/runners/spark/SparkPipelineRunner.java
##########
@@ -115,13 +114,6 @@ public PortablePipelineResult run(RunnerApi.Pipeline 
pipeline, JobInfo jobInfo)
             : GreedyPipelineFuser.fuse(trimmedPipeline).toPipeline();
 
     // File staging.
-    if (pipelineOptions.getFilesToStage() == null) {
-      pipelineOptions.setFilesToStage(
-          detectClassPathResourcesToStage(
-              SparkPipelineRunner.class.getClassLoader(), pipelineOptions));
-      LOG.info(
-          "PipelineOptions.filesToStage was not specified. Defaulting to files 
from the classpath");
-    }

Review comment:
       I think removing this broke Python PostCommits, see BEAM-12207. We're 
getting an NPE from the LOG statement below, presumably because 
`pipelineOptions.getFilesToStage() == null`. I'm not sure what the appropriate 
fix is, is it expected that now the null checking happens somewhere else, or 
should we add a null check back here?




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to