Hi,
Maybe you could share some code, so we could have a better picture of
what is going on.
Last time I had to read from HDFS (normally in our pipelines HDFS is
just a sink), we used FileIO:
https://beam.apache.org/documentation/sdks/javadoc/2.3.0/index.html?org/apache/beam/sdk/io/FileIO.htm
Hello all -
I have installed a pseudo-distributed yarn and spark.
My beam pipeline reads a TextIO from file and it runs fine when I launch
the pipeline using --master spark://master.
However, I am having difficulties in getting this run with --master yarn.
I am pretty sure using TextIO from a loca