Re: Help in reading hadoop HDFS file into PCollection

2018-09-01 Thread Leonardo Campos | GameDuell
Hi, maybe you could share some code so we can get a better picture of what is going on. The last time I had to read from HDFS (in our pipelines HDFS is normally just a sink), we used FileIO: https://beam.apache.org/documentation/sdks/javadoc/2.3.0/index.html?org/apache/beam/sdk/io/FileIO.htm

Help in reading hadoop HDFS file into PCollection

2018-09-01 Thread Mahesh Vangala
Hello all - I have installed pseudo-distributed YARN and Spark. My Beam pipeline reads from a file using TextIO, and it runs fine when I launch the pipeline using --master spark://master. However, I am having difficulty getting it to run with --master yarn. I am pretty sure using TextIO from a loca