Parquet file amazon s3a timeout

2017-05-17 Thread Karin Valisova
Hello! I'm working with some parquet files saved on amazon service and loading them to dataframe with Dataset df = spark.read() .parquet(parketFileLocation); however, after some time I get the "Timeout waiting for connection from pool" exception. I hope I'm not mistaken, but I think that there's

Re: apache-spark: Converting List of Rows into Dataset Java

2017-03-30 Thread Karin Valisova
.getOrCreate(); > List results = new LinkedList(); > JavaRDD jsonRDD = > new JavaSparkContext(sparkSession. > sparkContext()).parallelize(results); > > Dataset peopleDF = sparkSession.createDataFrame(jsonRDD, > Row.class

apache-spark: Converting List of Rows into Dataset Java

2017-03-28 Thread Karin Valisova
Hello! I am running Spark on Java and bumped into a problem I can't solve or find anything helpful among answered questions, so I would really appreciate your help. I am running some calculations, creating rows for each result: List results = new LinkedList(); for(something){

apache-spark: Converting List of Rows into Dataset Java

2017-03-27 Thread Karin Valisova
Hello! I am running Spark on Java and bumped into a problem I can't solve or find anything helpful among answered questions, so I would really appreciate your help. I am running some calculations, creating rows for each result: List results = new LinkedList(); for(something){