1. Does rdd.collect() return the lines in the same order as they appear in the input file?
2. Does df1.collect() return the rows in the same order as rdd.collect()?
3. Does df2.collect() return the rows in the same order as df1.collect()?
Please support your answers with references to the Spark API documentation. I couldn't find anything on this there.