It works well now, thanks.

---Original---
From: "Sam Elamin"<hussam.ela...@gmail.com>
Date: 2017/2/14 19:54:36
To: "??????????"<1427357...@qq.com>;
Cc: "user"<user@spark.apache.org>;
Subject: Re: how to fix the order of data


It's because you are printing directly from the RDD: foreach runs on the partitions in parallel, so the output order is not guaranteed.

You can sort the DataFrame like below:

input.toDF("value").sort("value").collect()

Or, if you do not want to convert to a DataFrame, you can sort the RDD directly with sortBy, or with sortByKey([ascending], [numTasks]) if it is a key-value RDD.
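
For reference, here is a minimal sketch of both approaches, assuming Spark 2.x and a local session. The object name SortedOutput and the local[*] master are only for illustration; in spark-shell, spark and sc already exist. Since sortByKey only applies to key-value RDDs, the sketch uses sortBy for the RDD[Int] from the question, and it collects to the driver before printing so the output order is stable:

```scala
import org.apache.spark.sql.SparkSession

// Hypothetical standalone wrapper; in spark-shell only the body of main is needed.
object SortedOutput {
  def main(args: Array[String]): Unit = {
    // Assumption: a local session, used here purely for illustration.
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("sorted-output")
      .getOrCreate()
    val sc = spark.sparkContext
    import spark.implicits._ // needed for toDF on an RDD

    val input = sc.parallelize(Array(1, 2, 3))

    // foreach(print) runs on the executors, partition by partition and in parallel,
    // so its output order can change between runs. Collect to the driver first.

    // Option 1: sort the RDD itself, then collect and print on the driver.
    val sortedRdd = input.sortBy(x => x).collect()
    println(sortedRdd.mkString)

    // Option 2: convert to a DataFrame and sort on an explicitly named column.
    val sortedDf = input.toDF("value").sort("value").collect().map(_.getInt(0))
    println(sortedDf.mkString)

    // Option 3: sortByKey requires a key-value RDD, so build pairs first.
    val sortedByKey = input.map(x => (x, x)).sortByKey(ascending = true).keys.collect()
    println(sortedByKey.mkString)

    spark.stop()
  }
}
```

If the data is too large to collect, the sorted RDD can be written out with saveAsTextFile instead; the part files then follow the sort order.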




Regards

Sam

On Tue, Feb 14, 2017 at 11:41 AM, ?????????? <1427357...@qq.com> wrote:
Hi all,
The following is my test code. I found that the output of val input is different on each run. How do I fix the order, please?

scala> val input = sc.parallelize(Array(1,2,3))
input: org.apache.spark.rdd.RDD[Int] = ParallelCollectionRDD[13] at parallelize at <console>:24

scala> input.foreach(print)
132
scala> input.foreach(print)
213
scala> input.foreach(print)
312
