cache() method returns new RDD so you have to use something like this: val person = sc.textFile("hdfs://namenode_host:8020/user/person.txt").map(_.split(",")).map(p => Person(p(0).trim.toInt, p(1)))
val cached = person.cache cached.count when you rerun count on cached you will see that cache works On Wed, Apr 1, 2015 at 9:35 AM, fightf...@163.com <fightf...@163.com> wrote: > Hi > That is just the issue. After running person.cache we then run > person.count > however, there still not be any cache performance showed from web ui > storage. > > Thanks, > Sun. > > ------------------------------ > fightf...@163.com > > > *From:* Taotao.Li <taotao...@datayes.com> > *Date:* 2015-04-01 14:02 > *To:* fightfate <fightf...@163.com> > *CC:* user <user@spark.apache.org> > *Subject:* Re: rdd.cache() not working ? > rerun person.count and you will see the performance of cache. > > person.cache would not cache it right now. It'll actually cache this RDD > after one action[person.count here] > > ------------------------------ > *发件人: *fightf...@163.com > *收件人: *"user" <user@spark.apache.org> > *发送时间: *星期三, 2015年 4 月 01日 下午 1:21:25 > *主题: *rdd.cache() not working ? > > Hi, all > > Running the following code snippet through spark-shell, however cannot see > any cached storage partitions in web ui. > > Does this mean that cache now working ? Cause if we issue person.count > again that we cannot say any time consuming > > performance upgrading. Hope anyone can explain this for a little. > > Best, > > Sun. > > case class Person(id: Int, col1: String) > > > val person = > sc.textFile("hdfs://namenode_host:8020/user/person.txt").map(_.split(",")).map(p > => Person(p(0).trim.toInt, p(1))) > > person.cache > > person.count > > ------------------------------ > fightf...@163.com > > > > -- > > > *---------------------------------------------------------------------------* > > *Thanks & Best regards* > > 李涛涛 Taotao · Li | Fixed Income@Datayes | Software Engineer > > 地址:上海市浦东新区陆家嘴西路99号万向大厦8楼, 200120 > Address :Wanxiang Towen 8F, Lujiazui West Rd. No.99, Pudong New District, > Shanghai, 200120 > > 电话|Phone:021-60216502 手机|Mobile: +86-18202171279 > >