subject:"答复\: 答复\: 答复\: How to store 10M records in HDFS to speed up further filtering\?"

Re: 答复: 答复: 答复: How to store 10M records in HDFS to speed up further filtering?

2017-04-20 Thread Ryan

yan <ryan.hd....@gmail.com> > *Sent:* April 17, 2017, 16:48:47 > *To:* 莫涛 > *Cc:* user > *Subject:* Re: 答复: 答复: How to store 10M records in HDFS to speed up further > filtering? > > How about the event timeline on the executors? It seems adding more executors > could help.

答复: 答复: 答复: How to store 10M records in HDFS to speed up further filtering?

2017-04-20 Thread 莫涛

ail.com> Sent: April 17, 2017, 16:48:47 To: 莫涛 Cc: user Subject: Re: 答复: 答复: How to store 10M records in HDFS to speed up further filtering? How about the event timeline on the executors? It seems adding more executors could help. 1. I found a JIRA issue (https://issues.apache.org/jira/browse/SPARK-11621) that state

答复: 答复: 答复: How to store 10M records in HDFS to speed up further filtering?

2017-04-20 Thread 莫涛

From: Jörn Franke <jornfra...@gmail.com> Sent: April 17, 2017, 22:37:48 To: 莫涛 Cc: user@spark.apache.org Subject: Re: 答复: How to store 10M records in HDFS to speed up further filtering? Yes, 5 MB is a difficult size, too small for HDFS too b