Re: Why percentile and distinct are not done in one job?

2018-07-31 Thread 吴晓菊
I mean in AnalyzeColumnCommand.scala the first one to compute percentiles and the second one to compute columnStats. Chrysan Wu 吴晓菊 Phone:+86 17717640807 2018-07-30 23:28 GMT+08:00 Reynold Xin : > Which API are you talking about? > > On Mon, Jul 30, 2018 at 7:03 AM 吴晓菊 wrote: > >> I noticed

Re: Why percentile and distinct are not done in one job?

2018-07-30 Thread Reynold Xin
Which API are you talking about? On Mon, Jul 30, 2018 at 7:03 AM 吴晓菊 wrote: > I noticed that in column analyzing, 2 jobs will run separately to > calculate percentiles and then distinct. Why not combine into one job since > HyperLogLog also supports merge? > > Chrysan Wu > Phone:+86 17717640807

Why percentile and distinct are not done in one job?

2018-07-30 Thread 吴晓菊
I noticed that in column analyzing, 2 jobs will run separately to calculate percentiles and then distinct. Why not combine into one job since HyperLogLog also supports merge? Chrysan Wu Phone:+86 17717640807