Please check this doc: https://kylin.apache.org/docs/howto/howto_optimize_build.html
陈熹(chenxi07)-技术产品中心 <chenx...@qiyi.com> 于2018年11月5日周一 下午3:25写道: > Hi: > > I’m sorry the picture is dead again. > > I upload it as attachment this time > > > > -- > > Best regards, > > > > Xi Chen > > > > > > *From:* 陈熹(chenxi07)-技术产品中心 <chenx...@qiyi.com> > *Sent:* Monday, November 5, 2018 3:04 PM > *To:* dev@kylin.apache.org > *Subject:* How to increase split number for Fact distinct columns when > using spark engine?(picture added) > > > > Hi, ALL: > > I’m using spark engine to build cube. > > Now I found the bottleneck of build time lies in the #3 Step Name: Extract > Fact Table Distinct Columns. > > When I look into the spark application, I found there is only two splits > regardless of how large the input sequence file is. > > I wonder how to increase the number of split for this step? > > I’m new to spark and any help will be great thanks! > > > > P.S. Spark job of #3 Step Name: Extract Fact Table Distinct Columns. > > -- > > Best regards, > > > > Xi Chen > > > > > -- Best regards, Shaofeng Shi 史少锋