Hi, ALL: I’m using spark engine to build cube. Now I found the bottleneck of build time lies in the #3 Step Name: Extract Fact Table Distinct Columns. When I look into the spark application, I found there is only two splits regardless of how large the input sequence file is. I wonder how to increase the number of split for this step? I’m new to spark and any help will be great thanks!
P.S. Spark job of #3 Step Name: Extract Fact Table Distinct Columns. [cid:image001.png@01D47518.C74E9D70] [cid:image002.png@01D47518.C74E9D70] [cid:image003.png@01D47518.C74E9D70] [cid:image004.png@01D47518.C74E9D70] -- Best regards, Xi Chen