[ https://issues.apache.org/jira/browse/KYLIN-4321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17010215#comment-17010215 ]
weibin0516 commented on KYLIN-4321: ----------------------------------- Past experience and a large amount of test data show that Spark's performance is significantly better than MapReduce. !screenshot-1.png! !screenshot-2.png! Currently, when the cube is built with the spark engine, the `Create fact distinct columns` step uses mapreduce by default. Here we want to use the spark engine to perform this step by default, that is, modify the` kylin.engine.spark-fact-distinct` value to true. > Create fact distinct columns using spark by default when build engine is spark > ------------------------------------------------------------------------------ > > Key: KYLIN-4321 > URL: https://issues.apache.org/jira/browse/KYLIN-4321 > Project: Kylin > Issue Type: Improvement > Reporter: weibin0516 > Assignee: weibin0516 > Priority: Major > Fix For: v3.1.0 > > Attachments: screenshot-1.png, screenshot-2.png > > -- This message was sent by Atlassian Jira (v8.3.4#803005)