[jira] [Updated] (KYLIN-1094) improve performance of spark cubing

ZhouQianhao (JIRA) Wed, 28 Oct 2015 19:23:36 -0700

     [ 
https://issues.apache.org/jira/browse/KYLIN-1094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


ZhouQianhao updated KYLIN-1094:
-------------------------------
    Issue Type: Sub-task  (was: Improvement)
        Parent: KYLIN-744

> improve performance of spark cubing
> -----------------------------------
>
>                 Key: KYLIN-1094
>                 URL: https://issues.apache.org/jira/browse/KYLIN-1094
>             Project: Kylin
>          Issue Type: Sub-task
>          Components: Spark Engine
>    Affects Versions: v2.0
>            Reporter: ZhouQianhao
>            Assignee: ZhouQianhao
>
> POC result of spark cubing shows that, on a dataset of 150 million records, 
> MR is about 100% faster than Spark, however we believe that Spark could be at 
> least at same speed as MR, so optimization is needed here.
> We are asking Spark community for help now.
> the cluster info:
> vm: 8 nodes * (128G mem + 64 core)
> hadoop cluster: hdp 2.2.6
> spark running mode: yarn-client
> spark version: 1.5.1



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Updated] (KYLIN-1094) improve performance of spark cubing

Reply via email to