[ https://issues.apache.org/jira/browse/SYSTEMML-2278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Matthias Boehm updated SYSTEMML-2278:
-------------------------------------
Summary: Performance issues spark ctable(X,Y) w/ large num distinct (was:
Performance issues spark table(X,Y) w/ large num distinct)
> Performance issues spark ctable(X,Y) w/ large num distinct
> ----------------------------------------------------------
>
> Key: SYSTEMML-2278
> URL: https://issues.apache.org/jira/browse/SYSTEMML-2278
> Project: SystemML
> Issue Type: Task
> Reporter: Matthias Boehm
> Priority: Major
>
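> {code}
> # X holds N distinct values; Y spans roughly 1 to 1e6
> N = 1e9
> X = seq(1,N);
> Y = 1+seq(1,N)/1000;
> # empty loop, presumably used to isolate table() in its own program block
> while(FALSE){}
> Z = table(X,Y)
> while(FALSE){}
> # consume Z so the contingency table is actually computed
> print(sum(Z));
> {code}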