[ https://issues.apache.org/jira/browse/SYSTEMML-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15488963#comment-15488963 ]
Deron Eriksson commented on SYSTEMML-906: ----------------------------------------- I'll take a look at this. > Use of `df.first` in MLContextUtil is a Bottleneck > -------------------------------------------------- > > Key: SYSTEMML-906 > URL: https://issues.apache.org/jira/browse/SYSTEMML-906 > Project: SystemML > Issue Type: Improvement > Reporter: Mike Dusenberry > > The use of {{dataframe.first()}} at {{MLContextUtil.java:497}} causes a > severe bottleneck if the input DataFrame is a lazy result of a > compute-intensive chain. We should change this to use {{dataframe.schema}}, > and then iterate over the types in the schema. > cc [~deron] -- This message was sent by Atlassian JIRA (v6.3.4#6332)