[ https://issues.apache.org/jira/browse/SYSTEMML-995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15539339#comment-15539339 ]

Mike Dusenberry commented on SYSTEMML-995:
------------------------------------------

Yeah, here was the explain output:

{code}
--MAIN PROGRAM
----GENERIC (lines 1-7) [recompile=true]
------CP createvar _fVar9 scratch_space//_p21642_9.30.110.134//_t0/temp7 true FRAME binaryblock -1 -1 1000 1000 -1 copy
------SPARK rblk trainf.FRAME.DOUBLE.false _fVar9.FRAME.DOUBLE 1000 1000 true
------CP createvar _fVar10 scratch_space//_p21642_9.30.110.134//_t0/temp8 true FRAME binaryblock -1 -1 1000 1000 -1 copy
------SPARK chkpoint _fVar9.FRAME.DOUBLE.false _fVar10.FRAME.DOUBLE MEMORY_AND_DISK
------CP rmvar _fVar9
------CP nrow _fVar10.FRAME.DOUBLE.false _Var11.SCALAR.INT
------CP ncol _fVar10.FRAME.DOUBLE.false _Var12.SCALAR.INT
------CP rmvar _fVar10
------CP assignvar _Var11.SCALAR.INT.false n.SCALAR.INT
------CP assignvar _Var12.SCALAR.INT.false d.SCALAR.INT
------CP rmvar _Var11
------CP rmvar _Var12
{code}

> MLContext dataframe-frame conversion with index column & vector column
> ----------------------------------------------------------------------
>
>                 Key: SYSTEMML-995
>                 URL: https://issues.apache.org/jira/browse/SYSTEMML-995
>             Project: SystemML
>          Issue Type: Bug
>          Components: APIs
>    Affects Versions: SystemML 0.11
>            Reporter: Matthias Boehm
>            Priority: Blocker
>
> MLContext currently always assumes data frame to frame conversion without an
> existing index column. Since the user cannot communicate the existence of 
> this column, the data conversion leads to incorrect results as an additional 
> column is included in the output frame. We need to make the MLContext handling 
> of frames consistent with the handling of matrices.
> Additionally, the conversion code in 
> {{MLContextConversionUtil.dataFrameToFrameObject()}} does not yet take into 
> account frames with vectors, although a recent change added this support 
> to the underlying {{FrameRDDConverterUtils.java}} class. Therefore, the 
> number of columns set when {{mc == null}} is incorrect.
> Thanks [~mwdus...@us.ibm.com] for catching this issue. cc [~acs_s] [~deron]
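
To make the column-count problem in the description concrete, here is a minimal Python sketch of how the expected number of frame columns could be derived from a data frame schema. It is only an illustration under stated assumptions, not the SystemML implementation: the `Column` class, the `frame_column_count` helper, and the `__INDEX` column name are hypothetical names chosen for this example, and it assumes each vector column expands into one frame column per vector element.

```python
from dataclasses import dataclass
from typing import List, Optional

# Assumption for this sketch: the row-index column carries this name.
INDEX_COLUMN = "__INDEX"


@dataclass
class Column:
    """Hypothetical stand-in for one field of a data frame schema."""
    name: str
    # None for a scalar column; number of elements for a vector column.
    vector_length: Optional[int] = None


def frame_column_count(schema: List[Column], has_index_column: bool) -> int:
    """Count the columns the output frame should have.

    The index column (if the caller says one exists) is skipped, and a
    vector column contributes one output column per element (assumption).
    """
    count = 0
    for col in schema:
        if has_index_column and col.name == INDEX_COLUMN:
            continue  # the index column must not appear in the output frame
        count += col.vector_length if col.vector_length is not None else 1
    return count


schema = [
    Column(INDEX_COLUMN),
    Column("label"),
    Column("features", vector_length=3),
]

# Counting schema fields directly treats the index as data and the vector as
# a single column, which is the kind of mismatch the description reports.
naive_count = len(schema)
expected_count = frame_column_count(schema, has_index_column=True)
print(naive_count, expected_count)
```

With this example schema, the naive field count and the expanded count differ, which is why the caller needs a way to state whether an index column is present.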



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
