[ https://issues.apache.org/jira/browse/SYSTEMML-845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15410156#comment-15410156 ]
Mike Dusenberry commented on SYSTEMML-845: ------------------------------------------ [~niketanpansare] Yes, your assumption is correct -- same DML, just the {{mnist_lenet-train.dml}} uses DML-bodied functions. Same performance in forced singlenode, but performance regressions for the DML-bodied version in hybrid MR or hybrid Spark. I'll go run the limited, 2-epoch versions with {{-explain recompile_hops}}. For time reasons, I'll just terminate the processes once they've spit out the explain info. > Compare Performance of LeNet Scripts With & Without Using SystemML-NN > --------------------------------------------------------------------- > > Key: SYSTEMML-845 > URL: https://issues.apache.org/jira/browse/SYSTEMML-845 > Project: SystemML > Issue Type: Improvement > Reporter: Mike Dusenberry > Attachments: convert.dml, lenet-train-spark-explain.log, > log08.03.16-1470268602.txt, mnist_lenet-train-spark-explain.log, perf.sh, > run.sh > > > This JIRA issue tracks the comparison of the performance of the LeNet scripts > with & without using SystemML-NN. The goal is that they should have equal > performance in terms of both accuracy and time. Any difference will be > indicate areas of engine improvement. > Scripts: > * [mnist_lenet-train.dml | > https://github.com/apache/incubator-systemml/blob/master/scripts/staging/SystemML-NN/examples/mnist_lenet-train.dml] > - LeNet script that *does* use the SystemML-NN library. > * [lenet-train.dml | > https://github.com/apache/incubator-systemml/blob/master/scripts/staging/lenet-train.dml] > - LeNet script that *does not* use the SystemML-NN library. > To fully reproduce, I basically created a directory, placed the two attached > bash scripts in it, grabbed a copy of the NN library and placed it into the > directory, ran the examples/get_mnist_data.sh script from the library to get > the data (placed into examples/data), then used the attached convert.dml to > create binary copies of the data for both scripts, then ran run.sh. Also, I > copied examples/data to the base directory as well. Adjust the {{EXEC}} and > related variables in {{perf.sh}} to switch between standalone, Spark, memory > sizes, explain, stats, etc. -- This message was sent by Atlassian JIRA (v6.3.4#6332)