[ https://issues.apache.org/jira/browse/SYSTEMML-843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Imran Younus updated SYSTEMML-843: ---------------------------------- Description: I'm running the tSNE script in standalone mode with a subset of MNIST data (2500 points). I ran this with and without `-exec singlenode`. Here are the stats: (BTW, the same function implemented in python takes less than 10 sec!) -> with singlenode flag {code} ./bin/systemml scripts/staging/tSNE.dml -stats -nvargs X=/home/iyounus/workspace/tsne_python/mnist2500_X.txt Y=Y_out.txt C=C_out.txt 16/08/01 16:46:54 INFO api.DMLScript: SystemML Statistics: Total elapsed time: 109.667 sec. Total compilation time: 0.407 sec. Total execution time: 109.260 sec. Number of compiled MR Jobs: 0. Number of executed MR Jobs: 0. Cache hits (Mem, WB, FS, HDFS): 223692/0/0/1. Cache writes (WB, FS, HDFS): 80351/0/2. Cache times (ACQr/m, RLS, EXP): 0.289/0.015/85.192/0.043 sec. HOP DAGs recompiled (PRED, SB): 0/0. HOP DAGs recompile time: 0.007 sec. Functions recompiled: 1. Functions recompile time: 0.039 sec. Total JIT compile time: 4.924 sec. Total JVM GC count: 312. Total JVM GC time: 1.12 sec. Heavy hitter instructions (name, time, count): -- 1) tsne 109.202 sec 1 -- 2) x2p 109.189 sec 1 -- 3) leftIndex 106.728 sec 32136 -- 4) tsmm 0.564 sec 1 -- 5) exp 0.376 sec 8034 -- 6) rangeReIndex 0.201 sec 40170 -- 7) / 0.183 sec 24103 -- 8) * 0.161 sec 16069 -- 9) + 0.144 sec 22840 -- 10) uak+ 0.106 sec 8036 16/08/01 16:46:54 INFO api.DMLScript: END DML run 08/01/2016 16:46:54 {code} -> without singlenode flag {code} > ./bin/systemml scripts/staging/tSNE.dml -stats -nvargs > X=/home/iyounus/workspace/tsne_python/mnist2500_X.txt Y=Y_out.txt C=C_out.txt 16/08/01 16:52:59 INFO api.DMLScript: SystemML Statistics: Total elapsed time: 127.290 sec. Total compilation time: 0.396 sec. Total execution time: 126.894 sec. Number of compiled MR Jobs: 1. Number of executed MR Jobs: 0. Cache hits (Mem, WB, FS, HDFS): 223693/0/0/1. Cache writes (WB, FS, HDFS): 80352/0/2. Cache times (ACQr/m, RLS, EXP): 0.421/0.016/100.974/0.041 sec. HOP DAGs recompiled (PRED, SB): 0/0. HOP DAGs recompile time: 0.009 sec. Functions recompiled: 1. Functions recompile time: 0.038 sec. Total JIT compile time: 4.835 sec. Total JVM GC count: 312. Total JVM GC time: 1.226 sec. Heavy hitter instructions (name, time, count): -- 1) tsne 126.426 sec 1 -- 2) x2p 126.412 sec 1 -- 3) leftIndex 123.982 sec 32136 -- 4) exp 0.427 sec 8034 -- 5) MR-Job_CSV_REBLOCK 0.412 sec 1 -- 6) tsmm 0.308 sec 1 -- 7) rangeReIndex 0.242 sec 40170 -- 8) / 0.208 sec 24103 -- 9) + 0.172 sec 22840 -- 10) * 0.151 sec 16069 16/08/01 16:52:59 INFO api.DMLScript: END DML run 08/01/2016 16:52:59 {code} was: I'm running the tSNE script in standalone mode with a subset of MNIST data (2500 points). I ran this with and without `-exec singlenode`. Here are the stats: -> with singlenode flag {code} ./bin/systemml scripts/staging/tSNE.dml -stats -nvargs X=/home/iyounus/workspace/tsne_python/mnist2500_X.txt Y=Y_out.txt C=C_out.txt 16/08/01 16:46:54 INFO api.DMLScript: SystemML Statistics: Total elapsed time: 109.667 sec. Total compilation time: 0.407 sec. Total execution time: 109.260 sec. Number of compiled MR Jobs: 0. Number of executed MR Jobs: 0. Cache hits (Mem, WB, FS, HDFS): 223692/0/0/1. Cache writes (WB, FS, HDFS): 80351/0/2. Cache times (ACQr/m, RLS, EXP): 0.289/0.015/85.192/0.043 sec. HOP DAGs recompiled (PRED, SB): 0/0. HOP DAGs recompile time: 0.007 sec. Functions recompiled: 1. Functions recompile time: 0.039 sec. Total JIT compile time: 4.924 sec. Total JVM GC count: 312. Total JVM GC time: 1.12 sec. Heavy hitter instructions (name, time, count): -- 1) tsne 109.202 sec 1 -- 2) x2p 109.189 sec 1 -- 3) leftIndex 106.728 sec 32136 -- 4) tsmm 0.564 sec 1 -- 5) exp 0.376 sec 8034 -- 6) rangeReIndex 0.201 sec 40170 -- 7) / 0.183 sec 24103 -- 8) * 0.161 sec 16069 -- 9) + 0.144 sec 22840 -- 10) uak+ 0.106 sec 8036 16/08/01 16:46:54 INFO api.DMLScript: END DML run 08/01/2016 16:46:54 {code} -> without singlenode flag {code} > ./bin/systemml scripts/staging/tSNE.dml -stats -nvargs > X=/home/iyounus/workspace/tsne_python/mnist2500_X.txt Y=Y_out.txt C=C_out.txt 16/08/01 16:52:59 INFO api.DMLScript: SystemML Statistics: Total elapsed time: 127.290 sec. Total compilation time: 0.396 sec. Total execution time: 126.894 sec. Number of compiled MR Jobs: 1. Number of executed MR Jobs: 0. Cache hits (Mem, WB, FS, HDFS): 223693/0/0/1. Cache writes (WB, FS, HDFS): 80352/0/2. Cache times (ACQr/m, RLS, EXP): 0.421/0.016/100.974/0.041 sec. HOP DAGs recompiled (PRED, SB): 0/0. HOP DAGs recompile time: 0.009 sec. Functions recompiled: 1. Functions recompile time: 0.038 sec. Total JIT compile time: 4.835 sec. Total JVM GC count: 312. Total JVM GC time: 1.226 sec. Heavy hitter instructions (name, time, count): -- 1) tsne 126.426 sec 1 -- 2) x2p 126.412 sec 1 -- 3) leftIndex 123.982 sec 32136 -- 4) exp 0.427 sec 8034 -- 5) MR-Job_CSV_REBLOCK 0.412 sec 1 -- 6) tsmm 0.308 sec 1 -- 7) rangeReIndex 0.242 sec 40170 -- 8) / 0.208 sec 24103 -- 9) + 0.172 sec 22840 -- 10) * 0.151 sec 16069 16/08/01 16:52:59 INFO api.DMLScript: END DML run 08/01/2016 16:52:59 {code} > leftIndex and cache release extremely slow > ------------------------------------------ > > Key: SYSTEMML-843 > URL: https://issues.apache.org/jira/browse/SYSTEMML-843 > Project: SystemML > Issue Type: Bug > Reporter: Imran Younus > Attachments: tSNT.tar.gz > > > I'm running the tSNE script in standalone mode with a subset of MNIST data > (2500 points). I ran this with and without `-exec singlenode`. Here are the > stats: > (BTW, the same function implemented in python takes less than 10 sec!) > -> with singlenode flag > {code} > ./bin/systemml scripts/staging/tSNE.dml -stats -nvargs > X=/home/iyounus/workspace/tsne_python/mnist2500_X.txt Y=Y_out.txt C=C_out.txt > 16/08/01 16:46:54 INFO api.DMLScript: SystemML Statistics: > Total elapsed time: 109.667 sec. > Total compilation time: 0.407 sec. > Total execution time: 109.260 sec. > Number of compiled MR Jobs: 0. > Number of executed MR Jobs: 0. > Cache hits (Mem, WB, FS, HDFS): 223692/0/0/1. > Cache writes (WB, FS, HDFS): 80351/0/2. > Cache times (ACQr/m, RLS, EXP): 0.289/0.015/85.192/0.043 sec. > HOP DAGs recompiled (PRED, SB): 0/0. > HOP DAGs recompile time: 0.007 sec. > Functions recompiled: 1. > Functions recompile time: 0.039 sec. > Total JIT compile time: 4.924 sec. > Total JVM GC count: 312. > Total JVM GC time: 1.12 sec. > Heavy hitter instructions (name, time, count): > -- 1) tsne 109.202 sec 1 > -- 2) x2p 109.189 sec 1 > -- 3) leftIndex 106.728 sec 32136 > -- 4) tsmm 0.564 sec 1 > -- 5) exp 0.376 sec 8034 > -- 6) rangeReIndex 0.201 sec 40170 > -- 7) / 0.183 sec 24103 > -- 8) * 0.161 sec 16069 > -- 9) + 0.144 sec 22840 > -- 10) uak+ 0.106 sec 8036 > 16/08/01 16:46:54 INFO api.DMLScript: END DML run 08/01/2016 16:46:54 > {code} > -> without singlenode flag > {code} > > ./bin/systemml scripts/staging/tSNE.dml -stats -nvargs > > X=/home/iyounus/workspace/tsne_python/mnist2500_X.txt Y=Y_out.txt > > C=C_out.txt > 16/08/01 16:52:59 INFO api.DMLScript: SystemML Statistics: > Total elapsed time: 127.290 sec. > Total compilation time: 0.396 sec. > Total execution time: 126.894 sec. > Number of compiled MR Jobs: 1. > Number of executed MR Jobs: 0. > Cache hits (Mem, WB, FS, HDFS): 223693/0/0/1. > Cache writes (WB, FS, HDFS): 80352/0/2. > Cache times (ACQr/m, RLS, EXP): 0.421/0.016/100.974/0.041 sec. > HOP DAGs recompiled (PRED, SB): 0/0. > HOP DAGs recompile time: 0.009 sec. > Functions recompiled: 1. > Functions recompile time: 0.038 sec. > Total JIT compile time: 4.835 sec. > Total JVM GC count: 312. > Total JVM GC time: 1.226 sec. > Heavy hitter instructions (name, time, count): > -- 1) tsne 126.426 sec 1 > -- 2) x2p 126.412 sec 1 > -- 3) leftIndex 123.982 sec 32136 > -- 4) exp 0.427 sec 8034 > -- 5) MR-Job_CSV_REBLOCK 0.412 sec 1 > -- 6) tsmm 0.308 sec 1 > -- 7) rangeReIndex 0.242 sec 40170 > -- 8) / 0.208 sec 24103 > -- 9) + 0.172 sec 22840 > -- 10) * 0.151 sec 16069 > 16/08/01 16:52:59 INFO api.DMLScript: END DML run 08/01/2016 16:52:59 > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)