[ 
https://issues.apache.org/jira/browse/SYSTEMML-1451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15998601#comment-15998601
 ] 

Krishna Kalyan commented on SYSTEMML-1451:
------------------------------------------

Commands I run after following the instructions from 
[here](https://docs.google.com/document/d/1cwwH9kHibD-PxBvOlkqOGbBrqzWsCFr5pf3iQb18BWM)
cd /home/krishna/open-source/incubator-systemml/scripts/perftest
export JAVA_OPTS="-Xmx2G -Xms2g -Xmn1g"
./genBinomialData.sh my_test_data SPARK

Jave Version : 1.8.0
Spark Version : spark-2.1.0-bin-hadoop2.7
Hadoop Version : hadoop-2.8

log1
https://gist.github.com/krishnakalyan3/2ce475a33165ee2f76457ab092c36fdb

hs_err_pid25327.log
https://gist.github.com/krishnakalyan3/fdc85e568795c7a7a62726f8c0b5e83f





> Automate performance testing and reporting
> ------------------------------------------
>
>                 Key: SYSTEMML-1451
>                 URL: https://issues.apache.org/jira/browse/SYSTEMML-1451
>             Project: SystemML
>          Issue Type: Improvement
>          Components: Infrastructure, Test
>            Reporter: Nakul Jindal
>              Labels: gsoc2017, mentor, performance, reporting, testing
>
> As part of a release (and in general), performance tests are run for SystemML.
> Currently, running and reporting on these performance tests are a manual 
> process. There are helper scripts, but largely the process is manual.
> The aim of this GSoC 2017 project is to automate performance testing and its 
> reporting.
> These are the tasks that this entails
> 1. Automate running of the performance tests, including generation of test 
> data
> 2. Detect errors and report if any
> 3. Record performance benchmarking information
> 4. Automatically compare this performance to previous version to check for 
> performance regressions
> 5. Automatically compare to Spark MLLib, R?, Julia?
> 6. Prepare report with all the information about failed jobs, performance 
> information, perf info against other comparable projects/algorithms 
> (plotted/in plain text in CSV, PDF or other common format)
> 7. Create scripts to automatically run this process on a cloud provider that 
> spins up machines, runs the test, saves the reports and spins down the 
> machines.
> 8. Create a web application to do this interactively without dropping down 
> into a shell.
> As part of this project, the student will need to know scripting (in Bash, 
> Python, etc). It may also involve changing error reporting and performance 
> reporting code in SystemML. 
> Rating - Medium (for the amount of work)
> Mentor - [~nakul02] (Other co-mentors will join in)



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to