[ https://issues.apache.org/jira/browse/SYSTEMML-855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Niketan Pansare updated SYSTEMML-855: ------------------------------------- Description: As an example, this tutorial could have following sections: 1. Steps to start Python shell (or cloud service like datascientistworkbench) with SystemML support: wget https://raw.githubusercontent.com/apache/incubator-systemml/master/src/main/java/org/apache/sysml/api/python/SystemML.py wget https://sparktc.ibmcloud.com/repo/latest/SystemML.jar pyspark --master local[*] --driver-class-path SystemML.jar OR Use pip installer. 2. Give context for one of the algorithm: For example: Linear regression. We can borrow the technical detail from http://apache.github.io/incubator-systemml/algorithms-regression.html#description 3. Explain steps to download data we will use and how to implement Linear regression DS using embedded Python DSL: https://github.com/apache/incubator-systemml/pull/197 import numpy as np from sklearn import datasets diabetes = datasets.load_diabetes() diabetes_X = diabetes.data[:, np.newaxis, 2] diabetes_X_train = diabetes_X[:-20] diabetes_X_test = diabetes_X[-20:] diabetes_y_train = diabetes.target[:-20] diabetes_y_test = diabetes.target[-20:] 4. Explain how to use our algorithm instead: http://apache.github.io/incubator-systemml/algorithms-regression.html#examples 5. To explain tradeoffs of using NumPy or Scikit-Learn v/s SystemML's embedded DSL or SystemML's mllearn, increase the data size. For example: use twitter feed. was: As an example, this tutorial could have following sections: 1. Steps to start Python shell (or cloud service like datascientistworkbench) with SystemML support: wget https://raw.githubusercontent.com/apache/incubator-systemml/master/src/main/java/org/apache/sysml/api/python/SystemML.py wget https://sparktc.ibmcloud.com/repo/latest/SystemML.jar OR Use pip installer. 2. Give context for one of the algorithm: For example: Linear regression. We can borrow the technical detail from http://apache.github.io/incubator-systemml/algorithms-regression.html#description 3. Explain steps to download data we will use and how to implement Linear regression DS using embedded Python DSL: https://github.com/apache/incubator-systemml/pull/197 import numpy as np from sklearn import datasets diabetes = datasets.load_diabetes() diabetes_X = diabetes.data[:, np.newaxis, 2] diabetes_X_train = diabetes_X[:-20] diabetes_X_test = diabetes_X[-20:] diabetes_y_train = diabetes.target[:-20] diabetes_y_test = diabetes.target[-20:] 4. Explain how to use our algorithm instead: http://apache.github.io/incubator-systemml/algorithms-regression.html#examples 5. To explain tradeoffs of using NumPy or Scikit-Learn v/s SystemML's embedded DSL or SystemML's mllearn, increase the data size. For example: use twitter feed. > Add a "Get Started" tutorial for Python users > --------------------------------------------- > > Key: SYSTEMML-855 > URL: https://issues.apache.org/jira/browse/SYSTEMML-855 > Project: SystemML > Issue Type: Task > Reporter: Niketan Pansare > > As an example, this tutorial could have following sections: > 1. Steps to start Python shell (or cloud service like datascientistworkbench) > with SystemML support: > wget > https://raw.githubusercontent.com/apache/incubator-systemml/master/src/main/java/org/apache/sysml/api/python/SystemML.py > wget https://sparktc.ibmcloud.com/repo/latest/SystemML.jar > pyspark --master local[*] --driver-class-path SystemML.jar > OR > Use pip installer. > 2. Give context for one of the algorithm: For example: Linear regression. We > can borrow the technical detail from > http://apache.github.io/incubator-systemml/algorithms-regression.html#description > 3. Explain steps to download data we will use and how to implement Linear > regression DS using embedded Python DSL: > https://github.com/apache/incubator-systemml/pull/197 > import numpy as np > from sklearn import datasets > diabetes = datasets.load_diabetes() > diabetes_X = diabetes.data[:, np.newaxis, 2] > diabetes_X_train = diabetes_X[:-20] > diabetes_X_test = diabetes_X[-20:] > diabetes_y_train = diabetes.target[:-20] > diabetes_y_test = diabetes.target[-20:] > 4. Explain how to use our algorithm instead: > http://apache.github.io/incubator-systemml/algorithms-regression.html#examples > 5. To explain tradeoffs of using NumPy or Scikit-Learn v/s SystemML's > embedded DSL or SystemML's mllearn, increase the data size. For example: use > twitter feed. -- This message was sent by Atlassian JIRA (v6.3.4#6332)