Diana Carroll created SPARK-1666:
------------------------------------

             Summary: document examples
                 Key: SPARK-1666
                 URL: https://issues.apache.org/jira/browse/SPARK-1666
             Project: Spark
          Issue Type: Improvement
          Components: Documentation
    Affects Versions: 0.9.1
            Reporter: Diana Carroll


It would be great to have some guidance on what the example code shipped 
with Spark (under $SPARK_HOME/examples and $SPARK_HOME/python/examples) does 
and how to run it.  A comment block at the top of each file could explain what 
the code accomplishes and what parameters it takes.  If an example is designed 
to run on a sample dataset, the comment should also point to that dataset. 
 

As an example, look at kmeans.py, which takes a file argument but gives no hint 
about what sort of data the file holds or what format that data should be in.
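
To illustrate the kind of comment block being requested, here is a rough 
sketch in Python.  It is not taken from the shipped kmeans.py: the argument 
list, the input format (space-separated floats, one point per line), and the 
helper names parse_point and parse_args are all assumptions made up for this 
illustration.

```python
"""
Hypothetical header for a k-means example script (illustration only).

What it does: clusters the points in an input file into k groups.

Usage: kmeans.py <file> <k> <convergeDist>
  file          text file with one point per line
  k             number of clusters (positive integer)
  convergeDist  stop when the centers move less than this distance

Input format (assumed): space-separated floats, for example
  0.0 0.0 0.0
  9.0 9.1 8.9
"""

# Assumed usage string; the real script may take different arguments.
USAGE = "Usage: kmeans.py <file> <k> <convergeDist>"


def parse_point(line):
    """Turn one line of whitespace-separated numbers into a list of floats."""
    return [float(x) for x in line.split()]


def parse_args(argv):
    """Return (path, k, converge_dist) from an argv-style list.

    Raises ValueError carrying the usage text when the argument count is wrong.
    """
    if len(argv) != 4:
        raise ValueError(USAGE)
    return argv[1], int(argv[2]), float(argv[3])
```

With a header like this, a reader can tell from the file alone what to pass on 
the command line and what the input data should look like.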



--
This message was sent by Atlassian JIRA
(v6.2#6252)
