Diana Carroll created SPARK-1666:
------------------------------------

             Summary: document examples
                 Key: SPARK-1666
                 URL: https://issues.apache.org/jira/browse/SPARK-1666
             Project: Spark
          Issue Type: Improvement
          Components: Documentation
    Affects Versions: 0.9.1
            Reporter: Diana Carroll

It would be great if there were some guidance about what the example code shipped with Spark (under $SPARKHOME/examples and $SPARKHOME/python/examples) does and how to run it. Perhaps each example could start with a comment block explaining what the code accomplishes and what parameters it takes. Also, if there are sample datasets on which an example is designed to run, please point to those.

For example, kmeans.py takes a file argument but gives no hint about what sort of data the file should contain or what format that data should be in.
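
As a rough sketch of the kind of header being requested, here is a hypothetical example script with a comment block at the top. The script name, the usage line, the helper names, and the input format (one point per line, space-separated numbers) are all assumptions made for illustration; none of them is taken from the shipped kmeans.py.

```python
"""
Hypothetical header for an example script (illustration only).

What it does:
    Reads points from a text file and reports how many points and
    dimensions were found. A real example would go on to cluster them.

Usage:
    python example_header.py <file>

Parameters:
    <file>  Path to a plain-text input file.

Input format (assumed for this sketch):
    One point per line, coordinates as space-separated decimal
    numbers, for example "1.0 2.5 3.0". Blank lines are ignored.
"""
import sys

USAGE = "Usage: python example_header.py <file>\n"


def parse_point(line):
    """Parse one line of space-separated numbers into a list of floats."""
    return [float(token) for token in line.split()]


def load_points(lines):
    """Parse every non-blank line into a point."""
    return [parse_point(line) for line in lines if line.strip()]


def main(argv):
    """Return 0 on success, 1 if the file argument is missing."""
    if len(argv) != 2:
        sys.stderr.write(USAGE)
        return 1
    with open(argv[1]) as handle:
        points = load_points(handle)
    dims = len(points[0]) if points else 0
    print("Read %d points with %d dimensions" % (len(points), dims))
    return 0


if __name__ == "__main__":
    main(sys.argv)
```

With a header like this, a reader can tell what the script does, which argument it expects, and how the input file should be laid out before running anything.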