[SPARK-1874][MLLIB] Clean up MLlib sample data

1. Added synthetic datasets for `MovieLensALS`, `LinearRegression`, 
`BinaryClassification`.
2. Embedded instructions in the help message of those example apps.

Per discussion with Matei on the JIRA page, new example data is under 
`data/mllib`.

Author: Xiangrui Meng <m...@databricks.com>

Closes #833 from mengxr/mllib-sample-data and squashes the following commits:

59f0a18 [Xiangrui Meng] add sample binary classification data
3c2f92f [Xiangrui Meng] add linear regression data
050f1ca [Xiangrui Meng] add a sample dataset for MovieLensALS example

(cherry picked from commit bcb9dce6f444a977c714117811bce0c54b417650)
Signed-off-by: Tathagata Das <tathagata.das1...@gmail.com>


Project: http://git-wip-us.apache.org/repos/asf/spark/repo
Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/1c6c8b5b
Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/1c6c8b5b
Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/1c6c8b5b

Branch: refs/heads/branch-1.0
Commit: 1c6c8b5bd2bdecfc5fdabd33ee8762fe25b0e69a
Parents: 78b6e6f
Author: Xiangrui Meng <m...@databricks.com>
Authored: Mon May 19 21:29:33 2014 -0700
Committer: Tathagata Das <tathagata.das1...@gmail.com>
Committed: Mon May 19 21:29:50 2014 -0700

----------------------------------------------------------------------
 .../mllib/sample_binary_classification_data.txt |  100 ++
 data/mllib/sample_linear_regression_data.txt    |  501 ++++++
 data/mllib/sample_movielens_data.txt            | 1501 ++++++++++++++++++
 .../examples/mllib/BinaryClassification.scala   |   12 +-
 .../spark/examples/mllib/LinearRegression.scala |   11 +-
 .../spark/examples/mllib/MovieLensALS.scala     |   15 +
 6 files changed, 2138 insertions(+), 2 deletions(-)
----------------------------------------------------------------------


Reply via email to