GitHub user CodingCat opened a pull request:
https://github.com/apache/incubator-spark/pull/636
[SPARK-1102] Create a saveAsNewAPIHadoopDataset method
Create a saveAsNewAPIHadoopDataset method
By @mateiz: "Right now RDDs can only be saved as files using the new Hadoop
API, not as "datasets" with no filename and just a JobConf. See
http://codeforhire.com/2014/02/18/using-spark-with-mongodb/ for an example of
how you have to give a bogus filename. For the old Hadoop API, we have
saveAsHadoopDataset."
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/CodingCat/incubator-spark SPARK-1102
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/incubator-spark/pull/636.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #636
----
commit fac89212f5b964eabfb316256daef774dffc7a5f
Author: CodingCat <[email protected]>
Date: 2014-02-23T07:18:36Z
Create a saveAsNewAPIHadoopDataset method
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. To do so, please top-post your response.
If your project does not have this feature enabled and wishes so, or if the
feature is enabled but not working, please contact infrastructure at
[email protected] or file a JIRA ticket with INFRA.
---