Github user asfgit closed the pull request at:
https://github.com/apache/spark/pull/6668
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enab
Github user Emaasit commented on the pull request:
https://github.com/apache/spark/pull/6668#issuecomment-118934985
Thanks @shivaram.
Github user shivaram commented on the pull request:
https://github.com/apache/spark/pull/6668#issuecomment-118934248
LGTM. Thanks @Emaasit for this PR. There are some outstanding comments, but
I'll fix them during the merge.
Github user rxin commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r32182922
--- Diff: examples/src/main/r/data-manipulation.R ---
@@ -0,0 +1,101 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+# co
Github user Emaasit commented on the pull request:
https://github.com/apache/spark/pull/6668#issuecomment-110889582
@shivaram Ok. Got you.
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/6668#issuecomment-110863369
Merged build finished. Test PASSed.
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/6668#issuecomment-110863356
[Test build #34619 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/34619/console)
for PR 6668 at commit
[`3a97867`](https://github.
Github user rxin commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r32143873
--- Diff: examples/src/main/r/data-manipulation.R ---
@@ -0,0 +1,101 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+# co
Github user davies commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r32140681
--- Diff: examples/src/main/r/data-manipulation.R ---
@@ -0,0 +1,101 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+#
Github user shivaram commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r32139076
--- Diff: examples/src/main/r/data-manipulation.R ---
@@ -0,0 +1,101 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+
Github user davies commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r32138906
--- Diff: examples/src/main/r/data-manipulation.R ---
@@ -0,0 +1,101 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+#
Github user shivaram commented on the pull request:
https://github.com/apache/spark/pull/6668#issuecomment-110829362
Thanks @Emaasit for the update. I just had a few more things that I ran
into while executing the example. Also you can verify some of these things by
just running the e
Github user shivaram commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r32138457
--- Diff: examples/src/main/r/data-manipulation.R ---
@@ -0,0 +1,101 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+
Github user shivaram commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r32138417
--- Diff: examples/src/main/r/data-manipulation.R ---
@@ -0,0 +1,101 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+
Github user shivaram commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r32137955
--- Diff: examples/src/main/r/data-manipulation.R ---
@@ -0,0 +1,101 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/6668#issuecomment-110825042
[Test build #34619 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/34619/consoleFull)
for PR 6668 at commit
[`3a97867`](https://gith
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/6668#issuecomment-110824211
Merged build triggered.
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/6668#issuecomment-110824242
Merged build started.
Github user shivaram commented on the pull request:
https://github.com/apache/spark/pull/6668#issuecomment-110823841
Jenkins, ok to test
Github user shivaram commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r32135725
--- Diff: examples/src/main/r/data-manipulation.R ---
@@ -0,0 +1,101 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+
Github user Emaasit commented on the pull request:
https://github.com/apache/spark/pull/6668#issuecomment-110702984
@shivaram To create a Spark DataFrame from a local data frame, I used a
subset of the data with fewer rows.
Github user shivaram commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r32040001
--- Diff: examples/src/main/r/data-manipulation.R ---
@@ -0,0 +1,101 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+
Github user shivaram commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r32039347
--- Diff: examples/src/main/r/data-manipulation.R ---
@@ -0,0 +1,101 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+
Github user shivaram commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r32039315
--- Diff: examples/src/main/r/data-manipulation.R ---
@@ -0,0 +1,101 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+
Github user shivaram commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r32038867
--- Diff: examples/src/main/r/data-manipulation.R ---
@@ -0,0 +1,101 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+
Github user shivaram commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r32038349
--- Diff: examples/src/main/r/data-manipulation.R ---
@@ -0,0 +1,101 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+
Github user shivaram commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r32038272
--- Diff: examples/src/main/r/data-manipulation.R ---
@@ -0,0 +1,101 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+
Github user Emaasit commented on the pull request:
https://github.com/apache/spark/pull/6668#issuecomment-110299745
@shivaram I fixed that. You will notice that read.csv() does not work well
with SSL (that is, HTTPS connections), so I changed the connection to HTTP.
Github user shivaram commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r31974828
--- Diff: examples/src/main/r/data-manipulation.R ---
@@ -0,0 +1,92 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+#
Github user Emaasit commented on the pull request:
https://github.com/apache/spark/pull/6668#issuecomment-110101395
@shivaram Yes, the base R function works. I have changed it.
Github user shivaram commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r31936684
--- Diff: examples/src/main/r/data-manipulation.R ---
@@ -0,0 +1,97 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+#
Github user Emaasit commented on the pull request:
https://github.com/apache/spark/pull/6668#issuecomment-109847221
@shivaram I wanted to provide two options for creating DataFrames. One
where R users can convert their local dataframes into DataFrames and the second
using the read.df(
Github user shivaram commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r31884319
--- Diff: examples/src/main/r/2-data-manipulation.R ---
@@ -0,0 +1,62 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
Github user shivaram commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r31884096
--- Diff: examples/src/main/r/2-data-manipulation.R ---
@@ -0,0 +1,62 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
Github user shivaram commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r31883785
--- Diff: examples/src/main/r/1-data.R ---
@@ -0,0 +1,41 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+# contributo
Github user shivaram commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r31883745
--- Diff: examples/src/main/r/0-getting-started.R ---
@@ -0,0 +1,25 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+#
Github user shivaram commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r31883752
--- Diff: examples/src/main/r/0-getting-started.R ---
@@ -0,0 +1,25 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+#
Github user Emaasit commented on the pull request:
https://github.com/apache/spark/pull/6668#issuecomment-109380729
@shivaram I have added the Apache license at the top of every file, removed
author name & data.
Github user shivaram commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r31832789
--- Diff: examples/src/main/r/0-getting-started.R ---
@@ -0,0 +1,23 @@
+#
+# Author: Daniel Emaasit (@emaasit)
+# Purpose: This script shows how
Github user shivaram commented on a diff in the pull request:
https://github.com/apache/spark/pull/6668#discussion_r31832695
--- Diff: examples/src/main/r/0-getting-started.R ---
@@ -0,0 +1,23 @@
+#
--- End diff --
We need to have the Apache License at the top of ev
Github user Emaasit commented on the pull request:
https://github.com/apache/spark/pull/6668#issuecomment-109213965
@shivaram Here is the new submission. I would like to submit a few more
examples on statistical analysis and machine learning on SparkR DataFrames.
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/6668#issuecomment-109211797
Can one of the admins verify this patch?
GitHub user Emaasit opened a pull request:
https://github.com/apache/spark/pull/6668
[SPARK-8124] [SPARKR] [WIP] Created more examples on SparkR DataFrames
Here are more examples on SparkR DataFrames including creating a Spark
Context and a SQL context, loading data and simple