http://git-wip-us.apache.org/repos/asf/spark-website/blob/3a634959/site/docs/2.1.3/api/R/filter.html ---------------------------------------------------------------------- diff --git a/site/docs/2.1.3/api/R/filter.html b/site/docs/2.1.3/api/R/filter.html new file mode 100644 index 0000000..322d1e3 --- /dev/null +++ b/site/docs/2.1.3/api/R/filter.html @@ -0,0 +1,121 @@ +<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: Filter</title> +<meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> +<link rel="stylesheet" type="text/css" href="R.css" /> + +<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/styles/github.min.css"> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/highlight.min.js"></script> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/languages/r.min.js"></script> +<script>hljs.initHighlightingOnLoad();</script> +</head><body> + +<table width="100%" summary="page for filter {SparkR}"><tr><td>filter {SparkR}</td><td style="text-align: right;">R Documentation</td></tr></table> + +<h2>Filter</h2> + +<h3>Description</h3> + +<p>Filter the rows of a SparkDataFrame according to a given condition. +</p> + + +<h3>Usage</h3> + +<pre> +filter(x, condition) + +where(x, condition) + +## S4 method for signature 'SparkDataFrame,characterOrColumn' +filter(x, condition) + +## S4 method for signature 'SparkDataFrame,characterOrColumn' +where(x, condition) +</pre> + + +<h3>Arguments</h3> + +<table summary="R argblock"> +<tr valign="top"><td><code>x</code></td> +<td> +<p>A SparkDataFrame to be sorted.</p> +</td></tr> +<tr valign="top"><td><code>condition</code></td> +<td> +<p>The condition to filter on. This may either be a Column expression +or a string containing a SQL statement</p> +</td></tr> +</table> + + +<h3>Value</h3> + +<p>A SparkDataFrame containing only the rows that meet the condition. +</p> + + +<h3>Note</h3> + +<p>filter since 1.4.0 +</p> +<p>where since 1.4.0 +</p> + + +<h3>See Also</h3> + +<p>Other SparkDataFrame functions: <code><a href="SparkDataFrame.html">SparkDataFrame-class</a></code>, +<code><a href="summarize.html">agg</a></code>, <code><a href="arrange.html">arrange</a></code>, +<code><a href="as.data.frame.html">as.data.frame</a></code>, <code><a href="attach.html">attach</a></code>, +<code><a href="cache.html">cache</a></code>, <code><a href="coalesce.html">coalesce</a></code>, +<code><a href="collect.html">collect</a></code>, <code><a href="columns.html">colnames</a></code>, +<code><a href="coltypes.html">coltypes</a></code>, +<code><a href="createOrReplaceTempView.html">createOrReplaceTempView</a></code>, +<code><a href="crossJoin.html">crossJoin</a></code>, <code><a href="dapplyCollect.html">dapplyCollect</a></code>, +<code><a href="dapply.html">dapply</a></code>, <code><a href="summary.html">describe</a></code>, +<code><a href="dim.html">dim</a></code>, <code><a href="distinct.html">distinct</a></code>, +<code><a href="dropDuplicates.html">dropDuplicates</a></code>, <code><a href="nafunctions.html">dropna</a></code>, +<code><a href="drop.html">drop</a></code>, <code><a href="dtypes.html">dtypes</a></code>, +<code><a href="except.html">except</a></code>, <code><a href="explain.html">explain</a></code>, +<code><a href="first.html">first</a></code>, <code><a href="gapplyCollect.html">gapplyCollect</a></code>, +<code><a href="gapply.html">gapply</a></code>, <code><a href="getNumPartitions.html">getNumPartitions</a></code>, +<code><a href="groupBy.html">group_by</a></code>, <code><a href="head.html">head</a></code>, +<code><a href="histogram.html">histogram</a></code>, <code><a href="insertInto.html">insertInto</a></code>, +<code><a href="intersect.html">intersect</a></code>, <code><a href="isLocal.html">isLocal</a></code>, +<code><a href="join.html">join</a></code>, <code><a href="limit.html">limit</a></code>, +<code><a href="merge.html">merge</a></code>, <code><a href="mutate.html">mutate</a></code>, +<code><a href="ncol.html">ncol</a></code>, <code><a href="nrow.html">nrow</a></code>, +<code><a href="persist.html">persist</a></code>, <code><a href="printSchema.html">printSchema</a></code>, +<code><a href="randomSplit.html">randomSplit</a></code>, <code><a href="rbind.html">rbind</a></code>, +<code><a href="registerTempTable-deprecated.html">registerTempTable</a></code>, <code><a href="rename.html">rename</a></code>, +<code><a href="repartition.html">repartition</a></code>, <code><a href="sample.html">sample</a></code>, +<code><a href="saveAsTable.html">saveAsTable</a></code>, <code><a href="schema.html">schema</a></code>, +<code><a href="selectExpr.html">selectExpr</a></code>, <code><a href="select.html">select</a></code>, +<code><a href="showDF.html">showDF</a></code>, <code><a href="show.html">show</a></code>, +<code><a href="storageLevel.html">storageLevel</a></code>, <code><a href="str.html">str</a></code>, +<code><a href="subset.html">subset</a></code>, <code><a href="take.html">take</a></code>, +<code><a href="union.html">union</a></code>, <code><a href="unpersist.html">unpersist</a></code>, +<code><a href="withColumn.html">withColumn</a></code>, <code><a href="with.html">with</a></code>, +<code><a href="write.df.html">write.df</a></code>, <code><a href="write.jdbc.html">write.jdbc</a></code>, +<code><a href="write.json.html">write.json</a></code>, <code><a href="write.orc.html">write.orc</a></code>, +<code><a href="write.parquet.html">write.parquet</a></code>, <code><a href="write.text.html">write.text</a></code> +</p> +<p>Other subsetting functions: <code><a href="select.html">select</a></code>, +<code><a href="subset.html">subset</a></code> +</p> + + +<h3>Examples</h3> + +<pre><code class="r">## Not run: +##D sparkR.session() +##D path <- "path/to/file.json" +##D df <- read.json(path) +##D filter(df, "col1 > 0") +##D filter(df, df$col2 != "abcdefg") +## End(Not run) +</code></pre> + + +<hr /><div style="text-align: center;">[Package <em>SparkR</em> version 2.1.3 <a href="00Index.html">Index</a>]</div> +</body></html>
http://git-wip-us.apache.org/repos/asf/spark-website/blob/3a634959/site/docs/2.1.3/api/R/first.html ---------------------------------------------------------------------- diff --git a/site/docs/2.1.3/api/R/first.html b/site/docs/2.1.3/api/R/first.html new file mode 100644 index 0000000..0614546 --- /dev/null +++ b/site/docs/2.1.3/api/R/first.html @@ -0,0 +1,136 @@ +<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: Return the first row of a SparkDataFrame</title> +<meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> +<link rel="stylesheet" type="text/css" href="R.css" /> + +<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/styles/github.min.css"> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/highlight.min.js"></script> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/languages/r.min.js"></script> +<script>hljs.initHighlightingOnLoad();</script> +</head><body> + +<table width="100%" summary="page for first {SparkR}"><tr><td>first {SparkR}</td><td style="text-align: right;">R Documentation</td></tr></table> + +<h2>Return the first row of a SparkDataFrame</h2> + +<h3>Description</h3> + +<p>Return the first row of a SparkDataFrame +</p> +<p>Aggregate function: returns the first value in a group. +</p> + + +<h3>Usage</h3> + +<pre> +first(x, ...) + +## S4 method for signature 'SparkDataFrame' +first(x) + +## S4 method for signature 'characterOrColumn' +first(x, na.rm = FALSE) +</pre> + + +<h3>Arguments</h3> + +<table summary="R argblock"> +<tr valign="top"><td><code>x</code></td> +<td> +<p>a SparkDataFrame or a column used in aggregation function.</p> +</td></tr> +<tr valign="top"><td><code>...</code></td> +<td> +<p>further arguments to be passed to or from other methods.</p> +</td></tr> +<tr valign="top"><td><code>na.rm</code></td> +<td> +<p>a logical value indicating whether NA values should be stripped +before the computation proceeds.</p> +</td></tr> +</table> + + +<h3>Details</h3> + +<p>The function by default returns the first values it sees. It will return the first non-missing +value it sees when na.rm is set to true. If all values are missing, then NA is returned. +</p> + + +<h3>Note</h3> + +<p>first(SparkDataFrame) since 1.4.0 +</p> +<p>first(characterOrColumn) since 1.4.0 +</p> + + +<h3>See Also</h3> + +<p>Other SparkDataFrame functions: <code><a href="SparkDataFrame.html">SparkDataFrame-class</a></code>, +<code><a href="summarize.html">agg</a></code>, <code><a href="arrange.html">arrange</a></code>, +<code><a href="as.data.frame.html">as.data.frame</a></code>, <code><a href="attach.html">attach</a></code>, +<code><a href="cache.html">cache</a></code>, <code><a href="coalesce.html">coalesce</a></code>, +<code><a href="collect.html">collect</a></code>, <code><a href="columns.html">colnames</a></code>, +<code><a href="coltypes.html">coltypes</a></code>, +<code><a href="createOrReplaceTempView.html">createOrReplaceTempView</a></code>, +<code><a href="crossJoin.html">crossJoin</a></code>, <code><a href="dapplyCollect.html">dapplyCollect</a></code>, +<code><a href="dapply.html">dapply</a></code>, <code><a href="summary.html">describe</a></code>, +<code><a href="dim.html">dim</a></code>, <code><a href="distinct.html">distinct</a></code>, +<code><a href="dropDuplicates.html">dropDuplicates</a></code>, <code><a href="nafunctions.html">dropna</a></code>, +<code><a href="drop.html">drop</a></code>, <code><a href="dtypes.html">dtypes</a></code>, +<code><a href="except.html">except</a></code>, <code><a href="explain.html">explain</a></code>, +<code><a href="filter.html">filter</a></code>, <code><a href="gapplyCollect.html">gapplyCollect</a></code>, +<code><a href="gapply.html">gapply</a></code>, <code><a href="getNumPartitions.html">getNumPartitions</a></code>, +<code><a href="groupBy.html">group_by</a></code>, <code><a href="head.html">head</a></code>, +<code><a href="histogram.html">histogram</a></code>, <code><a href="insertInto.html">insertInto</a></code>, +<code><a href="intersect.html">intersect</a></code>, <code><a href="isLocal.html">isLocal</a></code>, +<code><a href="join.html">join</a></code>, <code><a href="limit.html">limit</a></code>, +<code><a href="merge.html">merge</a></code>, <code><a href="mutate.html">mutate</a></code>, +<code><a href="ncol.html">ncol</a></code>, <code><a href="nrow.html">nrow</a></code>, +<code><a href="persist.html">persist</a></code>, <code><a href="printSchema.html">printSchema</a></code>, +<code><a href="randomSplit.html">randomSplit</a></code>, <code><a href="rbind.html">rbind</a></code>, +<code><a href="registerTempTable-deprecated.html">registerTempTable</a></code>, <code><a href="rename.html">rename</a></code>, +<code><a href="repartition.html">repartition</a></code>, <code><a href="sample.html">sample</a></code>, +<code><a href="saveAsTable.html">saveAsTable</a></code>, <code><a href="schema.html">schema</a></code>, +<code><a href="selectExpr.html">selectExpr</a></code>, <code><a href="select.html">select</a></code>, +<code><a href="showDF.html">showDF</a></code>, <code><a href="show.html">show</a></code>, +<code><a href="storageLevel.html">storageLevel</a></code>, <code><a href="str.html">str</a></code>, +<code><a href="subset.html">subset</a></code>, <code><a href="take.html">take</a></code>, +<code><a href="union.html">union</a></code>, <code><a href="unpersist.html">unpersist</a></code>, +<code><a href="withColumn.html">withColumn</a></code>, <code><a href="with.html">with</a></code>, +<code><a href="write.df.html">write.df</a></code>, <code><a href="write.jdbc.html">write.jdbc</a></code>, +<code><a href="write.json.html">write.json</a></code>, <code><a href="write.orc.html">write.orc</a></code>, +<code><a href="write.parquet.html">write.parquet</a></code>, <code><a href="write.text.html">write.text</a></code> +</p> +<p>Other agg_funcs: <code><a href="summarize.html">agg</a></code>, <code><a href="avg.html">avg</a></code>, +<code><a href="countDistinct.html">countDistinct</a></code>, <code><a href="count.html">count</a></code>, +<code><a href="kurtosis.html">kurtosis</a></code>, <code><a href="last.html">last</a></code>, +<code><a href="max.html">max</a></code>, <code><a href="mean.html">mean</a></code>, <code><a href="min.html">min</a></code>, +<code><a href="sd.html">sd</a></code>, <code><a href="skewness.html">skewness</a></code>, +<code><a href="stddev_pop.html">stddev_pop</a></code>, <code><a href="stddev_samp.html">stddev_samp</a></code>, +<code><a href="sumDistinct.html">sumDistinct</a></code>, <code><a href="sum.html">sum</a></code>, +<code><a href="var_pop.html">var_pop</a></code>, <code><a href="var_samp.html">var_samp</a></code>, +<code><a href="var.html">var</a></code> +</p> + + +<h3>Examples</h3> + +<pre><code class="r">## Not run: +##D sparkR.session() +##D path <- "path/to/file.json" +##D df <- read.json(path) +##D first(df) +## End(Not run) +## Not run: +##D first(df$c) +##D first(df$c, TRUE) +## End(Not run) +</code></pre> + + +<hr /><div style="text-align: center;">[Package <em>SparkR</em> version 2.1.3 <a href="00Index.html">Index</a>]</div> +</body></html> http://git-wip-us.apache.org/repos/asf/spark-website/blob/3a634959/site/docs/2.1.3/api/R/fitted.html ---------------------------------------------------------------------- diff --git a/site/docs/2.1.3/api/R/fitted.html b/site/docs/2.1.3/api/R/fitted.html new file mode 100644 index 0000000..d787ad0 --- /dev/null +++ b/site/docs/2.1.3/api/R/fitted.html @@ -0,0 +1,74 @@ +<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: Get fitted result from a k-means model</title> +<meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> +<link rel="stylesheet" type="text/css" href="R.css" /> + +<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/styles/github.min.css"> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/highlight.min.js"></script> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/languages/r.min.js"></script> +<script>hljs.initHighlightingOnLoad();</script> +</head><body> + +<table width="100%" summary="page for fitted {SparkR}"><tr><td>fitted {SparkR}</td><td style="text-align: right;">R Documentation</td></tr></table> + +<h2>Get fitted result from a k-means model</h2> + +<h3>Description</h3> + +<p>Get fitted result from a k-means model, similarly to R's fitted(). +Note: A saved-loaded model does not support this method. +</p> + + +<h3>Usage</h3> + +<pre> +fitted(object, ...) + +## S4 method for signature 'KMeansModel' +fitted(object, method = c("centers", "classes")) +</pre> + + +<h3>Arguments</h3> + +<table summary="R argblock"> +<tr valign="top"><td><code>object</code></td> +<td> +<p>a fitted k-means model.</p> +</td></tr> +<tr valign="top"><td><code>...</code></td> +<td> +<p>additional argument(s) passed to the method.</p> +</td></tr> +<tr valign="top"><td><code>method</code></td> +<td> +<p>type of fitted results, <code>"centers"</code> for cluster centers +or <code>"classes"</code> for assigned classes.</p> +</td></tr> +</table> + + +<h3>Value</h3> + +<p><code>fitted</code> returns a SparkDataFrame containing fitted values. +</p> + + +<h3>Note</h3> + +<p>fitted since 2.0.0 +</p> + + +<h3>Examples</h3> + +<pre><code class="r">## Not run: +##D model <- spark.kmeans(trainingData, ~ ., 2) +##D fitted.model <- fitted(model) +##D showDF(fitted.model) +## End(Not run) +</code></pre> + + +<hr /><div style="text-align: center;">[Package <em>SparkR</em> version 2.1.3 <a href="00Index.html">Index</a>]</div> +</body></html> http://git-wip-us.apache.org/repos/asf/spark-website/blob/3a634959/site/docs/2.1.3/api/R/floor.html ---------------------------------------------------------------------- diff --git a/site/docs/2.1.3/api/R/floor.html b/site/docs/2.1.3/api/R/floor.html new file mode 100644 index 0000000..c1fd07d --- /dev/null +++ b/site/docs/2.1.3/api/R/floor.html @@ -0,0 +1,77 @@ +<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: floor</title> +<meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> +<link rel="stylesheet" type="text/css" href="R.css" /> + +<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/styles/github.min.css"> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/highlight.min.js"></script> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/languages/r.min.js"></script> +<script>hljs.initHighlightingOnLoad();</script> +</head><body> + +<table width="100%" summary="page for floor {SparkR}"><tr><td>floor {SparkR}</td><td style="text-align: right;">R Documentation</td></tr></table> + +<h2>floor</h2> + +<h3>Description</h3> + +<p>Computes the floor of the given value. +</p> + + +<h3>Usage</h3> + +<pre> +## S4 method for signature 'Column' +floor(x) +</pre> + + +<h3>Arguments</h3> + +<table summary="R argblock"> +<tr valign="top"><td><code>x</code></td> +<td> +<p>Column to compute on.</p> +</td></tr> +</table> + + +<h3>Note</h3> + +<p>floor since 1.5.0 +</p> + + +<h3>See Also</h3> + +<p>Other math_funcs: <code><a href="acos.html">acos</a></code>, <code><a href="asin.html">asin</a></code>, +<code><a href="atan2.html">atan2</a></code>, <code><a href="atan.html">atan</a></code>, +<code><a href="bin.html">bin</a></code>, <code><a href="bround.html">bround</a></code>, +<code><a href="cbrt.html">cbrt</a></code>, <code><a href="ceil.html">ceil</a></code>, +<code><a href="conv.html">conv</a></code>, <code><a href="corr.html">corr</a></code>, +<code><a href="cosh.html">cosh</a></code>, <code><a href="cos.html">cos</a></code>, +<code><a href="covar_pop.html">covar_pop</a></code>, <code><a href="cov.html">cov</a></code>, +<code><a href="expm1.html">expm1</a></code>, <code><a href="exp.html">exp</a></code>, +<code><a href="factorial.html">factorial</a></code>, <code><a href="hex.html">hex</a></code>, +<code><a href="hypot.html">hypot</a></code>, <code><a href="log10.html">log10</a></code>, +<code><a href="log1p.html">log1p</a></code>, <code><a href="log2.html">log2</a></code>, +<code><a href="log.html">log</a></code>, <code><a href="pmod.html">pmod</a></code>, +<code><a href="rint.html">rint</a></code>, <code><a href="round.html">round</a></code>, +<code><a href="shiftLeft.html">shiftLeft</a></code>, +<code><a href="shiftRightUnsigned.html">shiftRightUnsigned</a></code>, +<code><a href="shiftRight.html">shiftRight</a></code>, <code><a href="sign.html">signum</a></code>, +<code><a href="sinh.html">sinh</a></code>, <code><a href="sin.html">sin</a></code>, +<code><a href="sqrt.html">sqrt</a></code>, <code><a href="tanh.html">tanh</a></code>, +<code><a href="tan.html">tan</a></code>, <code><a href="toDegrees.html">toDegrees</a></code>, +<code><a href="toRadians.html">toRadians</a></code>, <code><a href="unhex.html">unhex</a></code> +</p> + + +<h3>Examples</h3> + +<pre><code class="r">## Not run: floor(df$c) +</code></pre> + + +<hr /><div style="text-align: center;">[Package <em>SparkR</em> version 2.1.3 <a href="00Index.html">Index</a>]</div> +</body></html> http://git-wip-us.apache.org/repos/asf/spark-website/blob/3a634959/site/docs/2.1.3/api/R/format_number.html ---------------------------------------------------------------------- diff --git a/site/docs/2.1.3/api/R/format_number.html b/site/docs/2.1.3/api/R/format_number.html new file mode 100644 index 0000000..805b205 --- /dev/null +++ b/site/docs/2.1.3/api/R/format_number.html @@ -0,0 +1,85 @@ +<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: format_number</title> +<meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> +<link rel="stylesheet" type="text/css" href="R.css" /> + +<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/styles/github.min.css"> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/highlight.min.js"></script> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/languages/r.min.js"></script> +<script>hljs.initHighlightingOnLoad();</script> +</head><body> + +<table width="100%" summary="page for format_number {SparkR}"><tr><td>format_number {SparkR}</td><td style="text-align: right;">R Documentation</td></tr></table> + +<h2>format_number</h2> + +<h3>Description</h3> + +<p>Formats numeric column y to a format like '#,###,###.##', rounded to x decimal places, +and returns the result as a string column. +</p> + + +<h3>Usage</h3> + +<pre> +format_number(y, x) + +## S4 method for signature 'Column,numeric' +format_number(y, x) +</pre> + + +<h3>Arguments</h3> + +<table summary="R argblock"> +<tr valign="top"><td><code>y</code></td> +<td> +<p>column to format</p> +</td></tr> +<tr valign="top"><td><code>x</code></td> +<td> +<p>number of decimal place to format to</p> +</td></tr> +</table> + + +<h3>Details</h3> + +<p>If x is 0, the result has no decimal point or fractional part. +If x < 0, the result will be null. +</p> + + +<h3>Note</h3> + +<p>format_number since 1.5.0 +</p> + + +<h3>See Also</h3> + +<p>Other string_funcs: <code><a href="ascii.html">ascii</a></code>, +<code><a href="base64.html">base64</a></code>, <code><a href="concat_ws.html">concat_ws</a></code>, +<code><a href="concat.html">concat</a></code>, <code><a href="decode.html">decode</a></code>, +<code><a href="encode.html">encode</a></code>, <code><a href="format_string.html">format_string</a></code>, +<code><a href="initcap.html">initcap</a></code>, <code><a href="instr.html">instr</a></code>, +<code><a href="length.html">length</a></code>, <code><a href="levenshtein.html">levenshtein</a></code>, +<code><a href="locate.html">locate</a></code>, <code><a href="lower.html">lower</a></code>, +<code><a href="lpad.html">lpad</a></code>, <code><a href="ltrim.html">ltrim</a></code>, +<code><a href="regexp_extract.html">regexp_extract</a></code>, +<code><a href="regexp_replace.html">regexp_replace</a></code>, <code><a href="reverse.html">reverse</a></code>, +<code><a href="rpad.html">rpad</a></code>, <code><a href="rtrim.html">rtrim</a></code>, +<code><a href="soundex.html">soundex</a></code>, <code><a href="substring_index.html">substring_index</a></code>, +<code><a href="translate.html">translate</a></code>, <code><a href="trim.html">trim</a></code>, +<code><a href="unbase64.html">unbase64</a></code>, <code><a href="upper.html">upper</a></code> +</p> + + +<h3>Examples</h3> + +<pre><code class="r">## Not run: format_number(df$n, 4) +</code></pre> + + +<hr /><div style="text-align: center;">[Package <em>SparkR</em> version 2.1.3 <a href="00Index.html">Index</a>]</div> +</body></html> http://git-wip-us.apache.org/repos/asf/spark-website/blob/3a634959/site/docs/2.1.3/api/R/format_string.html ---------------------------------------------------------------------- diff --git a/site/docs/2.1.3/api/R/format_string.html b/site/docs/2.1.3/api/R/format_string.html new file mode 100644 index 0000000..50722a8 --- /dev/null +++ b/site/docs/2.1.3/api/R/format_string.html @@ -0,0 +1,81 @@ +<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: format_string</title> +<meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> +<link rel="stylesheet" type="text/css" href="R.css" /> + +<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/styles/github.min.css"> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/highlight.min.js"></script> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/languages/r.min.js"></script> +<script>hljs.initHighlightingOnLoad();</script> +</head><body> + +<table width="100%" summary="page for format_string {SparkR}"><tr><td>format_string {SparkR}</td><td style="text-align: right;">R Documentation</td></tr></table> + +<h2>format_string</h2> + +<h3>Description</h3> + +<p>Formats the arguments in printf-style and returns the result as a string column. +</p> + + +<h3>Usage</h3> + +<pre> +format_string(format, x, ...) + +## S4 method for signature 'character,Column' +format_string(format, x, ...) +</pre> + + +<h3>Arguments</h3> + +<table summary="R argblock"> +<tr valign="top"><td><code>format</code></td> +<td> +<p>a character object of format strings.</p> +</td></tr> +<tr valign="top"><td><code>x</code></td> +<td> +<p>a Column.</p> +</td></tr> +<tr valign="top"><td><code>...</code></td> +<td> +<p>additional Column(s).</p> +</td></tr> +</table> + + +<h3>Note</h3> + +<p>format_string since 1.5.0 +</p> + + +<h3>See Also</h3> + +<p>Other string_funcs: <code><a href="ascii.html">ascii</a></code>, +<code><a href="base64.html">base64</a></code>, <code><a href="concat_ws.html">concat_ws</a></code>, +<code><a href="concat.html">concat</a></code>, <code><a href="decode.html">decode</a></code>, +<code><a href="encode.html">encode</a></code>, <code><a href="format_number.html">format_number</a></code>, +<code><a href="initcap.html">initcap</a></code>, <code><a href="instr.html">instr</a></code>, +<code><a href="length.html">length</a></code>, <code><a href="levenshtein.html">levenshtein</a></code>, +<code><a href="locate.html">locate</a></code>, <code><a href="lower.html">lower</a></code>, +<code><a href="lpad.html">lpad</a></code>, <code><a href="ltrim.html">ltrim</a></code>, +<code><a href="regexp_extract.html">regexp_extract</a></code>, +<code><a href="regexp_replace.html">regexp_replace</a></code>, <code><a href="reverse.html">reverse</a></code>, +<code><a href="rpad.html">rpad</a></code>, <code><a href="rtrim.html">rtrim</a></code>, +<code><a href="soundex.html">soundex</a></code>, <code><a href="substring_index.html">substring_index</a></code>, +<code><a href="translate.html">translate</a></code>, <code><a href="trim.html">trim</a></code>, +<code><a href="unbase64.html">unbase64</a></code>, <code><a href="upper.html">upper</a></code> +</p> + + +<h3>Examples</h3> + +<pre><code class="r">## Not run: format_string('%d %s', df$a, df$b) +</code></pre> + + +<hr /><div style="text-align: center;">[Package <em>SparkR</em> version 2.1.3 <a href="00Index.html">Index</a>]</div> +</body></html> http://git-wip-us.apache.org/repos/asf/spark-website/blob/3a634959/site/docs/2.1.3/api/R/freqItems.html ---------------------------------------------------------------------- diff --git a/site/docs/2.1.3/api/R/freqItems.html b/site/docs/2.1.3/api/R/freqItems.html new file mode 100644 index 0000000..4e8e90b --- /dev/null +++ b/site/docs/2.1.3/api/R/freqItems.html @@ -0,0 +1,80 @@ +<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: Finding frequent items for columns, possibly with false...</title> +<meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> +<link rel="stylesheet" type="text/css" href="R.css" /> + +<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/styles/github.min.css"> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/highlight.min.js"></script> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/languages/r.min.js"></script> +<script>hljs.initHighlightingOnLoad();</script> +</head><body> + +<table width="100%" summary="page for freqItems {SparkR}"><tr><td>freqItems {SparkR}</td><td style="text-align: right;">R Documentation</td></tr></table> + +<h2>Finding frequent items for columns, possibly with false positives</h2> + +<h3>Description</h3> + +<p>Finding frequent items for columns, possibly with false positives. +Using the frequent element count algorithm described in +<a href="http://dx.doi.org/10.1145/762471.762473">http://dx.doi.org/10.1145/762471.762473</a>, proposed by Karp, Schenker, and Papadimitriou. +</p> + + +<h3>Usage</h3> + +<pre> +## S4 method for signature 'SparkDataFrame,character' +freqItems(x, cols, support = 0.01) +</pre> + + +<h3>Arguments</h3> + +<table summary="R argblock"> +<tr valign="top"><td><code>x</code></td> +<td> +<p>A SparkDataFrame.</p> +</td></tr> +<tr valign="top"><td><code>cols</code></td> +<td> +<p>A vector column names to search frequent items in.</p> +</td></tr> +<tr valign="top"><td><code>support</code></td> +<td> +<p>(Optional) The minimum frequency for an item to be considered <code>frequent</code>. +Should be greater than 1e-4. Default support = 0.01.</p> +</td></tr> +</table> + + +<h3>Value</h3> + +<p>a local R data.frame with the frequent items in each column +</p> + + +<h3>Note</h3> + +<p>freqItems since 1.6.0 +</p> + + +<h3>See Also</h3> + +<p>Other stat functions: <code><a href="approxQuantile.html">approxQuantile</a></code>, +<code><a href="corr.html">corr</a></code>, <code><a href="cov.html">cov</a></code>, +<code><a href="crosstab.html">crosstab</a></code>, <code><a href="sampleBy.html">sampleBy</a></code> +</p> + + +<h3>Examples</h3> + +<pre><code class="r">## Not run: +##D df <- read.json("/path/to/file.json") +##D fi = freqItems(df, c("title", "gender")) +## End(Not run) +</code></pre> + + +<hr /><div style="text-align: center;">[Package <em>SparkR</em> version 2.1.3 <a href="00Index.html">Index</a>]</div> +</body></html> http://git-wip-us.apache.org/repos/asf/spark-website/blob/3a634959/site/docs/2.1.3/api/R/from_unixtime.html ---------------------------------------------------------------------- diff --git a/site/docs/2.1.3/api/R/from_unixtime.html b/site/docs/2.1.3/api/R/from_unixtime.html new file mode 100644 index 0000000..ffbb681 --- /dev/null +++ b/site/docs/2.1.3/api/R/from_unixtime.html @@ -0,0 +1,86 @@ +<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: from_unixtime</title> +<meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> +<link rel="stylesheet" type="text/css" href="R.css" /> + +<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/styles/github.min.css"> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/highlight.min.js"></script> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/languages/r.min.js"></script> +<script>hljs.initHighlightingOnLoad();</script> +</head><body> + +<table width="100%" summary="page for from_unixtime {SparkR}"><tr><td>from_unixtime {SparkR}</td><td style="text-align: right;">R Documentation</td></tr></table> + +<h2>from_unixtime</h2> + +<h3>Description</h3> + +<p>Converts the number of seconds from unix epoch (1970-01-01 00:00:00 UTC) to a string +representing the timestamp of that moment in the current system time zone in the given +format. +</p> + + +<h3>Usage</h3> + +<pre> +from_unixtime(x, ...) + +## S4 method for signature 'Column' +from_unixtime(x, format = "yyyy-MM-dd HH:mm:ss") +</pre> + + +<h3>Arguments</h3> + +<table summary="R argblock"> +<tr valign="top"><td><code>x</code></td> +<td> +<p>a Column of unix timestamp.</p> +</td></tr> +<tr valign="top"><td><code>...</code></td> +<td> +<p>further arguments to be passed to or from other methods.</p> +</td></tr> +<tr valign="top"><td><code>format</code></td> +<td> +<p>the target format. See +<a href="http://docs.oracle.com/javase/tutorial/i18n/format/simpleDateFormat.html"> +Customizing Formats</a> for available options.</p> +</td></tr> +</table> + + +<h3>Note</h3> + +<p>from_unixtime since 1.5.0 +</p> + + +<h3>See Also</h3> + +<p>Other datetime_funcs: <code><a href="add_months.html">add_months</a></code>, +<code><a href="date_add.html">date_add</a></code>, <code><a href="date_format.html">date_format</a></code>, +<code><a href="date_sub.html">date_sub</a></code>, <code><a href="datediff.html">datediff</a></code>, +<code><a href="dayofmonth.html">dayofmonth</a></code>, <code><a href="dayofyear.html">dayofyear</a></code>, +<code><a href="from_utc_timestamp.html">from_utc_timestamp</a></code>, <code><a href="hour.html">hour</a></code>, +<code><a href="last_day.html">last_day</a></code>, <code><a href="minute.html">minute</a></code>, +<code><a href="months_between.html">months_between</a></code>, <code><a href="month.html">month</a></code>, +<code><a href="next_day.html">next_day</a></code>, <code><a href="quarter.html">quarter</a></code>, +<code><a href="second.html">second</a></code>, <code><a href="to_date.html">to_date</a></code>, +<code><a href="to_utc_timestamp.html">to_utc_timestamp</a></code>, +<code><a href="unix_timestamp.html">unix_timestamp</a></code>, <code><a href="weekofyear.html">weekofyear</a></code>, +<code><a href="window.html">window</a></code>, <code><a href="year.html">year</a></code> +</p> + + +<h3>Examples</h3> + +<pre><code class="r">## Not run: +##D from_unixtime(df$t) +##D from_unixtime(df$t, 'yyyy/MM/dd HH') +## End(Not run) +</code></pre> + + +<hr /><div style="text-align: center;">[Package <em>SparkR</em> version 2.1.3 <a href="00Index.html">Index</a>]</div> +</body></html> http://git-wip-us.apache.org/repos/asf/spark-website/blob/3a634959/site/docs/2.1.3/api/R/from_utc_timestamp.html ---------------------------------------------------------------------- diff --git a/site/docs/2.1.3/api/R/from_utc_timestamp.html b/site/docs/2.1.3/api/R/from_utc_timestamp.html new file mode 100644 index 0000000..8622f75 --- /dev/null +++ b/site/docs/2.1.3/api/R/from_utc_timestamp.html @@ -0,0 +1,76 @@ +<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: from_utc_timestamp</title> +<meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> +<link rel="stylesheet" type="text/css" href="R.css" /> + +<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/styles/github.min.css"> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/highlight.min.js"></script> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/languages/r.min.js"></script> +<script>hljs.initHighlightingOnLoad();</script> +</head><body> + +<table width="100%" summary="page for from_utc_timestamp {SparkR}"><tr><td>from_utc_timestamp {SparkR}</td><td style="text-align: right;">R Documentation</td></tr></table> + +<h2>from_utc_timestamp</h2> + +<h3>Description</h3> + +<p>Given a timestamp, which corresponds to a certain time of day in UTC, returns another timestamp +that corresponds to the same time of day in the given timezone. +</p> + + +<h3>Usage</h3> + +<pre> +from_utc_timestamp(y, x) + +## S4 method for signature 'Column,character' +from_utc_timestamp(y, x) +</pre> + + +<h3>Arguments</h3> + +<table summary="R argblock"> +<tr valign="top"><td><code>y</code></td> +<td> +<p>Column to compute on.</p> +</td></tr> +<tr valign="top"><td><code>x</code></td> +<td> +<p>time zone to use.</p> +</td></tr> +</table> + + +<h3>Note</h3> + +<p>from_utc_timestamp since 1.5.0 +</p> + + +<h3>See Also</h3> + +<p>Other datetime_funcs: <code><a href="add_months.html">add_months</a></code>, +<code><a href="date_add.html">date_add</a></code>, <code><a href="date_format.html">date_format</a></code>, +<code><a href="date_sub.html">date_sub</a></code>, <code><a href="datediff.html">datediff</a></code>, +<code><a href="dayofmonth.html">dayofmonth</a></code>, <code><a href="dayofyear.html">dayofyear</a></code>, +<code><a href="from_unixtime.html">from_unixtime</a></code>, <code><a href="hour.html">hour</a></code>, +<code><a href="last_day.html">last_day</a></code>, <code><a href="minute.html">minute</a></code>, +<code><a href="months_between.html">months_between</a></code>, <code><a href="month.html">month</a></code>, +<code><a href="next_day.html">next_day</a></code>, <code><a href="quarter.html">quarter</a></code>, +<code><a href="second.html">second</a></code>, <code><a href="to_date.html">to_date</a></code>, +<code><a href="to_utc_timestamp.html">to_utc_timestamp</a></code>, +<code><a href="unix_timestamp.html">unix_timestamp</a></code>, <code><a href="weekofyear.html">weekofyear</a></code>, +<code><a href="window.html">window</a></code>, <code><a href="year.html">year</a></code> +</p> + + +<h3>Examples</h3> + +<pre><code class="r">## Not run: from_utc_timestamp(df$t, 'PST') +</code></pre> + + +<hr /><div style="text-align: center;">[Package <em>SparkR</em> version 2.1.3 <a href="00Index.html">Index</a>]</div> +</body></html> http://git-wip-us.apache.org/repos/asf/spark-website/blob/3a634959/site/docs/2.1.3/api/R/gapply.html ---------------------------------------------------------------------- diff --git a/site/docs/2.1.3/api/R/gapply.html b/site/docs/2.1.3/api/R/gapply.html new file mode 100644 index 0000000..8cf199b --- /dev/null +++ b/site/docs/2.1.3/api/R/gapply.html @@ -0,0 +1,192 @@ +<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: gapply</title> +<meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> +<link rel="stylesheet" type="text/css" href="R.css" /> + +<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/styles/github.min.css"> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/highlight.min.js"></script> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/languages/r.min.js"></script> +<script>hljs.initHighlightingOnLoad();</script> +</head><body> + +<table width="100%" summary="page for gapply {SparkR}"><tr><td>gapply {SparkR}</td><td style="text-align: right;">R Documentation</td></tr></table> + +<h2>gapply</h2> + +<h3>Description</h3> + +<p>gapply +</p> +<p>Groups the SparkDataFrame using the specified columns and applies the R function to each +group. +</p> + + +<h3>Usage</h3> + +<pre> +gapply(x, ...) + +## S4 method for signature 'GroupedData' +gapply(x, func, schema) + +## S4 method for signature 'SparkDataFrame' +gapply(x, cols, func, schema) +</pre> + + +<h3>Arguments</h3> + +<table summary="R argblock"> +<tr valign="top"><td><code>x</code></td> +<td> +<p>a SparkDataFrame or GroupedData.</p> +</td></tr> +<tr valign="top"><td><code>...</code></td> +<td> +<p>additional argument(s) passed to the method.</p> +</td></tr> +<tr valign="top"><td><code>func</code></td> +<td> +<p>a function to be applied to each group partition specified by grouping +column of the SparkDataFrame. The function <code>func</code> takes as argument +a key - grouping columns and a data frame - a local R data.frame. +The output of <code>func</code> is a local R data.frame.</p> +</td></tr> +<tr valign="top"><td><code>schema</code></td> +<td> +<p>the schema of the resulting SparkDataFrame after the function is applied. +The schema must match to output of <code>func</code>. It has to be defined for each +output column with preferred output column name and corresponding data type.</p> +</td></tr> +<tr valign="top"><td><code>cols</code></td> +<td> +<p>grouping columns.</p> +</td></tr> +</table> + + +<h3>Value</h3> + +<p>A SparkDataFrame. +</p> + + +<h3>Note</h3> + +<p>gapply(GroupedData) since 2.0.0 +</p> +<p>gapply(SparkDataFrame) since 2.0.0 +</p> + + +<h3>See Also</h3> + +<p><a href="gapplyCollect.html">gapplyCollect</a> +</p> +<p>Other SparkDataFrame functions: <code><a href="SparkDataFrame.html">SparkDataFrame-class</a></code>, +<code><a href="summarize.html">agg</a></code>, <code><a href="arrange.html">arrange</a></code>, +<code><a href="as.data.frame.html">as.data.frame</a></code>, <code><a href="attach.html">attach</a></code>, +<code><a href="cache.html">cache</a></code>, <code><a href="coalesce.html">coalesce</a></code>, +<code><a href="collect.html">collect</a></code>, <code><a href="columns.html">colnames</a></code>, +<code><a href="coltypes.html">coltypes</a></code>, +<code><a href="createOrReplaceTempView.html">createOrReplaceTempView</a></code>, +<code><a href="crossJoin.html">crossJoin</a></code>, <code><a href="dapplyCollect.html">dapplyCollect</a></code>, +<code><a href="dapply.html">dapply</a></code>, <code><a href="summary.html">describe</a></code>, +<code><a href="dim.html">dim</a></code>, <code><a href="distinct.html">distinct</a></code>, +<code><a href="dropDuplicates.html">dropDuplicates</a></code>, <code><a href="nafunctions.html">dropna</a></code>, +<code><a href="drop.html">drop</a></code>, <code><a href="dtypes.html">dtypes</a></code>, +<code><a href="except.html">except</a></code>, <code><a href="explain.html">explain</a></code>, +<code><a href="filter.html">filter</a></code>, <code><a href="first.html">first</a></code>, +<code><a href="gapplyCollect.html">gapplyCollect</a></code>, +<code><a href="getNumPartitions.html">getNumPartitions</a></code>, <code><a href="groupBy.html">group_by</a></code>, +<code><a href="head.html">head</a></code>, <code><a href="histogram.html">histogram</a></code>, +<code><a href="insertInto.html">insertInto</a></code>, <code><a href="intersect.html">intersect</a></code>, +<code><a href="isLocal.html">isLocal</a></code>, <code><a href="join.html">join</a></code>, +<code><a href="limit.html">limit</a></code>, <code><a href="merge.html">merge</a></code>, +<code><a href="mutate.html">mutate</a></code>, <code><a href="ncol.html">ncol</a></code>, +<code><a href="nrow.html">nrow</a></code>, <code><a href="persist.html">persist</a></code>, +<code><a href="printSchema.html">printSchema</a></code>, <code><a href="randomSplit.html">randomSplit</a></code>, +<code><a href="rbind.html">rbind</a></code>, <code><a href="registerTempTable-deprecated.html">registerTempTable</a></code>, +<code><a href="rename.html">rename</a></code>, <code><a href="repartition.html">repartition</a></code>, +<code><a href="sample.html">sample</a></code>, <code><a href="saveAsTable.html">saveAsTable</a></code>, +<code><a href="schema.html">schema</a></code>, <code><a href="selectExpr.html">selectExpr</a></code>, +<code><a href="select.html">select</a></code>, <code><a href="showDF.html">showDF</a></code>, +<code><a href="show.html">show</a></code>, <code><a href="storageLevel.html">storageLevel</a></code>, +<code><a href="str.html">str</a></code>, <code><a href="subset.html">subset</a></code>, +<code><a href="take.html">take</a></code>, <code><a href="union.html">union</a></code>, +<code><a href="unpersist.html">unpersist</a></code>, <code><a href="withColumn.html">withColumn</a></code>, +<code><a href="with.html">with</a></code>, <code><a href="write.df.html">write.df</a></code>, +<code><a href="write.jdbc.html">write.jdbc</a></code>, <code><a href="write.json.html">write.json</a></code>, +<code><a href="write.orc.html">write.orc</a></code>, <code><a href="write.parquet.html">write.parquet</a></code>, +<code><a href="write.text.html">write.text</a></code> +</p> + + +<h3>Examples</h3> + +<pre><code class="r">## Not run: +##D Computes the arithmetic mean of the second column by grouping +##D on the first and third columns. Output the grouping values and the average. +##D +##D df <- createDataFrame ( +##D list(list(1L, 1, "1", 0.1), list(1L, 2, "1", 0.2), list(3L, 3, "3", 0.3)), +##D c("a", "b", "c", "d")) +##D +##D Here our output contains three columns, the key which is a combination of two +##D columns with data types integer and string and the mean which is a double. +##D schema <- structType(structField("a", "integer"), structField("c", "string"), +##D structField("avg", "double")) +##D result <- gapply( +##D df, +##D c("a", "c"), +##D function(key, x) { +##D y <- data.frame(key, mean(x$b), stringsAsFactors = FALSE) +##D }, schema) +##D +##D We can also group the data and afterwards call gapply on GroupedData. +##D For Example: +##D gdf <- group_by(df, "a", "c") +##D result <- gapply( +##D gdf, +##D function(key, x) { +##D y <- data.frame(key, mean(x$b), stringsAsFactors = FALSE) +##D }, schema) +##D collect(result) +##D +##D Result +##D ------ +##D a c avg +##D 3 3 3.0 +##D 1 1 1.5 +##D +##D Fits linear models on iris dataset by grouping on the 'Species' column and +##D using 'Sepal_Length' as a target variable, 'Sepal_Width', 'Petal_Length' +##D and 'Petal_Width' as training features. +##D +##D df <- createDataFrame (iris) +##D schema <- structType(structField("(Intercept)", "double"), +##D structField("Sepal_Width", "double"),structField("Petal_Length", "double"), +##D structField("Petal_Width", "double")) +##D df1 <- gapply( +##D df, +##D df$"Species", +##D function(key, x) { +##D m <- suppressWarnings(lm(Sepal_Length ~ +##D Sepal_Width + Petal_Length + Petal_Width, x)) +##D data.frame(t(coef(m))) +##D }, schema) +##D collect(df1) +##D +##D Result +##D --------- +##D Model (Intercept) Sepal_Width Petal_Length Petal_Width +##D 1 0.699883 0.3303370 0.9455356 -0.1697527 +##D 2 1.895540 0.3868576 0.9083370 -0.6792238 +##D 3 2.351890 0.6548350 0.2375602 0.2521257 +##D +## End(Not run) +</code></pre> + + +<hr /><div style="text-align: center;">[Package <em>SparkR</em> version 2.1.3 <a href="00Index.html">Index</a>]</div> +</body></html> http://git-wip-us.apache.org/repos/asf/spark-website/blob/3a634959/site/docs/2.1.3/api/R/gapplyCollect.html ---------------------------------------------------------------------- diff --git a/site/docs/2.1.3/api/R/gapplyCollect.html b/site/docs/2.1.3/api/R/gapplyCollect.html new file mode 100644 index 0000000..72dba5d --- /dev/null +++ b/site/docs/2.1.3/api/R/gapplyCollect.html @@ -0,0 +1,180 @@ +<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: gapplyCollect</title> +<meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> +<link rel="stylesheet" type="text/css" href="R.css" /> + +<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/styles/github.min.css"> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/highlight.min.js"></script> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/languages/r.min.js"></script> +<script>hljs.initHighlightingOnLoad();</script> +</head><body> + +<table width="100%" summary="page for gapplyCollect {SparkR}"><tr><td>gapplyCollect {SparkR}</td><td style="text-align: right;">R Documentation</td></tr></table> + +<h2>gapplyCollect</h2> + +<h3>Description</h3> + +<p>gapplyCollect +</p> +<p>Groups the SparkDataFrame using the specified columns, applies the R function to each +group and collects the result back to R as data.frame. +</p> + + +<h3>Usage</h3> + +<pre> +gapplyCollect(x, ...) + +## S4 method for signature 'GroupedData' +gapplyCollect(x, func) + +## S4 method for signature 'SparkDataFrame' +gapplyCollect(x, cols, func) +</pre> + + +<h3>Arguments</h3> + +<table summary="R argblock"> +<tr valign="top"><td><code>x</code></td> +<td> +<p>a SparkDataFrame or GroupedData.</p> +</td></tr> +<tr valign="top"><td><code>...</code></td> +<td> +<p>additional argument(s) passed to the method.</p> +</td></tr> +<tr valign="top"><td><code>func</code></td> +<td> +<p>a function to be applied to each group partition specified by grouping +column of the SparkDataFrame. The function <code>func</code> takes as argument +a key - grouping columns and a data frame - a local R data.frame. +The output of <code>func</code> is a local R data.frame.</p> +</td></tr> +<tr valign="top"><td><code>cols</code></td> +<td> +<p>grouping columns.</p> +</td></tr> +</table> + + +<h3>Value</h3> + +<p>A data.frame. +</p> + + +<h3>Note</h3> + +<p>gapplyCollect(GroupedData) since 2.0.0 +</p> +<p>gapplyCollect(SparkDataFrame) since 2.0.0 +</p> + + +<h3>See Also</h3> + +<p><a href="gapply.html">gapply</a> +</p> +<p>Other SparkDataFrame functions: <code><a href="SparkDataFrame.html">SparkDataFrame-class</a></code>, +<code><a href="summarize.html">agg</a></code>, <code><a href="arrange.html">arrange</a></code>, +<code><a href="as.data.frame.html">as.data.frame</a></code>, <code><a href="attach.html">attach</a></code>, +<code><a href="cache.html">cache</a></code>, <code><a href="coalesce.html">coalesce</a></code>, +<code><a href="collect.html">collect</a></code>, <code><a href="columns.html">colnames</a></code>, +<code><a href="coltypes.html">coltypes</a></code>, +<code><a href="createOrReplaceTempView.html">createOrReplaceTempView</a></code>, +<code><a href="crossJoin.html">crossJoin</a></code>, <code><a href="dapplyCollect.html">dapplyCollect</a></code>, +<code><a href="dapply.html">dapply</a></code>, <code><a href="summary.html">describe</a></code>, +<code><a href="dim.html">dim</a></code>, <code><a href="distinct.html">distinct</a></code>, +<code><a href="dropDuplicates.html">dropDuplicates</a></code>, <code><a href="nafunctions.html">dropna</a></code>, +<code><a href="drop.html">drop</a></code>, <code><a href="dtypes.html">dtypes</a></code>, +<code><a href="except.html">except</a></code>, <code><a href="explain.html">explain</a></code>, +<code><a href="filter.html">filter</a></code>, <code><a href="first.html">first</a></code>, +<code><a href="gapply.html">gapply</a></code>, <code><a href="getNumPartitions.html">getNumPartitions</a></code>, +<code><a href="groupBy.html">group_by</a></code>, <code><a href="head.html">head</a></code>, +<code><a href="histogram.html">histogram</a></code>, <code><a href="insertInto.html">insertInto</a></code>, +<code><a href="intersect.html">intersect</a></code>, <code><a href="isLocal.html">isLocal</a></code>, +<code><a href="join.html">join</a></code>, <code><a href="limit.html">limit</a></code>, +<code><a href="merge.html">merge</a></code>, <code><a href="mutate.html">mutate</a></code>, +<code><a href="ncol.html">ncol</a></code>, <code><a href="nrow.html">nrow</a></code>, +<code><a href="persist.html">persist</a></code>, <code><a href="printSchema.html">printSchema</a></code>, +<code><a href="randomSplit.html">randomSplit</a></code>, <code><a href="rbind.html">rbind</a></code>, +<code><a href="registerTempTable-deprecated.html">registerTempTable</a></code>, <code><a href="rename.html">rename</a></code>, +<code><a href="repartition.html">repartition</a></code>, <code><a href="sample.html">sample</a></code>, +<code><a href="saveAsTable.html">saveAsTable</a></code>, <code><a href="schema.html">schema</a></code>, +<code><a href="selectExpr.html">selectExpr</a></code>, <code><a href="select.html">select</a></code>, +<code><a href="showDF.html">showDF</a></code>, <code><a href="show.html">show</a></code>, +<code><a href="storageLevel.html">storageLevel</a></code>, <code><a href="str.html">str</a></code>, +<code><a href="subset.html">subset</a></code>, <code><a href="take.html">take</a></code>, +<code><a href="union.html">union</a></code>, <code><a href="unpersist.html">unpersist</a></code>, +<code><a href="withColumn.html">withColumn</a></code>, <code><a href="with.html">with</a></code>, +<code><a href="write.df.html">write.df</a></code>, <code><a href="write.jdbc.html">write.jdbc</a></code>, +<code><a href="write.json.html">write.json</a></code>, <code><a href="write.orc.html">write.orc</a></code>, +<code><a href="write.parquet.html">write.parquet</a></code>, <code><a href="write.text.html">write.text</a></code> +</p> + + +<h3>Examples</h3> + +<pre><code class="r">## Not run: +##D Computes the arithmetic mean of the second column by grouping +##D on the first and third columns. Output the grouping values and the average. +##D +##D df <- createDataFrame ( +##D list(list(1L, 1, "1", 0.1), list(1L, 2, "1", 0.2), list(3L, 3, "3", 0.3)), +##D c("a", "b", "c", "d")) +##D +##D result <- gapplyCollect( +##D df, +##D c("a", "c"), +##D function(key, x) { +##D y <- data.frame(key, mean(x$b), stringsAsFactors = FALSE) +##D colnames(y) <- c("key_a", "key_c", "mean_b") +##D y +##D }) +##D +##D We can also group the data and afterwards call gapply on GroupedData. +##D For Example: +##D gdf <- group_by(df, "a", "c") +##D result <- gapplyCollect( +##D gdf, +##D function(key, x) { +##D y <- data.frame(key, mean(x$b), stringsAsFactors = FALSE) +##D colnames(y) <- c("key_a", "key_c", "mean_b") +##D y +##D }) +##D +##D Result +##D ------ +##D key_a key_c mean_b +##D 3 3 3.0 +##D 1 1 1.5 +##D +##D Fits linear models on iris dataset by grouping on the 'Species' column and +##D using 'Sepal_Length' as a target variable, 'Sepal_Width', 'Petal_Length' +##D and 'Petal_Width' as training features. +##D +##D df <- createDataFrame (iris) +##D result <- gapplyCollect( +##D df, +##D df$"Species", +##D function(key, x) { +##D m <- suppressWarnings(lm(Sepal_Length ~ +##D Sepal_Width + Petal_Length + Petal_Width, x)) +##D data.frame(t(coef(m))) +##D }) +##D +##D Result +##D --------- +##D Model X.Intercept. Sepal_Width Petal_Length Petal_Width +##D 1 0.699883 0.3303370 0.9455356 -0.1697527 +##D 2 1.895540 0.3868576 0.9083370 -0.6792238 +##D 3 2.351890 0.6548350 0.2375602 0.2521257 +##D +## End(Not run) +</code></pre> + + +<hr /><div style="text-align: center;">[Package <em>SparkR</em> version 2.1.3 <a href="00Index.html">Index</a>]</div> +</body></html> http://git-wip-us.apache.org/repos/asf/spark-website/blob/3a634959/site/docs/2.1.3/api/R/generateAliasesForIntersectedCols.html ---------------------------------------------------------------------- diff --git a/site/docs/2.1.3/api/R/generateAliasesForIntersectedCols.html b/site/docs/2.1.3/api/R/generateAliasesForIntersectedCols.html new file mode 100644 index 0000000..d8e181c --- /dev/null +++ b/site/docs/2.1.3/api/R/generateAliasesForIntersectedCols.html @@ -0,0 +1,54 @@ +<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: Creates a list of columns by replacing the intersected ones...</title> +<meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> +<link rel="stylesheet" type="text/css" href="R.css" /> +</head><body> + +<table width="100%" summary="page for generateAliasesForIntersectedCols {SparkR}"><tr><td>generateAliasesForIntersectedCols {SparkR}</td><td style="text-align: right;">R Documentation</td></tr></table> + +<h2>Creates a list of columns by replacing the intersected ones with aliases</h2> + +<h3>Description</h3> + +<p>Creates a list of columns by replacing the intersected ones with aliases. +The name of the alias column is formed by concatanating the original column name and a suffix. +</p> + + +<h3>Usage</h3> + +<pre> +generateAliasesForIntersectedCols(x, intersectedColNames, suffix) +</pre> + + +<h3>Arguments</h3> + +<table summary="R argblock"> +<tr valign="top"><td><code>x</code></td> +<td> +<p>a SparkDataFrame</p> +</td></tr> +<tr valign="top"><td><code>intersectedColNames</code></td> +<td> +<p>a list of intersected column names of the SparkDataFrame</p> +</td></tr> +<tr valign="top"><td><code>suffix</code></td> +<td> +<p>a suffix for the column name</p> +</td></tr> +</table> + + +<h3>Value</h3> + +<p>list of columns +</p> + + +<h3>Note</h3> + +<p>generateAliasesForIntersectedCols since 1.6.0 +</p> + +<hr /><div style="text-align: center;">[Package <em>SparkR</em> version 2.1.3 <a href="00Index.html">Index</a>]</div> +</body></html> http://git-wip-us.apache.org/repos/asf/spark-website/blob/3a634959/site/docs/2.1.3/api/R/getNumPartitions.html ---------------------------------------------------------------------- diff --git a/site/docs/2.1.3/api/R/getNumPartitions.html b/site/docs/2.1.3/api/R/getNumPartitions.html new file mode 100644 index 0000000..9359f00 --- /dev/null +++ b/site/docs/2.1.3/api/R/getNumPartitions.html @@ -0,0 +1,96 @@ +<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: getNumPartitions</title> +<meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> +<link rel="stylesheet" type="text/css" href="R.css" /> + +<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/styles/github.min.css"> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/highlight.min.js"></script> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/languages/r.min.js"></script> +<script>hljs.initHighlightingOnLoad();</script> +</head><body> + +<table width="100%" summary="page for getNumPartitions {SparkR}"><tr><td>getNumPartitions {SparkR}</td><td style="text-align: right;">R Documentation</td></tr></table> + +<h2>getNumPartitions</h2> + +<h3>Description</h3> + +<p>Return the number of partitions +</p> + + +<h3>Usage</h3> + +<pre> +## S4 method for signature 'SparkDataFrame' +getNumPartitions(x) +</pre> + + +<h3>Arguments</h3> + +<table summary="R argblock"> +<tr valign="top"><td><code>x</code></td> +<td> +<p>A SparkDataFrame</p> +</td></tr> +</table> + + +<h3>Note</h3> + +<p>getNumPartitions since 2.1.1 +</p> + + +<h3>See Also</h3> + +<p>Other SparkDataFrame functions: <code><a href="SparkDataFrame.html">SparkDataFrame-class</a></code>, +<code><a href="summarize.html">agg</a></code>, <code><a href="arrange.html">arrange</a></code>, +<code><a href="as.data.frame.html">as.data.frame</a></code>, <code><a href="attach.html">attach</a></code>, +<code><a href="cache.html">cache</a></code>, <code><a href="coalesce.html">coalesce</a></code>, +<code><a href="collect.html">collect</a></code>, <code><a href="columns.html">colnames</a></code>, +<code><a href="coltypes.html">coltypes</a></code>, +<code><a href="createOrReplaceTempView.html">createOrReplaceTempView</a></code>, +<code><a href="crossJoin.html">crossJoin</a></code>, <code><a href="dapplyCollect.html">dapplyCollect</a></code>, +<code><a href="dapply.html">dapply</a></code>, <code><a href="summary.html">describe</a></code>, +<code><a href="dim.html">dim</a></code>, <code><a href="distinct.html">distinct</a></code>, +<code><a href="dropDuplicates.html">dropDuplicates</a></code>, <code><a href="nafunctions.html">dropna</a></code>, +<code><a href="drop.html">drop</a></code>, <code><a href="dtypes.html">dtypes</a></code>, +<code><a href="except.html">except</a></code>, <code><a href="explain.html">explain</a></code>, +<code><a href="filter.html">filter</a></code>, <code><a href="first.html">first</a></code>, +<code><a href="gapplyCollect.html">gapplyCollect</a></code>, <code><a href="gapply.html">gapply</a></code>, +<code><a href="groupBy.html">group_by</a></code>, <code><a href="head.html">head</a></code>, +<code><a href="histogram.html">histogram</a></code>, <code><a href="insertInto.html">insertInto</a></code>, +<code><a href="intersect.html">intersect</a></code>, <code><a href="isLocal.html">isLocal</a></code>, +<code><a href="join.html">join</a></code>, <code><a href="limit.html">limit</a></code>, +<code><a href="merge.html">merge</a></code>, <code><a href="mutate.html">mutate</a></code>, +<code><a href="ncol.html">ncol</a></code>, <code><a href="nrow.html">nrow</a></code>, +<code><a href="persist.html">persist</a></code>, <code><a href="printSchema.html">printSchema</a></code>, +<code><a href="randomSplit.html">randomSplit</a></code>, <code><a href="rbind.html">rbind</a></code>, +<code><a href="registerTempTable-deprecated.html">registerTempTable</a></code>, <code><a href="rename.html">rename</a></code>, +<code><a href="repartition.html">repartition</a></code>, <code><a href="sample.html">sample</a></code>, +<code><a href="saveAsTable.html">saveAsTable</a></code>, <code><a href="schema.html">schema</a></code>, +<code><a href="selectExpr.html">selectExpr</a></code>, <code><a href="select.html">select</a></code>, +<code><a href="showDF.html">showDF</a></code>, <code><a href="show.html">show</a></code>, +<code><a href="storageLevel.html">storageLevel</a></code>, <code><a href="str.html">str</a></code>, +<code><a href="subset.html">subset</a></code>, <code><a href="take.html">take</a></code>, +<code><a href="union.html">union</a></code>, <code><a href="unpersist.html">unpersist</a></code>, +<code><a href="withColumn.html">withColumn</a></code>, <code><a href="with.html">with</a></code>, +<code><a href="write.df.html">write.df</a></code>, <code><a href="write.jdbc.html">write.jdbc</a></code>, +<code><a href="write.json.html">write.json</a></code>, <code><a href="write.orc.html">write.orc</a></code>, +<code><a href="write.parquet.html">write.parquet</a></code>, <code><a href="write.text.html">write.text</a></code> +</p> + + +<h3>Examples</h3> + +<pre><code class="r">## Not run: +##D sparkR.session() +##D df <- createDataFrame(cars, numPartitions = 2) +##D getNumPartitions(df) +## End(Not run) +</code></pre> + + +<hr /><div style="text-align: center;">[Package <em>SparkR</em> version 2.1.3 <a href="00Index.html">Index</a>]</div> +</body></html> http://git-wip-us.apache.org/repos/asf/spark-website/blob/3a634959/site/docs/2.1.3/api/R/glm.html ---------------------------------------------------------------------- diff --git a/site/docs/2.1.3/api/R/glm.html b/site/docs/2.1.3/api/R/glm.html new file mode 100644 index 0000000..ec4ef05 --- /dev/null +++ b/site/docs/2.1.3/api/R/glm.html @@ -0,0 +1,187 @@ +<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: Generalized Linear Models (R-compliant)</title> +<meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> +<link rel="stylesheet" type="text/css" href="R.css" /> + +<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/styles/github.min.css"> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/highlight.min.js"></script> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/languages/r.min.js"></script> +<script>hljs.initHighlightingOnLoad();</script> +</head><body> + +<table width="100%" summary="page for glm {SparkR}"><tr><td>glm {SparkR}</td><td style="text-align: right;">R Documentation</td></tr></table> + +<h2>Generalized Linear Models (R-compliant)</h2> + +<h3>Description</h3> + +<p>Fits a generalized linear model, similarly to R's glm(). +</p> + + +<h3>Usage</h3> + +<pre> +glm(formula, family = gaussian, data, weights, subset, na.action, + start = NULL, etastart, mustart, offset, control = list(...), + model = TRUE, method = "glm.fit", x = FALSE, y = TRUE, + contrasts = NULL, ...) + +## S4 method for signature 'formula,ANY,SparkDataFrame' +glm(formula, family = gaussian, data, + epsilon = 1e-06, maxit = 25, weightCol = NULL) +</pre> + + +<h3>Arguments</h3> + +<table summary="R argblock"> +<tr valign="top"><td><code>formula</code></td> +<td> +<p>a symbolic description of the model to be fitted. Currently only a few formula +operators are supported, including '~', '.', ':', '+', and '-'.</p> +</td></tr> +<tr valign="top"><td><code>family</code></td> +<td> +<p>a description of the error distribution and link function to be used in the model. +This can be a character string naming a family function, a family function or +the result of a call to a family function. Refer R family at +<a href="https://stat.ethz.ch/R-manual/R-devel/library/stats/html/family.html">https://stat.ethz.ch/R-manual/R-devel/library/stats/html/family.html</a>. +Currently these families are supported: <code>binomial</code>, <code>gaussian</code>, +<code>Gamma</code>, and <code>poisson</code>.</p> +</td></tr> +<tr valign="top"><td><code>data</code></td> +<td> +<p>a SparkDataFrame or R's glm data for training.</p> +</td></tr> +<tr valign="top"><td><code>weights</code></td> +<td> +<p>an optional vector of ‘prior weights’ to be used +in the fitting process. Should be <code>NULL</code> or a numeric vector.</p> +</td></tr> +<tr valign="top"><td><code>subset</code></td> +<td> +<p>an optional vector specifying a subset of observations +to be used in the fitting process.</p> +</td></tr> +<tr valign="top"><td><code>na.action</code></td> +<td> +<p>a function which indicates what should happen +when the data contain <code>NA</code>s. The default is set by +the <code>na.action</code> setting of <code><a href="../../base/html/options.html">options</a></code>, and is +<code><a href="../../stats/html/na.fail.html">na.fail</a></code> if that is unset. The ‘factory-fresh’ +default is <code><a href="nafunctions.html">na.omit</a></code>. Another possible value is +<code>NULL</code>, no action. Value <code><a href="../../stats/html/na.fail.html">na.exclude</a></code> can be useful.</p> +</td></tr> +<tr valign="top"><td><code>start</code></td> +<td> +<p>starting values for the parameters in the linear predictor.</p> +</td></tr> +<tr valign="top"><td><code>etastart</code></td> +<td> +<p>starting values for the linear predictor.</p> +</td></tr> +<tr valign="top"><td><code>mustart</code></td> +<td> +<p>starting values for the vector of means.</p> +</td></tr> +<tr valign="top"><td><code>offset</code></td> +<td> +<p>this can be used to specify an <em>a priori</em> known +component to be included in the linear predictor during fitting. +This should be <code>NULL</code> or a numeric vector of length equal to +the number of cases. One or more <code><a href="../../stats/html/offset.html">offset</a></code> terms can be +included in the formula instead or as well, and if more than one is +specified their sum is used. See <code><a href="../../stats/html/model.extract.html">model.offset</a></code>.</p> +</td></tr> +<tr valign="top"><td><code>control</code></td> +<td> +<p>a list of parameters for controlling the fitting +process. For <code>glm.fit</code> this is passed to +<code><a href="../../stats/html/glm.control.html">glm.control</a></code>.</p> +</td></tr> +<tr valign="top"><td><code>model</code></td> +<td> +<p>a logical value indicating whether <em>model frame</em> +should be included as a component of the returned value.</p> +</td></tr> +<tr valign="top"><td><code>method</code></td> +<td> +<p>the method to be used in fitting the model. The default +method <code>"glm.fit"</code> uses iteratively reweighted least squares +(IWLS): the alternative <code>"model.frame"</code> returns the model frame +and does no fitting. +</p> +<p>User-supplied fitting functions can be supplied either as a function +or a character string naming a function, with a function which takes +the same arguments as <code>glm.fit</code>. If specified as a character +string it is looked up from within the <span class="pkg">stats</span> namespace. +</p> +</td></tr> +<tr valign="top"><td><code>x, y</code></td> +<td> +<p>For <code>glm</code>: logical values indicating whether the response vector +and model matrix used in the fitting process should be returned as +components of the returned value.</p> +</td></tr> +<tr valign="top"><td><code>contrasts</code></td> +<td> +<p>an optional list. See the <code>contrasts.arg</code> +of <code>model.matrix.default</code>.</p> +</td></tr> +<tr valign="top"><td><code>...</code></td> +<td> + +<p>For <code>glm</code>: arguments to be used to form the default +<code>control</code> argument if it is not supplied directly. +</p> +<p>For <code>weights</code>: further arguments passed to or from other methods. +</p> +</td></tr> +<tr valign="top"><td><code>epsilon</code></td> +<td> +<p>positive convergence tolerance of iterations.</p> +</td></tr> +<tr valign="top"><td><code>maxit</code></td> +<td> +<p>integer giving the maximal number of IRLS iterations.</p> +</td></tr> +<tr valign="top"><td><code>weightCol</code></td> +<td> +<p>the weight column name. If this is not set or <code>NULL</code>, we treat all instance +weights as 1.0.</p> +</td></tr> +</table> + + +<h3>Value</h3> + +<p><code>glm</code> returns a fitted generalized linear model. +</p> + + +<h3>Note</h3> + +<p>glm since 1.5.0 +</p> + + +<h3>See Also</h3> + +<p><a href="spark.glm.html">spark.glm</a> +</p> + + +<h3>Examples</h3> + +<pre><code class="r">## Not run: +##D sparkR.session() +##D data(iris) +##D df <- createDataFrame(iris) +##D model <- glm(Sepal_Length ~ Sepal_Width, df, family = "gaussian") +##D summary(model) +## End(Not run) +</code></pre> + + +<hr /><div style="text-align: center;">[Package <em>SparkR</em> version 2.1.3 <a href="00Index.html">Index</a>]</div> +</body></html> http://git-wip-us.apache.org/repos/asf/spark-website/blob/3a634959/site/docs/2.1.3/api/R/greatest.html ---------------------------------------------------------------------- diff --git a/site/docs/2.1.3/api/R/greatest.html b/site/docs/2.1.3/api/R/greatest.html new file mode 100644 index 0000000..2961a86 --- /dev/null +++ b/site/docs/2.1.3/api/R/greatest.html @@ -0,0 +1,72 @@ +<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: greatest</title> +<meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> +<link rel="stylesheet" type="text/css" href="R.css" /> + +<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/styles/github.min.css"> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/highlight.min.js"></script> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/languages/r.min.js"></script> +<script>hljs.initHighlightingOnLoad();</script> +</head><body> + +<table width="100%" summary="page for greatest {SparkR}"><tr><td>greatest {SparkR}</td><td style="text-align: right;">R Documentation</td></tr></table> + +<h2>greatest</h2> + +<h3>Description</h3> + +<p>Returns the greatest value of the list of column names, skipping null values. +This function takes at least 2 parameters. It will return null if all parameters are null. +</p> + + +<h3>Usage</h3> + +<pre> +greatest(x, ...) + +## S4 method for signature 'Column' +greatest(x, ...) +</pre> + + +<h3>Arguments</h3> + +<table summary="R argblock"> +<tr valign="top"><td><code>x</code></td> +<td> +<p>Column to compute on</p> +</td></tr> +<tr valign="top"><td><code>...</code></td> +<td> +<p>other columns</p> +</td></tr> +</table> + + +<h3>Note</h3> + +<p>greatest since 1.5.0 +</p> + + +<h3>See Also</h3> + +<p>Other normal_funcs: <code><a href="abs.html">abs</a></code>, +<code><a href="bitwiseNOT.html">bitwiseNOT</a></code>, <code><a href="coalesce.html">coalesce</a></code>, +<code><a href="column.html">column</a></code>, <code><a href="expr.html">expr</a></code>, +<code><a href="ifelse.html">ifelse</a></code>, <code><a href="is.nan.html">isnan</a></code>, +<code><a href="least.html">least</a></code>, <code><a href="lit.html">lit</a></code>, +<code><a href="nanvl.html">nanvl</a></code>, <code><a href="negate.html">negate</a></code>, +<code><a href="randn.html">randn</a></code>, <code><a href="rand.html">rand</a></code>, +<code><a href="struct.html">struct</a></code>, <code><a href="when.html">when</a></code> +</p> + + +<h3>Examples</h3> + +<pre><code class="r">## Not run: greatest(df$c, df$d) +</code></pre> + + +<hr /><div style="text-align: center;">[Package <em>SparkR</em> version 2.1.3 <a href="00Index.html">Index</a>]</div> +</body></html> http://git-wip-us.apache.org/repos/asf/spark-website/blob/3a634959/site/docs/2.1.3/api/R/groupBy.html ---------------------------------------------------------------------- diff --git a/site/docs/2.1.3/api/R/groupBy.html b/site/docs/2.1.3/api/R/groupBy.html new file mode 100644 index 0000000..2d6f815 --- /dev/null +++ b/site/docs/2.1.3/api/R/groupBy.html @@ -0,0 +1,117 @@ +<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: GroupBy</title> +<meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> +<link rel="stylesheet" type="text/css" href="R.css" /> + +<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/styles/github.min.css"> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/highlight.min.js"></script> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/languages/r.min.js"></script> +<script>hljs.initHighlightingOnLoad();</script> +</head><body> + +<table width="100%" summary="page for group_by {SparkR}"><tr><td>group_by {SparkR}</td><td style="text-align: right;">R Documentation</td></tr></table> + +<h2>GroupBy</h2> + +<h3>Description</h3> + +<p>Groups the SparkDataFrame using the specified columns, so we can run aggregation on them. +</p> + + +<h3>Usage</h3> + +<pre> +group_by(x, ...) + +groupBy(x, ...) + +## S4 method for signature 'SparkDataFrame' +groupBy(x, ...) + +## S4 method for signature 'SparkDataFrame' +group_by(x, ...) +</pre> + + +<h3>Arguments</h3> + +<table summary="R argblock"> +<tr valign="top"><td><code>x</code></td> +<td> +<p>a SparkDataFrame.</p> +</td></tr> +<tr valign="top"><td><code>...</code></td> +<td> +<p>variable(s) (character names(s) or Column(s)) to group on.</p> +</td></tr> +</table> + + +<h3>Value</h3> + +<p>A GroupedData. +</p> + + +<h3>Note</h3> + +<p>groupBy since 1.4.0 +</p> +<p>group_by since 1.4.0 +</p> + + +<h3>See Also</h3> + +<p>Other SparkDataFrame functions: <code><a href="SparkDataFrame.html">SparkDataFrame-class</a></code>, +<code><a href="summarize.html">agg</a></code>, <code><a href="arrange.html">arrange</a></code>, +<code><a href="as.data.frame.html">as.data.frame</a></code>, <code><a href="attach.html">attach</a></code>, +<code><a href="cache.html">cache</a></code>, <code><a href="coalesce.html">coalesce</a></code>, +<code><a href="collect.html">collect</a></code>, <code><a href="columns.html">colnames</a></code>, +<code><a href="coltypes.html">coltypes</a></code>, +<code><a href="createOrReplaceTempView.html">createOrReplaceTempView</a></code>, +<code><a href="crossJoin.html">crossJoin</a></code>, <code><a href="dapplyCollect.html">dapplyCollect</a></code>, +<code><a href="dapply.html">dapply</a></code>, <code><a href="summary.html">describe</a></code>, +<code><a href="dim.html">dim</a></code>, <code><a href="distinct.html">distinct</a></code>, +<code><a href="dropDuplicates.html">dropDuplicates</a></code>, <code><a href="nafunctions.html">dropna</a></code>, +<code><a href="drop.html">drop</a></code>, <code><a href="dtypes.html">dtypes</a></code>, +<code><a href="except.html">except</a></code>, <code><a href="explain.html">explain</a></code>, +<code><a href="filter.html">filter</a></code>, <code><a href="first.html">first</a></code>, +<code><a href="gapplyCollect.html">gapplyCollect</a></code>, <code><a href="gapply.html">gapply</a></code>, +<code><a href="getNumPartitions.html">getNumPartitions</a></code>, <code><a href="head.html">head</a></code>, +<code><a href="histogram.html">histogram</a></code>, <code><a href="insertInto.html">insertInto</a></code>, +<code><a href="intersect.html">intersect</a></code>, <code><a href="isLocal.html">isLocal</a></code>, +<code><a href="join.html">join</a></code>, <code><a href="limit.html">limit</a></code>, +<code><a href="merge.html">merge</a></code>, <code><a href="mutate.html">mutate</a></code>, +<code><a href="ncol.html">ncol</a></code>, <code><a href="nrow.html">nrow</a></code>, +<code><a href="persist.html">persist</a></code>, <code><a href="printSchema.html">printSchema</a></code>, +<code><a href="randomSplit.html">randomSplit</a></code>, <code><a href="rbind.html">rbind</a></code>, +<code><a href="registerTempTable-deprecated.html">registerTempTable</a></code>, <code><a href="rename.html">rename</a></code>, +<code><a href="repartition.html">repartition</a></code>, <code><a href="sample.html">sample</a></code>, +<code><a href="saveAsTable.html">saveAsTable</a></code>, <code><a href="schema.html">schema</a></code>, +<code><a href="selectExpr.html">selectExpr</a></code>, <code><a href="select.html">select</a></code>, +<code><a href="showDF.html">showDF</a></code>, <code><a href="show.html">show</a></code>, +<code><a href="storageLevel.html">storageLevel</a></code>, <code><a href="str.html">str</a></code>, +<code><a href="subset.html">subset</a></code>, <code><a href="take.html">take</a></code>, +<code><a href="union.html">union</a></code>, <code><a href="unpersist.html">unpersist</a></code>, +<code><a href="withColumn.html">withColumn</a></code>, <code><a href="with.html">with</a></code>, +<code><a href="write.df.html">write.df</a></code>, <code><a href="write.jdbc.html">write.jdbc</a></code>, +<code><a href="write.json.html">write.json</a></code>, <code><a href="write.orc.html">write.orc</a></code>, +<code><a href="write.parquet.html">write.parquet</a></code>, <code><a href="write.text.html">write.text</a></code> +</p> + + +<h3>Examples</h3> + +<pre><code class="r">## Not run: +##D # Compute the average for all numeric columns grouped by department. +##D avg(groupBy(df, "department")) +##D +##D # Compute the max age and average salary, grouped by department and gender. +##D agg(groupBy(df, "department", "gender"), salary="avg", "age" -> "max") +## End(Not run) +</code></pre> + + +<hr /><div style="text-align: center;">[Package <em>SparkR</em> version 2.1.3 <a href="00Index.html">Index</a>]</div> +</body></html> http://git-wip-us.apache.org/repos/asf/spark-website/blob/3a634959/site/docs/2.1.3/api/R/hash.html ---------------------------------------------------------------------- diff --git a/site/docs/2.1.3/api/R/hash.html b/site/docs/2.1.3/api/R/hash.html new file mode 100644 index 0000000..279ff7e --- /dev/null +++ b/site/docs/2.1.3/api/R/hash.html @@ -0,0 +1,66 @@ +<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: hash</title> +<meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> +<link rel="stylesheet" type="text/css" href="R.css" /> + +<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/styles/github.min.css"> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/highlight.min.js"></script> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/languages/r.min.js"></script> +<script>hljs.initHighlightingOnLoad();</script> +</head><body> + +<table width="100%" summary="page for hash {SparkR}"><tr><td>hash {SparkR}</td><td style="text-align: right;">R Documentation</td></tr></table> + +<h2>hash</h2> + +<h3>Description</h3> + +<p>Calculates the hash code of given columns, and returns the result as a int column. +</p> + + +<h3>Usage</h3> + +<pre> +hash(x, ...) + +## S4 method for signature 'Column' +hash(x, ...) +</pre> + + +<h3>Arguments</h3> + +<table summary="R argblock"> +<tr valign="top"><td><code>x</code></td> +<td> +<p>Column to compute on.</p> +</td></tr> +<tr valign="top"><td><code>...</code></td> +<td> +<p>additional Column(s) to be included.</p> +</td></tr> +</table> + + +<h3>Note</h3> + +<p>hash since 2.0.0 +</p> + + +<h3>See Also</h3> + +<p>Other misc_funcs: <code><a href="crc32.html">crc32</a></code>, <code><a href="md5.html">md5</a></code>, +<code><a href="monotonically_increasing_id.html">monotonically_increasing_id</a></code>, +<code><a href="sha1.html">sha1</a></code>, <code><a href="sha2.html">sha2</a></code> +</p> + + +<h3>Examples</h3> + +<pre><code class="r">## Not run: hash(df$c) +</code></pre> + + +<hr /><div style="text-align: center;">[Package <em>SparkR</em> version 2.1.3 <a href="00Index.html">Index</a>]</div> +</body></html> http://git-wip-us.apache.org/repos/asf/spark-website/blob/3a634959/site/docs/2.1.3/api/R/hashCode.html ---------------------------------------------------------------------- diff --git a/site/docs/2.1.3/api/R/hashCode.html b/site/docs/2.1.3/api/R/hashCode.html new file mode 100644 index 0000000..9333003 --- /dev/null +++ b/site/docs/2.1.3/api/R/hashCode.html @@ -0,0 +1,68 @@ +<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: Compute the hashCode of an object</title> +<meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> +<link rel="stylesheet" type="text/css" href="R.css" /> + +<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/styles/github.min.css"> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/highlight.min.js"></script> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/languages/r.min.js"></script> +<script>hljs.initHighlightingOnLoad();</script> +</head><body> + +<table width="100%" summary="page for hashCode {SparkR}"><tr><td>hashCode {SparkR}</td><td style="text-align: right;">R Documentation</td></tr></table> + +<h2>Compute the hashCode of an object</h2> + +<h3>Description</h3> + +<p>Java-style function to compute the hashCode for the given object. Returns +an integer value. +</p> + + +<h3>Usage</h3> + +<pre> +hashCode(key) +</pre> + + +<h3>Arguments</h3> + +<table summary="R argblock"> +<tr valign="top"><td><code>key</code></td> +<td> +<p>the object to be hashed</p> +</td></tr> +</table> + + +<h3>Details</h3> + +<p>This only works for integer, numeric and character types right now. +</p> + + +<h3>Value</h3> + +<p>the hash code as an integer +</p> + + +<h3>Note</h3> + +<p>hashCode since 1.4.0 +</p> + + +<h3>Examples</h3> + +<pre><code class="r">## Not run: +##D hashCode(1L) # 1 +##D hashCode(1.0) # 1072693248 +##D hashCode("1") # 49 +## End(Not run) +</code></pre> + + +<hr /><div style="text-align: center;">[Package <em>SparkR</em> version 2.1.3 <a href="00Index.html">Index</a>]</div> +</body></html> http://git-wip-us.apache.org/repos/asf/spark-website/blob/3a634959/site/docs/2.1.3/api/R/head.html ---------------------------------------------------------------------- diff --git a/site/docs/2.1.3/api/R/head.html b/site/docs/2.1.3/api/R/head.html new file mode 100644 index 0000000..541ec50 --- /dev/null +++ b/site/docs/2.1.3/api/R/head.html @@ -0,0 +1,108 @@ +<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"><html xmlns="http://www.w3.org/1999/xhtml"><head><title>R: Head</title> +<meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> +<link rel="stylesheet" type="text/css" href="R.css" /> + +<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/styles/github.min.css"> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/highlight.min.js"></script> +<script src="https://cdnjs.cloudflare.com/ajax/libs/highlight.js/8.3/languages/r.min.js"></script> +<script>hljs.initHighlightingOnLoad();</script> +</head><body> + +<table width="100%" summary="page for head {SparkR}"><tr><td>head {SparkR}</td><td style="text-align: right;">R Documentation</td></tr></table> + +<h2>Head</h2> + +<h3>Description</h3> + +<p>Return the first <code>num</code> rows of a SparkDataFrame as a R data.frame. If <code>num</code> is not +specified, then head() returns the first 6 rows as with R data.frame. +</p> + + +<h3>Usage</h3> + +<pre> +## S4 method for signature 'SparkDataFrame' +head(x, num = 6L) +</pre> + + +<h3>Arguments</h3> + +<table summary="R argblock"> +<tr valign="top"><td><code>x</code></td> +<td> +<p>a SparkDataFrame.</p> +</td></tr> +<tr valign="top"><td><code>num</code></td> +<td> +<p>the number of rows to return. Default is 6.</p> +</td></tr> +</table> + + +<h3>Value</h3> + +<p>A data.frame. +</p> + + +<h3>Note</h3> + +<p>head since 1.4.0 +</p> + + +<h3>See Also</h3> + +<p>Other SparkDataFrame functions: <code><a href="SparkDataFrame.html">SparkDataFrame-class</a></code>, +<code><a href="summarize.html">agg</a></code>, <code><a href="arrange.html">arrange</a></code>, +<code><a href="as.data.frame.html">as.data.frame</a></code>, <code><a href="attach.html">attach</a></code>, +<code><a href="cache.html">cache</a></code>, <code><a href="coalesce.html">coalesce</a></code>, +<code><a href="collect.html">collect</a></code>, <code><a href="columns.html">colnames</a></code>, +<code><a href="coltypes.html">coltypes</a></code>, +<code><a href="createOrReplaceTempView.html">createOrReplaceTempView</a></code>, +<code><a href="crossJoin.html">crossJoin</a></code>, <code><a href="dapplyCollect.html">dapplyCollect</a></code>, +<code><a href="dapply.html">dapply</a></code>, <code><a href="summary.html">describe</a></code>, +<code><a href="dim.html">dim</a></code>, <code><a href="distinct.html">distinct</a></code>, +<code><a href="dropDuplicates.html">dropDuplicates</a></code>, <code><a href="nafunctions.html">dropna</a></code>, +<code><a href="drop.html">drop</a></code>, <code><a href="dtypes.html">dtypes</a></code>, +<code><a href="except.html">except</a></code>, <code><a href="explain.html">explain</a></code>, +<code><a href="filter.html">filter</a></code>, <code><a href="first.html">first</a></code>, +<code><a href="gapplyCollect.html">gapplyCollect</a></code>, <code><a href="gapply.html">gapply</a></code>, +<code><a href="getNumPartitions.html">getNumPartitions</a></code>, <code><a href="groupBy.html">group_by</a></code>, +<code><a href="histogram.html">histogram</a></code>, <code><a href="insertInto.html">insertInto</a></code>, +<code><a href="intersect.html">intersect</a></code>, <code><a href="isLocal.html">isLocal</a></code>, +<code><a href="join.html">join</a></code>, <code><a href="limit.html">limit</a></code>, +<code><a href="merge.html">merge</a></code>, <code><a href="mutate.html">mutate</a></code>, +<code><a href="ncol.html">ncol</a></code>, <code><a href="nrow.html">nrow</a></code>, +<code><a href="persist.html">persist</a></code>, <code><a href="printSchema.html">printSchema</a></code>, +<code><a href="randomSplit.html">randomSplit</a></code>, <code><a href="rbind.html">rbind</a></code>, +<code><a href="registerTempTable-deprecated.html">registerTempTable</a></code>, <code><a href="rename.html">rename</a></code>, +<code><a href="repartition.html">repartition</a></code>, <code><a href="sample.html">sample</a></code>, +<code><a href="saveAsTable.html">saveAsTable</a></code>, <code><a href="schema.html">schema</a></code>, +<code><a href="selectExpr.html">selectExpr</a></code>, <code><a href="select.html">select</a></code>, +<code><a href="showDF.html">showDF</a></code>, <code><a href="show.html">show</a></code>, +<code><a href="storageLevel.html">storageLevel</a></code>, <code><a href="str.html">str</a></code>, +<code><a href="subset.html">subset</a></code>, <code><a href="take.html">take</a></code>, +<code><a href="union.html">union</a></code>, <code><a href="unpersist.html">unpersist</a></code>, +<code><a href="withColumn.html">withColumn</a></code>, <code><a href="with.html">with</a></code>, +<code><a href="write.df.html">write.df</a></code>, <code><a href="write.jdbc.html">write.jdbc</a></code>, +<code><a href="write.json.html">write.json</a></code>, <code><a href="write.orc.html">write.orc</a></code>, +<code><a href="write.parquet.html">write.parquet</a></code>, <code><a href="write.text.html">write.text</a></code> +</p> + + +<h3>Examples</h3> + +<pre><code class="r">## Not run: +##D sparkR.session() +##D path <- "path/to/file.json" +##D df <- read.json(path) +##D head(df) +## End(Not run) +</code></pre> + + +<hr /><div style="text-align: center;">[Package <em>SparkR</em> version 2.1.3 <a href="00Index.html">Index</a>]</div> +</body></html> --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org