spark git commit: [SPARK-17499][SPARKR][ML][MLLIB] make the default params in sparkR spark.mlp consistent with MultilayerPerceptronClassifier

2016-09-23 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 90d575421 -> f89808b0f [SPARK-17499][SPARKR][ML][MLLIB] make the default params in sparkR spark.mlp consistent with MultilayerPerceptronClassifier ## What changes were proposed in this pull request? update `MultilayerPerceptronClassifierW

spark git commit: [SPARK-17210][SPARKR] sparkr.zip is not distributed to executors when running sparkr in RStudio

2016-09-23 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.0 d3f90e71a -> 1a8ea000e [SPARK-17210][SPARKR] sparkr.zip is not distributed to executors when running sparkr in RStudio ## What changes were proposed in this pull request? Spark will add sparkr.zip to archive only when it is yarn mode

spark git commit: [SPARK-17210][SPARKR] sparkr.zip is not distributed to executors when running sparkr in RStudio

2016-09-23 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master f89808b0f -> f62ddc598 [SPARK-17210][SPARKR] sparkr.zip is not distributed to executors when running sparkr in RStudio ## What changes were proposed in this pull request? Spark will add sparkr.zip to archive only when it is yarn mode (Sp

spark git commit: [SPARK-17499][SPARKR][FOLLOWUP] Check null first for layers in spark.mlp to avoid warnings in test results

2016-09-27 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master b03b4adf6 -> 4a8339568 [SPARK-17499][SPARKR][FOLLOWUP] Check null first for layers in spark.mlp to avoid warnings in test results ## What changes were proposed in this pull request? Some tests in `test_mllib.r` are as below: ```r expect_

spark git commit: [SPARK-17658][SPARKR] read.df/write.df API taking path optionally in SparkR

2016-10-04 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master a99743d05 -> c9fe10d4e [SPARK-17658][SPARKR] read.df/write.df API taking path optionally in SparkR ## What changes were proposed in this pull request? `write.df`/`read.df` API require path which is not actually always necessary in Spark.

spark git commit: [SPARK-17665][SPARKR] Support options/mode all for read/write APIs and options in other types

2016-10-07 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master bb1aaf28e -> 9d8ae853e [SPARK-17665][SPARKR] Support options/mode all for read/write APIs and options in other types ## What changes were proposed in this pull request? This PR includes the changes below: - Support `mode`/`options` in

spark git commit: [SPARK-17817][PYSPARK] PySpark RDD Repartitioning Results in Highly Skewed Partition Sizes

2016-10-11 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 75b9e3514 -> 07508bd01 [SPARK-17817][PYSPARK] PySpark RDD Repartitioning Results in Highly Skewed Partition Sizes ## What changes were proposed in this pull request? Quoted from JIRA description: Calling repartition on a PySpark RDD to i

spark git commit: [SPARK-17790][SPARKR] Support for parallelizing R data.frame larger than 2GB

2016-10-12 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master d5580ebaa -> 5cc503f4f [SPARK-17790][SPARKR] Support for parallelizing R data.frame larger than 2GB ## What changes were proposed in this pull request? If the R data structure that is being parallelized is larger than `INT_MAX` we use file

spark git commit: [SPARK-17790][SPARKR] Support for parallelizing R data.frame larger than 2GB

2016-10-12 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.0 5451541d1 -> d55ba3063 [SPARK-17790][SPARKR] Support for parallelizing R data.frame larger than 2GB ## What changes were proposed in this pull request? If the R data structure that is being parallelized is larger than `INT_MAX` we use

spark git commit: [SPARKR] fix warnings

2016-10-20 Thread felixcheung
ame Consider adding importFrom("utils", "object.size") to your NAMESPACE file. ``` ## How was this patch tested? unit tests Author: Felix Cheung Closes #15560 from felixcheung/rwarnings. (cherry picked from commit 3180272d2d49e440516085c0e4aebd5bad18bcad) Signed-off-by: Felix

spark git commit: [SPARKR] fix warnings

2016-10-20 Thread felixcheung
der adding importFrom("utils", "object.size") to your NAMESPACE file. ``` ## How was this patch tested? unit tests Author: Felix Cheung Closes #15560 from felixcheung/rwarnings. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org

spark git commit: [SPARK-17674][SPARKR] check for warning in test output

2016-10-21 Thread felixcheung
arn" = 2)`), we need to have a way to detect any new warning and fail ## How was this patch tested? manual testing, Jenkins Author: Felix Cheung Closes #15576 from felixcheung/rtestwarning. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/re

spark git commit: [SPARK-18013][SPARKR] add crossJoin API

2016-10-21 Thread felixcheung
hor: Felix Cheung Closes #15559 from felixcheung/rcrossjoin. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/e21e1c94 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/e21e1c94 Diff: http://git-wip-us.apache.org/repos/

spark git commit: [SPARK-17811] SparkR cannot parallelize data.frame with NA or NULL in Date columns

2016-10-21 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master e21e1c946 -> e371040a0 [SPARK-17811] SparkR cannot parallelize data.frame with NA or NULL in Date columns ## What changes were proposed in this pull request? NA date values are serialized as "NA" and NA time values are serialized as NaN f

spark git commit: [SPARK-17811] SparkR cannot parallelize data.frame with NA or NULL in Date columns

2016-10-21 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.0 a65d40ab6 -> 78458a7eb [SPARK-17811] SparkR cannot parallelize data.frame with NA or NULL in Date columns ## What changes were proposed in this pull request? NA date values are serialized as "NA" and NA time values are serialized as Na

spark git commit: [SPARKR][BRANCH-2.0] R merge API doc and example fix

2016-10-23 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.0 e21e9d416 -> 0e0d83a59 [SPARKR][BRANCH-2.0] R merge API doc and example fix ## What changes were proposed in this pull request? Fixes for R doc ## How was this patch tested? N/A Author: Felix Cheung Closes #15589 from felixche

spark git commit: [SPARKR][BRANCH-2.0] R merge API doc and example fix

2016-10-23 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master a81fba048 -> 3a423f5a0 [SPARKR][BRANCH-2.0] R merge API doc and example fix ## What changes were proposed in this pull request? Fixes for R doc ## How was this patch tested? N/A Author: Felix Cheung Closes #15589 from felixche

spark git commit: [SPARK-18007][SPARKR][ML] update SparkR MLP - add initalWeights parameter

2016-10-25 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master c329a568b -> 12b3e8d2e [SPARK-18007][SPARKR][ML] update SparkR MLP - add initalWeights parameter ## What changes were proposed in this pull request? update SparkR MLP, add initalWeights parameter. ## How was this patch tested? test added

spark git commit: [SPARK-17961][SPARKR][SQL] Add storageLevel to DataFrame for SparkR

2016-10-26 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master ea3605e82 -> fb0a8a8dd [SPARK-17961][SPARKR][SQL] Add storageLevel to DataFrame for SparkR ## What changes were proposed in this pull request? Add storageLevel to DataFrame for SparkR. This is similar to this RP: https://github.com/apache

spark git commit: [SPARK-17157][SPARKR] Add multiclass logistic regression SparkR Wrapper

2016-10-26 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 5b7d403c1 -> 29cea8f33 [SPARK-17157][SPARKR] Add multiclass logistic regression SparkR Wrapper ## What changes were proposed in this pull request? As we discussed in #14818, I added a separate R wrapper spark.logit for logistic regression

spark git commit: [SPARK-17157][SPARKR][FOLLOW-UP] doc fixes

2016-10-26 Thread felixcheung
ung Closes #15650 from felixcheung/logitfix. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/1dbe9896 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/1dbe9896 Diff: http://git-wip-us.apache.org/repos/asf/spark/d

spark git commit: [SQL][DOC] updating doc for JSON source to link to jsonlines.org

2016-10-26 Thread felixcheung
ted? manual test Author: Felix Cheung Closes #15629 from felixcheung/jsondoc. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/44c8bfda Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/44c8bfda Diff: http://git-

spark git commit: [SPARK-17919] Make timeout to RBackend configurable in SparkR

2016-10-30 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 8ae2da0b2 -> 2881a2d1d [SPARK-17919] Make timeout to RBackend configurable in SparkR ## What changes were proposed in this pull request? This patch makes RBackend connection timeout configurable by user. ## How was this patch tested? N/A

spark git commit: [SPARK-16137][SPARKR] randomForest for R

2016-10-30 Thread felixcheung
ual tests, unit tests Author: Felix Cheung Closes #15607 from felixcheung/rrandomforest. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/b6879b8b Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/b6879b8b Diff: http://

spark git commit: [SPARK-18110][PYTHON][ML] add missing parameter in Python for RandomForest regression and classification

2016-10-30 Thread felixcheung
Add varianceCol to randomForestRegressor In Python ## How was this patch tested? manual tests Author: Felix Cheung Closes #15638 from felixcheung/pyrandomforest. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/7c378692 Tree: h

spark git commit: [SPARK-17838][SPARKR] Check named arguments for options and use formatted R friendly message from JVM exception message

2016-11-01 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master ad4832a9f -> 1ecfafa08 [SPARK-17838][SPARKR] Check named arguments for options and use formatted R friendly message from JVM exception message ## What changes were proposed in this pull request? This PR proposes to - improve the R-friendl

spark git commit: [SPARKR][TEST] remove unnecessary suppressWarnings

2016-11-03 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 67659c9af -> e89202523 [SPARKR][TEST] remove unnecessary suppressWarnings ## What changes were proposed in this pull request? In test_mllib.R, there are two unnecessary suppressWarnings. This PR just removes them. ## How was this patch t

spark git commit: [SPARK-14393][SQL][DOC] update doc for python and R

2016-11-03 Thread felixcheung
: Felix Cheung Closes #15747 from felixcheung/pySPARK-14393. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/a08463b1 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/a08463b1 Diff: http://git-wip-us.apache.org/repos/as

spark git commit: [SPARK-14393][SQL][DOC] update doc for python and R

2016-11-03 Thread felixcheung
Author: Felix Cheung Closes #15747 from felixcheung/pySPARK-14393. (cherry picked from commit a08463b1d32348a81d0f148dfaf22741d5c23b1a) Signed-off-by: Felix Cheung Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/8e145a94 Tre

spark git commit: [MINOR][DOCUMENTATION] Fix some minor descriptions in functions consistently with expressions

2016-11-05 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.1 dcbc42658 -> e9f1d4aaa [MINOR][DOCUMENTATION] Fix some minor descriptions in functions consistently with expressions ## What changes were proposed in this pull request? This PR proposes to improve documentation and fix some descriptio

spark git commit: [MINOR][DOCUMENTATION] Fix some minor descriptions in functions consistently with expressions

2016-11-05 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 9a87c3138 -> 15d392688 [MINOR][DOCUMENTATION] Fix some minor descriptions in functions consistently with expressions ## What changes were proposed in this pull request? This PR proposes to improve documentation and fix some descriptions e

spark git commit: [SPARK-18239][SPARKR] Gradient Boosted Tree for R

2016-11-08 Thread felixcheung
ely isolated I'd like to target this for branch-2.1 ## How was this patch tested? manual tests, unit tests Author: Felix Cheung Closes #15746 from felixcheung/rgbt. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/55964c15 Tr

spark git commit: [SPARK-18239][SPARKR] Gradient Boosted Tree for R

2016-11-08 Thread felixcheung
ely isolated I'd like to target this for branch-2.1 ## How was this patch tested? manual tests, unit tests Author: Felix Cheung Closes #15746 from felixcheung/rgbt. (cherry picked from commit 55964c15a7b639f920dfe6c104ae4fdcd673705c) Signed-off-by: Felix Cheung Project: http:/

spark git commit: [SPARK-18349][SPARKR] Update R API documentation on ml model summary

2016-12-08 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 4ac8b20bf -> 86a96034c [SPARK-18349][SPARKR] Update R API documentation on ml model summary ## What changes were proposed in this pull request? In this PR, the document of `summary` method is improved in the format: returns summary informa

spark git commit: [SPARK-18349][SPARKR] Update R API documentation on ml model summary

2016-12-08 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.1 ef5646b4c -> 4ceed95b4 [SPARK-18349][SPARKR] Update R API documentation on ml model summary ## What changes were proposed in this pull request? In this PR, the document of `summary` method is improved in the format: returns summary inf

spark git commit: [SPARK-18865][SPARKR] SparkR vignettes MLP and LDA updates

2016-12-14 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master ffdd1fcd1 -> 324388531 [SPARK-18865][SPARKR] SparkR vignettes MLP and LDA updates ## What changes were proposed in this pull request? When do the QA work, I found that the following issues: 1). `spark.mlp` doesn't include an example; 2).

spark git commit: [SPARK-18865][SPARKR] SparkR vignettes MLP and LDA updates

2016-12-14 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.1 280c35af9 -> 0d94201e0 [SPARK-18865][SPARKR] SparkR vignettes MLP and LDA updates ## What changes were proposed in this pull request? When do the QA work, I found that the following issues: 1). `spark.mlp` doesn't include an example;

spark git commit: [SPARK-18849][ML][SPARKR][DOC] vignettes final check reorg

2016-12-17 Thread felixcheung
.io/sparkr-vignettes.html Previous: https://felixcheung.github.io/sparkr-vignettes_old.html Author: Felix Cheung Closes #16301 from felixcheung/rvignettespass2. (cherry picked from commit 38fd163d0d2c44128bf8872d297b79edd7bd4137) Signed-off-by: Felix Cheung Project: http://git-wip-us.apache.

spark git commit: [SPARK-18849][ML][SPARKR][DOC] vignettes final check reorg

2016-12-17 Thread felixcheung
rkr-vignettes.html Previous: https://felixcheung.github.io/sparkr-vignettes_old.html Author: Felix Cheung Closes #16301 from felixcheung/rvignettespass2. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/38fd163d Tree: http://

spark git commit: [SPARK-18903][SPARKR] Add API to get SparkUI URL

2016-12-21 Thread felixcheung
ung Closes #16367 from felixcheung/rwebui. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/7e8994ff Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/7e8994ff Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/7e899

spark git commit: [BUILD] make-distribution should find JAVA_HOME for non-RHEL systems

2016-12-21 Thread felixcheung
How was this patch tested? Manually Author: Felix Cheung Closes #16363 from felixcheung/buildjava. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/e1b43dc4 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/e1b43dc4 D

spark git commit: [DOC][BUILD][MINOR] add doc on new make-distribution switches

2016-12-27 Thread felixcheung
his patch tested? Doc only Author: Felix Cheung Closes #16364 from felixcheung/buildguide. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/2af8b5cf Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/2af8b5cf Diff: h

spark git commit: [SPARK-19126][DOCS] Update Join Documentation Across Languages

2017-01-08 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.1 8690d4bd1 -> 8779e6a46 [SPARK-19126][DOCS] Update Join Documentation Across Languages ## What changes were proposed in this pull request? - [X] Make sure all join types are clearly mentioned - [X] Make join labeling/style consistent -

spark git commit: [SPARK-19126][DOCS] Update Join Documentation Across Languages

2017-01-08 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 1f6ded645 -> 19d9d4c85 [SPARK-19126][DOCS] Update Join Documentation Across Languages ## What changes were proposed in this pull request? - [X] Make sure all join types are clearly mentioned - [X] Make join labeling/style consistent - [X]

spark git commit: [SPARK-18903][SPARKR][BACKPORT-2.1] Add API to get SparkUI URL

2017-01-08 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.1 8779e6a46 -> 80a3e13e5 [SPARK-18903][SPARKR][BACKPORT-2.1] Add API to get SparkUI URL ## What changes were proposed in this pull request? backport to 2.1 Author: Felix Cheung Closes #16507 from felixcheung/portsparkuir21. Proj

spark git commit: [SPARK-19133][SPARKR][ML] fix glm for Gamma, clarify glm family supported

2017-01-10 Thread felixcheung
ted? manual Author: Felix Cheung Closes #16511 from felixcheung/rdocglmfamily. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/9bc3507e Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/9bc3507e Diff: http://git-

spark git commit: [SPARK-19133][SPARKR][ML][BACKPORT-2.1] fix glm for Gamma, clarify glm family supported

2017-01-10 Thread felixcheung
nit tests Author: Felix Cheung Closes #16532 from felixcheung/rgammabackport. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/1022049c Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/1022049c Diff: http://git-

spark git commit: [SPARK-20196][PYTHON][SQL] update doc for catalog functions for all languages, add pyspark refreshByPath API

2017-04-06 Thread felixcheung
add refreshByPath in python ## How was this patch tested? manual Author: Felix Cheung Closes #17512 from felixcheung/catalogdoc. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/bccc3301 Tree: http://git-wip-us.apache.org/re

spark git commit: [SPARK-20195][SPARKR][SQL] add createTable catalog API and deprecate createExternalTable

2017-04-06 Thread felixcheung
and deprecate createExternalTable, plus a number of minor fixes ## How was this patch tested? manual, unit tests Author: Felix Cheung Closes #17511 from felixcheung/rceatetable. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/com

spark git commit: [SPARK-20026][DOC][SPARKR] Add Tweedie example for SparkR in programming guide

2017-04-07 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 9e0893b53 -> 870b9d9aa [SPARK-20026][DOC][SPARKR] Add Tweedie example for SparkR in programming guide ## What changes were proposed in this pull request? Add Tweedie example for SparkR in programming guide. The doc was already updated in #1

spark git commit: [SPARK-20197][SPARKR] CRAN check fail with package installation

2017-04-07 Thread felixcheung
ses #17516 from felixcheung/rdircheckincran. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/8feb799a Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/8feb799a Diff: http://git-wip-us.apache.org/repos/asf/spark/d

spark git commit: [SPARK-20258][DOC][SPARKR] Fix SparkR logistic regression example in programming guide (did not converge)

2017-04-07 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 8feb799af -> 1ad73f0a2 [SPARK-20258][DOC][SPARKR] Fix SparkR logistic regression example in programming guide (did not converge) ## What changes were proposed in this pull request? SparkR logistic regression example did not converge in pr

spark git commit: [SPARK-20278][R] Disable 'multiple_dots_linter' lint rule that is against project's code style

2017-04-16 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master ad935f526 -> 86d251c58 [SPARK-20278][R] Disable 'multiple_dots_linter' lint rule that is against project's code style ## What changes were proposed in this pull request? Currently, multi-dot separated variables in R is not allowed. For ex

spark git commit: [SPARK-19828][R][FOLLOWUP] Rename asJsonArray to as.json.array in from_json function in R

2017-04-17 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 86d251c58 -> 24f09b39c [SPARK-19828][R][FOLLOWUP] Rename asJsonArray to as.json.array in from_json function in R ## What changes were proposed in this pull request? This was suggested to be `as.json.array` at the first place in the PR to

spark git commit: [SPARK-17647][SQL][FOLLOWUP][MINOR] fix typo

2017-04-17 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 33ea908af -> b0a1e93e9 [SPARK-17647][SQL][FOLLOWUP][MINOR] fix typo ## What changes were proposed in this pull request? fix typo ## How was this patch tested? manual Author: Felix Cheung Closes #17663 from felixcheung/likedoct

spark git commit: [SPARK-17647][SQL][FOLLOWUP][MINOR] fix typo

2017-04-18 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.1 3808b4728 -> a4c1ebc1d [SPARK-17647][SQL][FOLLOWUP][MINOR] fix typo ## What changes were proposed in this pull request? fix typo ## How was this patch tested? manual Author: Felix Cheung Closes #17663 from felixcheung/likedoct

spark git commit: [SPARK-20208][R][DOCS] Document R fpGrowth support

2017-04-18 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master e468a96c4 -> 702d85af2 [SPARK-20208][R][DOCS] Document R fpGrowth support ## What changes were proposed in this pull request? Document fpGrowth in: - vignettes - programming guide - code example ## How was this patch tested? Manual tes

spark git commit: [SPARK-20208][R][DOCS] Document R fpGrowth support

2017-04-18 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.2 a33d44805 -> ef6923f7e [SPARK-20208][R][DOCS] Document R fpGrowth support ## What changes were proposed in this pull request? Document fpGrowth in: - vignettes - programming guide - code example ## How was this patch tested? Manual

spark git commit: [SPARK-20375][R] R wrappers for array and map

2017-04-19 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master bdc605691 -> 46c574976 [SPARK-20375][R] R wrappers for array and map ## What changes were proposed in this pull request? Adds wrappers for `o.a.s.sql.functions.array` and `o.a.s.sql.functions.map` ## How was this patch tested? Unit tests

spark git commit: [SPARK-20371][R] Add wrappers for collect_list and collect_set

2017-04-21 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master eb00378f0 -> fd648bff6 [SPARK-20371][R] Add wrappers for collect_list and collect_set ## What changes were proposed in this pull request? Adds wrappers for `collect_list` and `collect_set`. ## How was this patch tested? Unit tests, `chec

spark git commit: [SPARK-20438][R] SparkR wrappers for split and repeat

2017-04-24 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 90264aced -> 8a272ddc9 [SPARK-20438][R] SparkR wrappers for split and repeat ## What changes were proposed in this pull request? Add wrappers for `o.a.s.sql.functions`: - `split` as `split_string` - `repeat` as `repeat_string` ## How was

spark git commit: [SPARK-5484][GRAPHX] Periodically do checkpoint in Pregel

2017-04-25 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 67eef47ac -> 0a7f5f279 [SPARK-5484][GRAPHX] Periodically do checkpoint in Pregel ## What changes were proposed in this pull request? Pregel-based iterative algorithms with more than ~50 iterations begin to slow down and eventually fail wi

spark git commit: [SPARK-5484][GRAPHX] Periodically do checkpoint in Pregel

2017-04-25 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.2 55834a898 -> f971ce5dd [SPARK-5484][GRAPHX] Periodically do checkpoint in Pregel ## What changes were proposed in this pull request? Pregel-based iterative algorithms with more than ~50 iterations begin to slow down and eventually fai

spark git commit: [SPARK-20437][R] R wrappers for rollup and cube

2017-04-25 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 57e1da394 -> df58a95a3 [SPARK-20437][R] R wrappers for rollup and cube ## What changes were proposed in this pull request? - Add `rollup` and `cube` methods and corresponding generics. - Add short description to the vignette. ## How was t

spark git commit: [DOCS][MINOR] Add missing since to SparkR repeat_string note.

2017-04-27 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master b4724db19 -> b58cf77c4 [DOCS][MINOR] Add missing since to SparkR repeat_string note. ## What changes were proposed in this pull request? Replace note repeat_string 2.3.0 with note repeat_string since 2.3.0 ## How was this patch

spark git commit: [SPARK-20208][DOCS][FOLLOW-UP] Add FP-Growth to SparkR programming guide

2017-04-27 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master b58cf77c4 -> ba7666274 [SPARK-20208][DOCS][FOLLOW-UP] Add FP-Growth to SparkR programming guide ## What changes were proposed in this pull request? Add `spark.fpGrowth` to SparkR programming guide. ## How was this patch tested? Manual te

spark git commit: [SPARKR][DOC] Document LinearSVC in R programming guide

2017-04-27 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master b90bf520f -> 7fe824979 [SPARKR][DOC] Document LinearSVC in R programming guide ## What changes were proposed in this pull request? add link to svmLinear in the SparkR programming document. ## How was this patch tested? Build doc manually

spark git commit: [SPARKR][DOC] Document LinearSVC in R programming guide

2017-04-27 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.2 e02b6ebfd -> f60ed0c2c [SPARKR][DOC] Document LinearSVC in R programming guide ## What changes were proposed in this pull request? add link to svmLinear in the SparkR programming document. ## How was this patch tested? Build doc manu

spark git commit: [SPARK-20477][SPARKR][DOC] Document R bisecting k-means in R programming guide

2017-04-29 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 814a61a86 -> b28c3bc20 [SPARK-20477][SPARKR][DOC] Document R bisecting k-means in R programming guide ## What changes were proposed in this pull request? Add hyper link in the SparkR programming guide. ## How was this patch tested? Build

spark git commit: [SPARK-20477][SPARKR][DOC] Document R bisecting k-means in R programming guide

2017-04-29 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.2 ca6c59e7e -> 4a86d8db4 [SPARK-20477][SPARKR][DOC] Document R bisecting k-means in R programming guide ## What changes were proposed in this pull request? Add hyper link in the SparkR programming guide. ## How was this patch tested? B

spark git commit: [SPARK-19791][ML] Add doc and example for fpgrowth

2017-04-29 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master b28c3bc20 -> add9d1bba [SPARK-19791][ML] Add doc and example for fpgrowth ## What changes were proposed in this pull request? Add a new section for fpm Add Example for FPGrowth in scala and Java updated: Rewrite transform to be more compa

spark git commit: [SPARK-19791][ML] Add doc and example for fpgrowth

2017-04-29 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.2 4a86d8db4 -> 9789d5c57 [SPARK-19791][ML] Add doc and example for fpgrowth ## What changes were proposed in this pull request? Add a new section for fpm Add Example for FPGrowth in scala and Java updated: Rewrite transform to be more c

spark git commit: [SPARK-20533][SPARKR] SparkR Wrappers Model should be private and value should be lazy

2017-04-29 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master add9d1bba -> ee694cdff [SPARK-20533][SPARKR] SparkR Wrappers Model should be private and value should be lazy ## What changes were proposed in this pull request? MultilayerPerceptronClassifierWrapper model should be private. LogisticRegre

spark git commit: [SPARK-20493][R] De-duplicate parse logics for DDL-like type strings in R

2017-04-29 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master ee694cdff -> 70f1bcd7b [SPARK-20493][R] De-duplicate parse logics for DDL-like type strings in R ## What changes were proposed in this pull request? It seems we are using `SQLUtils.getSQLDataType` for type string in structField. It looks

spark git commit: [SPARK-20535][SPARKR] R wrappers for explode_outer and posexplode_outer

2017-04-30 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 1ee494d08 -> ae3df4e98 [SPARK-20535][SPARKR] R wrappers for explode_outer and posexplode_outer ## What changes were proposed in this pull request? Ad R wrappers for - `o.a.s.sql.functions.explode_outer` - `o.a.s.sql.functions.posexplode_o

spark git commit: [MINOR][DOCS][PYTHON] Adding missing boolean type for replacement value in fillna

2017-04-30 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master ae3df4e98 -> 6613046c8 [MINOR][DOCS][PYTHON] Adding missing boolean type for replacement value in fillna ## What changes were proposed in this pull request? Currently pyspark Dataframe.fillna API supports boolean type when we pass dict,

spark git commit: [MINOR][DOCS][PYTHON] Adding missing boolean type for replacement value in fillna

2017-04-30 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.2 c5beabcbd -> 994d9da90 [MINOR][DOCS][PYTHON] Adding missing boolean type for replacement value in fillna ## What changes were proposed in this pull request? Currently pyspark Dataframe.fillna API supports boolean type when we pass dic

spark git commit: [SPARK-20490][SPARKR] Add R wrappers for eqNullSafe and ! / not

2017-04-30 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 6613046c8 -> 80e9cf1b5 [SPARK-20490][SPARKR] Add R wrappers for eqNullSafe and ! / not ## What changes were proposed in this pull request? - Add null-safe equality operator `%<=>%` (sames as `o.a.s.sql.Column.eqNullSafe`, `o.a.s.sql.Colum

spark git commit: [SPARK-20541][SPARKR][SS] support awaitTermination without timeout

2017-04-30 Thread felixcheung
eed this for 2.2 ## How was this patch tested? manually, unit test Author: Felix Cheung Closes #17815 from felixcheung/rssawaitinfinite. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/a355b667 Tree: http://git-wip-us.apache.

spark git commit: [SPARK-20541][SPARKR][SS] support awaitTermination without timeout

2017-04-30 Thread felixcheung
eed this for 2.2 ## How was this patch tested? manually, unit test Author: Felix Cheung Closes #17815 from felixcheung/rssawaitinfinite. (cherry picked from commit a355b667a3718d9c5d48a0781e836bf5418ab842) Signed-off-by: Felix Cheung Project: http://git-wip-us.apache.org/repos/asf/spark/r

spark git commit: [SPARK-20192][SPARKR][DOC] SparkR migration guide to 2.2.0

2017-05-01 Thread felixcheung
ses #17816 from felixcheung/r22relnote. (cherry picked from commit d20a976e8918ca8d607af452301e8014fe14e64a) Signed-off-by: Felix Cheung Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/b7c1c2f9 Tree: http://git-wip-us.apache.

spark git commit: [SPARK-20192][SPARKR][DOC] SparkR migration guide to 2.2.0

2017-05-01 Thread felixcheung
ses #17816 from felixcheung/r22relnote. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/d20a976e Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/d20a976e Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/d20a976e Bra

spark git commit: [SPARK-20532][SPARKR] Implement grouping and grouping_id

2017-05-01 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master d20a976e8 -> 90d77e971 [SPARK-20532][SPARKR] Implement grouping and grouping_id ## What changes were proposed in this pull request? Adds R wrappers for: - `o.a.s.sql.functions.grouping` as `o.a.s.sql.functions.is_grouping` (to avoid shad

spark git commit: [SPARK-20490][SPARKR][DOC] add family tag for not function

2017-05-02 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master b1e639ab0 -> 13f47dc50 [SPARK-20490][SPARKR][DOC] add family tag for not function ## What changes were proposed in this pull request? doc only ## How was this patch tested? manual Author: Felix Cheung Closes #17828 from felixche

spark git commit: [SPARK-20543][SPARKR] skip tests when running on CRAN

2017-05-03 Thread felixcheung
ing, hivecontext) - tests on error conditions - tests won't likely change/break ## How was this patch tested? unit tests, `R CMD check --as-cran`, `R CMD check` Author: Felix Cheung Closes #17817 from felixcheung/rskiptest. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http:/

spark git commit: [SPARK-20544][SPARKR] skip tests when running on CRAN

2017-05-03 Thread felixcheung
n't likely change/break unit tests, `R CMD check --as-cran`, `R CMD check` Author: Felix Cheung Closes #17817 from felixcheung/rskiptest. (cherry picked from commit fc472bddd1d9c6a28e57e31496c0166777af597e) Signed-off-by: Felix Cheung Project: http://git-wip-us.apache.org/repos/asf/spark/rep

spark git commit: [SPARK-20015][SPARKR][SS][DOC][EXAMPLE] Document R Structured Streaming (experimental) in R vignettes and R & SS programming guide, R example

2017-05-04 Thread felixcheung
gnettes - R programming guide - SS programming guide - R example Also disable spark.als in vignettes for now since it's failing (SPARK-20402) ## How was this patch tested? manually Author: Felix Cheung Closes #17814 from felixcheung/rdocss. Project: http://git-wip-us.apache.org/repos/as

spark git commit: [SPARK-20015][SPARKR][SS][DOC][EXAMPLE] Document R Structured Streaming (experimental) in R vignettes and R & SS programming guide, R example

2017-05-04 Thread felixcheung
ide - R example Also disable spark.als in vignettes for now since it's failing (SPARK-20402) manually Author: Felix Cheung Closes #17814 from felixcheung/rdocss. (cherry picked from commit b8302ccd02265f9d7a7895c7b033441fa2d8ffd1) Signed-off-by: Felix Cheung Project: http://git-wip-u

spark git commit: [SPARK-20585][SPARKR] R generic hint support

2017-05-04 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master b8302ccd0 -> 9c36aa279 [SPARK-20585][SPARKR] R generic hint support ## What changes were proposed in this pull request? Adds support for generic hints on `SparkDataFrame` ## How was this patch tested? Unit tests, `check-cran.sh` Author:

spark git commit: [SPARK-20585][SPARKR] R generic hint support

2017-05-04 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.2 6c5c594b7 -> 3f5c54812 [SPARK-20585][SPARKR] R generic hint support Adds support for generic hints on `SparkDataFrame` Unit tests, `check-cran.sh` Author: zero323 Closes #17851 from zero323/SPARK-20585. (cherry picked from commit 9

spark git commit: [SPARK-20544][SPARKR] R wrapper for input_file_name

2017-05-04 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 9c36aa279 -> f21897fc1 [SPARK-20544][SPARKR] R wrapper for input_file_name ## What changes were proposed in this pull request? Adds wrapper for `o.a.s.sql.functions.input_file_name` ## How was this patch tested? Existing unit tests, addi

spark git commit: [SPARK-20571][SPARKR][SS] Flaky Structured Streaming tests

2017-05-04 Thread felixcheung
ely the flakiness from processing delay when Jenkins is hard to account for. This isn't an actual public API supported ## How was this patch tested? unit tests Author: Felix Cheung Closes #17857 from felixcheung/rsstestrelia. Project: http://git-wip-us.apache.org/repos/asf/spark/rep

spark git commit: [SPARK-20571][SPARKR][SS] Flaky Structured Streaming tests

2017-05-04 Thread felixcheung
but ultimately the flakiness from processing delay when Jenkins is hard to account for. This isn't an actual public API supported ## How was this patch tested? unit tests Author: Felix Cheung Closes #17857 from felixcheung/rsstestrelia. (cherry picked fro

spark git commit: [SPARK-20613] Remove excess quotes in Windows executable

2017-05-05 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 9064f1b04 -> b9ad2d191 [SPARK-20613] Remove excess quotes in Windows executable ## What changes were proposed in this pull request? Quotes are already added to the RUNNER variable on line 54. There is no need to put quotes on line 67. If

spark git commit: [SPARK-20613] Remove excess quotes in Windows executable

2017-05-05 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.2 dbb54a7b3 -> 1fa3c86a7 [SPARK-20613] Remove excess quotes in Windows executable ## What changes were proposed in this pull request? Quotes are already added to the RUNNER variable on line 54. There is no need to put quotes on line 67.

spark git commit: [SPARK-20613] Remove excess quotes in Windows executable

2017-05-05 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.1 179f5370e -> 2a7f5dae5 [SPARK-20613] Remove excess quotes in Windows executable ## What changes were proposed in this pull request? Quotes are already added to the RUNNER variable on line 54. There is no need to put quotes on line 67.

spark git commit: [SPARK-20208][DOCS][FOLLOW-UP] Add FP-Growth to SparkR programming guide

2017-05-05 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.2 1d9b7a74a -> 423a78625 [SPARK-20208][DOCS][FOLLOW-UP] Add FP-Growth to SparkR programming guide ## What changes were proposed in this pull request? Add `spark.fpGrowth` to SparkR programming guide. ## How was this patch tested? Manua

spark git commit: [SPARK-20614][PROJECT INFRA] Use the same log4j configuration with Jenkins in AppVeyor

2017-05-05 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 5d75b14bf -> b433acae7 [SPARK-20614][PROJECT INFRA] Use the same log4j configuration with Jenkins in AppVeyor ## What changes were proposed in this pull request? Currently, there are flooding logs in AppVeyor (in the console). This has be

spark git commit: [SPARK-20543][SPARKR][FOLLOWUP] Don't skip tests on AppVeyor

2017-05-07 Thread felixcheung
x Cheung Closes #17878 from felixcheung/appveyorrcran. (cherry picked from commit 7087e01194964a1aad0b45bdb41506a17100eacf) Signed-off-by: Felix Cheung Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/048e9890 Tree: http:/

spark git commit: [SPARK-20543][SPARKR][FOLLOWUP] Don't skip tests on AppVeyor

2017-05-07 Thread felixcheung
Closes #17878 from felixcheung/appveyorrcran. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/7087e011 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/7087e011 Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/

  1   2   3   4   >