spark git commit: [MINOR][SPARKR] R API documentation for "coltypes" is confusing

2016-08-10 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 9dc3e602d -> 1203c8415 [MINOR][SPARKR] R API documentation for "coltypes" is confusing ## What changes were proposed in this pull request? R API documentation for "coltypes" is confusing, found when working on another ticket. Current

spark git commit: [SPARK-16444][SPARKR] Isotonic Regression wrapper in SparkR

2016-08-17 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 4d0cc84af -> 363793f2b [SPARK-16444][SPARKR] Isotonic Regression wrapper in SparkR ## What changes were proposed in this pull request? Add Isotonic Regression wrapper in SparkR Wrappers in R

spark git commit: [SPARK-19386][SPARKR][DOC] Bisecting k-means in SparkR documentation

2017-02-03 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 2f523fa0c -> 48aafeda7 [SPARK-19386][SPARKR][DOC] Bisecting k-means in SparkR documentation ## What changes were proposed in this pull request? Update programming guide, example and vignette with Bisecting k-means. Author: krishnakalyan3

spark git commit: [SPARK-19319][SPARKR] SparkR Kmeans summary returns error when the cluster size doesn't equal to k

2017-01-31 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 906383580 -> 9ac05225e [SPARK-19319][SPARKR] SparkR Kmeans summary returns error when the cluster size doesn't equal to k ## What changes were proposed in this pull request? When Kmeans uses initMode = "random" and some random seed, it

spark git commit: [SPARK-19395][SPARKR] Convert coefficients in summary to matrix

2017-01-31 Thread felixcheung
int [1:2] 2 4 - attr(*, "dimnames")=List of 2 ..$ : chr [1:2] "(Intercept)" "Sepal_Width" ..$ : chr [1:4] "Estimate" "Std. Error" "t value" "Pr(>|t|)" > s$coefficients[, 2] $`(Intercept)` [1] 0.4788963 $Sepal_W

spark git commit: [SPARK-18823][SPARKR] add support for assigning to column

2017-01-24 Thread felixcheung
? manual tests, unit tests Author: Felix Cheung <felixcheun...@hotmail.com> Closes #16663 from felixcheung/rcolset. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/f27e0247 Tree: http://git-wip-us.apache.org/repos/asf/spar

spark git commit: [SPARK-19333][SPARKR] Add Apache License headers to R files

2017-01-27 Thread felixcheung
rly Author: Felix Cheung <felixcheun...@hotmail.com> Closes #16709 from felixcheung/rfilelicense. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/385d7384 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/385d7384 D

spark git commit: [SPARK-19333][SPARKR] Add Apache License headers to R files

2017-01-27 Thread felixcheung
rly Author: Felix Cheung <felixcheun...@hotmail.com> Closes #16709 from felixcheung/rfilelicense. (cherry picked from commit 385d73848b0d274467b633c7615e03b370f4a634) Signed-off-by: Felix Cheung <felixche...@apache.org> Project: http://git-wip-us.apache.org/repos/asf/spark/repo Com

spark git commit: [SPARKR][DOCS] update R API doc for subset/extract

2017-01-30 Thread felixcheung
in doc. Also a few minor updates: examples, links. ## How was this patch tested? manual Author: Felix Cheung <felixcheun...@hotmail.com> Closes #16721 from felixcheung/rsubsetdoc. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/

spark git commit: [BACKPORT-2.1][SPARKR][DOCS] update R API doc for subset/extract

2017-01-30 Thread felixcheung
ung <felixcheun...@hotmail.com> Closes #16749 from felixcheung/rsubsetdocbackport. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/e43f161b Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/e43f161b Diff: http:

spark git commit: [SPARK-18821][SPARKR] Bisecting k-means wrapper in SparkR

2017-01-26 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 1191fe267 -> c0ba28430 [SPARK-18821][SPARKR] Bisecting k-means wrapper in SparkR ## What changes were proposed in this pull request? Add R wrapper for bisecting Kmeans. As JIRA is down, I will update title to link with corresponding JIRA

spark git commit: [SPARK-18788][SPARKR] Add API for getNumPartitions

2017-01-26 Thread felixcheung
hor: Felix Cheung <felixcheun...@hotmail.com> Closes #16668 from felixcheung/rgetnumpartitions. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/90817a6c Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/90817a6c D

spark git commit: [SPARK-18788][SPARKR] Add API for getNumPartitions

2017-01-26 Thread felixcheung
sts Author: Felix Cheung <felixcheun...@hotmail.com> Closes #16668 from felixcheung/rgetnumpartitions. (cherry picked from commit 90817a6cd06068fa9f9ff77384a1fcba73b43006) Signed-off-by: Felix Cheung <felixche...@apache.org> Project: http://git-wip-us.apache.org/repos/asf/spark/repo Com

spark git commit: [SPARK-19399][SPARKR][BACKPORT-2.1] fix tests broken by merge

2017-02-15 Thread felixcheung
lix Cheung <felixcheun...@hotmail.com> Closes #16950 from felixcheung/fixrtest. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/252dd05f Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/252dd05f Diff: http:

spark git commit: [SPARKR][EXAMPLES] update examples to stop spark session

2017-02-17 Thread felixcheung
xcheun...@hotmail.com> Closes #16973 from felixcheung/rexamples. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/988f6d7e Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/988f6d7e Diff: http://git-wip-us.apache.org/

spark git commit: [SPARK-19639][SPARKR][EXAMPLE] Add spark.svmLinear example and update vignettes

2017-02-17 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 15b144d2b -> 8b57ea4a1 [SPARK-19639][SPARKR][EXAMPLE] Add spark.svmLinear example and update vignettes ## What changes were proposed in this pull request? We recently added the spark.svmLinear API for SparkR. We need to add an example and

spark git commit: [SPARK-19616][SPARKR] weightCol and aggregationDepth should be improved for some SparkR APIs

2017-02-22 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master e4065376d -> 1f86e795b [SPARK-19616][SPARKR] weightCol and aggregationDepth should be improved for some SparkR APIs ## What changes were proposed in this pull request? This is a follow-up PR of #16800 When doing SPARK-19456, we found

spark git commit: [SPARK-19682][SPARKR] Issue warning (or error) when subset method "[[" takes vector index

2017-02-23 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 78eae7e67 -> 7bf09433f [SPARK-19682][SPARKR] Issue warning (or error) when subset method "[[" takes vector index ## What changes were proposed in this pull request? The `[[` method is supposed to take a single index and return a column.

spark git commit: [SPARK-19682][SPARKR] Issue warning (or error) when subset method "[[" takes vector index

2017-02-23 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.1 21afc4534 -> d30238f1b [SPARK-19682][SPARKR] Issue warning (or error) when subset method "[[" takes vector index ## What changes were proposed in this pull request? The `[[` method is supposed to take a single index and return a

spark git commit: [SPARK-19571][R] Fix SparkR test break on Windows via AppVeyor

2017-02-14 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 7b64f7aa0 -> f776e3b42 [SPARK-19571][R] Fix SparkR test break on Windows via AppVeyor ## What changes were proposed in this pull request? It seems winutils for Hadoop 2.6.5 does not exist for now in https://github.com/steveloughran/winutils

spark git commit: [SPARK-19342][SPARKR] bug fixed in collect method for collecting timestamp column

2017-02-12 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 3881f342b -> bc0a0e639 [SPARK-19342][SPARKR] bug fixed in collect method for collecting timestamp column ## What changes were proposed in this pull request? Fix a bug in collect method for collecting timestamp column, the bug can be

spark git commit: [SPARK-19342][SPARKR] bug fixed in collect method for collecting timestamp column

2017-02-12 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.1 e580bb035 -> 173c2387a [SPARK-19342][SPARKR] bug fixed in collect method for collecting timestamp column ## What changes were proposed in this pull request? Fix a bug in collect method for collecting timestamp column, the bug can be

spark git commit: [SPARK-19319][BACKPORT-2.1][SPARKR] SparkR Kmeans summary returns error when the cluster size doesn't equal to k

2017-02-12 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.1 173c2387a -> 06e77e009 [SPARK-19319][BACKPORT-2.1][SPARKR] SparkR Kmeans summary returns error when the cluster size doesn't equal to k ## What changes were proposed in this pull request? Backport fix of #1 ## How was this patch

spark git commit: [SPARK-19456][SPARKR] Add LinearSVC R API

2017-02-15 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 447b2b530 -> 3973403d5 [SPARK-19456][SPARKR] Add LinearSVC R API ## What changes were proposed in this pull request? A linear SVM classifier was newly added to ML, and a Python API has been added. This JIRA is to add the R-side API. Marked as

spark git commit: [SPARK-19399][SPARKR] Add R coalesce API for DataFrame and Column

2017-02-15 Thread felixcheung
How was this patch tested? manual, unit tests Author: Felix Cheung <felixcheun...@hotmail.com> Closes #16739 from felixcheung/rcoalesce. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/671bc08e Tree: http://git-wip-us.a

spark git commit: [SPARK-19399][SPARKR] Add R coalesce API for DataFrame and Column

2017-02-15 Thread felixcheung
xcheun...@hotmail.com> Closes #16739 from felixcheung/rcoalesce. (cherry picked from commit 671bc08ed502815bfa2254c30d64149402acb0c7) Signed-off-by: Felix Cheung <felixche...@apache.org> Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/a

spark git commit: [SPARK-18828][SPARKR] Refactor scripts for R

2017-01-16 Thread felixcheung
lly Author: Felix Cheung <felixcheun...@hotmail.com> Closes #16249 from felixcheung/rscripts. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/c84f7d3e Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/c84f7d3e D

spark git commit: [SPARK-19231][SPARKR] add error handling for download and untar for Spark release

2017-01-18 Thread felixcheung
ase distribution we need to handle error for download and untar, and clean up, otherwise it will get stuck. ## How was this patch tested? manually Author: Felix Cheung <felixcheun...@hotmail.com> Closes #16589 from felixcheung/rtarreturncode. Project: http://git-wip-us.apache.org/repos/asf/s

spark git commit: [SPARK-19231][SPARKR] add error handling for download and untar for Spark release

2017-01-18 Thread felixcheung
ase distribution we need to handle error for download and untar, and clean up, otherwise it will get stuck. ## How was this patch tested? manually Author: Felix Cheung <felixcheun...@hotmail.com> Closes #16589 from felixcheung/rtarreturncode. (cherry picked fr

spark git commit: [SPARK-19066][SPARKR][BACKPORT-2.1] LDA doesn't set optimizer correctly

2017-01-17 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.1 3ec3e3f2e -> 29b954bba [SPARK-19066][SPARKR][BACKPORT-2.1] LDA doesn't set optimizer correctly ## What changes were proposed in this pull request? Back port the fix to SPARK-19066 to 2.1 branch. ## How was this patch tested? Unit

spark git commit: [SPARKR][EXAMPLE] change example APP name

2016-08-20 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 01401e965 -> 3e5fdeb3f [SPARKR][EXAMPLE] change example APP name ## What changes were proposed in this pull request? For R SQL example, appname is "MyApp". While examples in scala, Java and

spark git commit: [SPARK-17173][SPARKR] R MLlib refactor, cleanup, reformat, fix deprecation in test

2016-08-22 Thread felixcheung
tch tested? unit tests, manual tests Author: Felix Cheung <felixcheun...@hotmail.com> Closes #14735 from felixcheung/rmllibutil. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/0583ecda Tree: http://git-wip-us.apache.org/

spark git commit: [SPARK-18352][DOCS] wholeFile JSON update doc and programming guide

2017-03-02 Thread felixcheung
How was this patch tested? manually Author: Felix Cheung <felixcheun...@hotmail.com> Closes #17128 from felixcheung/jsonwholefiledoc. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/8d6ef895 Tree: http://git-wip-us.apache.

spark git commit: [SPARK-19460][SPARKR] Update dataset used in R documentation, examples to reduce warning noise and confusions

2017-02-28 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 731588056 -> 89cd3845b [SPARK-19460][SPARKR] Update dataset used in R documentation, examples to reduce warning noise and confusions ## What changes were proposed in this pull request? Replace `iris` dataset with `Titanic` or other

spark git commit: [SPARK-19572][SPARKR] Allow to disable hive in sparkR shell

2017-02-28 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 0fe8020f3 -> 731588056 [SPARK-19572][SPARKR] Allow to disable hive in sparkR shell ## What changes were proposed in this pull request? SPARK-15236 does this for the Scala shell; this ticket is for the sparkR shell. This is not only for sparkR

spark git commit: [SPARK-19572][SPARKR] Allow to disable hive in sparkR shell

2017-02-28 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.1 d887f7581 -> f719cccdc [SPARK-19572][SPARKR] Allow to disable hive in sparkR shell ## What changes were proposed in this pull request? SPARK-15236 does this for the Scala shell; this ticket is for the sparkR shell. This is not only for sparkR

spark git commit: [SPARK-16508][SPARKR] doc updates and more CRAN check fixes

2016-08-22 Thread felixcheung
ing CRAN check warnings ## How was this patch tested? create doc with knitr junyangq Author: Felix Cheung <felixcheun...@hotmail.com> Closes #14734 from felixcheung/rdoccleanup. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spa

spark git commit: [SPARKR][MINOR] Add installation message for remote master mode and improve other messages

2016-08-24 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.0 43273377a -> 9f363a690 [SPARKR][MINOR] Add installation message for remote master mode and improve other messages ## What changes were proposed in this pull request? This PR gives an informative message to users when they try to connect

spark git commit: [SPARKR][MINOR] Add installation message for remote master mode and improve other messages

2016-08-24 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 18708f76c -> 3a60be4b1 [SPARKR][MINOR] Add installation message for remote master mode and improve other messages ## What changes were proposed in this pull request? This PR gives an informative message to users when they try to connect to

spark git commit: [MINOR][SPARKR] fix R MLlib parameter documentation

2016-08-24 Thread felixcheung
tch tested? run knitr junyangq Author: Felix Cheung <felixcheun...@hotmail.com> Closes #14792 from felixcheung/rdocmllib. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/945c04bc Tree: http://git-wip-us.apache.org/repos/

spark git commit: [SPARKR][MINOR] Add more examples to window function docs

2016-08-24 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 945c04bcd -> 18708f76c [SPARKR][MINOR] Add more examples to window function docs ## What changes were proposed in this pull request? This PR adds more examples to window function docs to make them more accessible to the users. It also

spark git commit: [SPARKR][MINOR] Add more examples to window function docs

2016-08-24 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.0 9f924a01b -> 43273377a [SPARKR][MINOR] Add more examples to window function docs ## What changes were proposed in this pull request? This PR adds more examples to window function docs to make them more accessible to the users. It

spark git commit: [SPARK-17376][SPARKR] followup - change since version

2016-09-02 Thread felixcheung
xcheun...@hotmail.com> Closes #14939 from felixcheung/rsparkversion2. (cherry picked from commit eac1d0e921345b5d15aa35d8c565140292ab2af3) Signed-off-by: Felix Cheung <felixche...@apache.org> Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/

spark git commit: [SPARK-17376][SPARKR] followup - change since version

2016-09-02 Thread felixcheung
xcheun...@hotmail.com> Closes #14939 from felixcheung/rsparkversion2. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/eac1d0e9 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/eac1d0e9 Diff: http://git-wip-us.apache.org/

spark git commit: [SPARK-17315][SPARKR] Kolmogorov-Smirnov test SparkR wrapper

2016-09-03 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master c2a1576c2 -> abb2f9210 [SPARK-17315][SPARKR] Kolmogorov-Smirnov test SparkR wrapper ## What changes were proposed in this pull request? This PR tries to add Kolmogorov-Smirnov Test wrapper to SparkR. This wrapper implementation only

spark git commit: [SPARK-16883][SPARKR] SQL decimal type is not properly cast to number when collecting SparkDataFrame

2016-09-02 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 2ab8dbdda -> 0f30cdedb [SPARK-16883][SPARKR] SQL decimal type is not properly cast to number when collecting SparkDataFrame ## What changes were proposed in this pull request?

spark git commit: [SPARK-16883][SPARKR] SQL decimal type is not properly cast to number when collecting SparkDataFrame

2016-09-02 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.0 f9463238d -> 171bdfd96 [SPARK-16883][SPARKR] SQL decimal type is not properly cast to number when collecting SparkDataFrame ## What changes were proposed in this pull request?

spark git commit: [SPARK-15509][ML][SPARKR] R MLlib algorithms should support input columns "features" and "label"

2016-09-02 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 0f30cdedb -> 6969dcc79 [SPARK-15509][ML][SPARKR] R MLlib algorithms should support input columns "features" and "label" https://issues.apache.org/jira/browse/SPARK-15509 ## What changes were proposed in this pull request? Currently in

spark git commit: [SPARK-17178][SPARKR][SPARKSUBMIT] Allow to set sparkr shell command through --conf

2016-08-31 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master d92cd227c -> fa6347938 [SPARK-17178][SPARKR][SPARKSUBMIT] Allow to set sparkr shell command through --conf ## What changes were proposed in this pull request? Allow user to set sparkr shell command through --conf spark.r.shell.command

spark git commit: [SPARK-17658][SPARKR] read.df/write.df API taking path optionally in SparkR

2016-10-04 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master a99743d05 -> c9fe10d4e [SPARK-17658][SPARKR] read.df/write.df API taking path optionally in SparkR ## What changes were proposed in this pull request? The `write.df`/`read.df` APIs require a path, which is not always necessary in Spark.

spark git commit: [SPARK-17665][SPARKR] Support options/mode all for read/write APIs and options in other types

2016-10-07 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master bb1aaf28e -> 9d8ae853e [SPARK-17665][SPARKR] Support options/mode all for read/write APIs and options in other types ## What changes were proposed in this pull request? This PR includes the changes below: - Support `mode`/`options` in

spark git commit: [SPARK-17499][SPARKR][ML][MLLIB] make the default params in sparkR spark.mlp consistent with MultilayerPerceptronClassifier

2016-09-23 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 90d575421 -> f89808b0f [SPARK-17499][SPARKR][ML][MLLIB] make the default params in sparkR spark.mlp consistent with MultilayerPerceptronClassifier ## What changes were proposed in this pull request? update

spark git commit: [SPARK-17210][SPARKR] sparkr.zip is not distributed to executors when running sparkr in RStudio

2016-09-23 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.0 d3f90e71a -> 1a8ea000e [SPARK-17210][SPARKR] sparkr.zip is not distributed to executors when running sparkr in RStudio ## What changes were proposed in this pull request? Spark will add sparkr.zip to archive only when it is yarn mode

spark git commit: [SPARK-17210][SPARKR] sparkr.zip is not distributed to executors when running sparkr in RStudio

2016-09-23 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master f89808b0f -> f62ddc598 [SPARK-17210][SPARKR] sparkr.zip is not distributed to executors when running sparkr in RStudio ## What changes were proposed in this pull request? Spark will add sparkr.zip to archive only when it is yarn mode

spark git commit: [SPARK-16447][ML][SPARKR] LDA wrapper in SparkR

2016-08-18 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 68f5087d2 -> b72bb62d4 [SPARK-16447][ML][SPARKR] LDA wrapper in SparkR ## What changes were proposed in this pull request? Add LDA Wrapper in SparkR with the following interfaces: - spark.lda(data, ...) - spark.posterior(object,

spark git commit: [SPARKR][MINOR] Fix example of spark.naiveBayes

2016-08-26 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 970ab8f6d -> 188321623 [SPARKR][MINOR] Fix example of spark.naiveBayes ## What changes were proposed in this pull request? The original example doesn't work because the features are not categorical. This PR fixes this by changing to

spark git commit: [SPARKR][MINOR] Fix example of spark.naiveBayes

2016-08-26 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.0 27ed6d5dc -> 6f82d2da3 [SPARKR][MINOR] Fix example of spark.naiveBayes ## What changes were proposed in this pull request? The original example doesn't work because the features are not categorical. This PR fixes this by changing to

spark git commit: [SPARKR][MINOR] Remove reference link for common Windows environment variables

2016-08-23 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.0 811a2cef0 -> cc4018996 [SPARKR][MINOR] Remove reference link for common Windows environment variables ## What changes were proposed in this pull request? The PR removes reference link in the doc for environment variables for common

spark git commit: [SPARKR][MINOR] Remove reference link for common Windows environment variables

2016-08-23 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 9afdfc94f -> 8fd63e808 [SPARKR][MINOR] Remove reference link for common Windows environment variables ## What changes were proposed in this pull request? The PR removes reference link in the doc for environment variables for common

spark git commit: [SPARKR][MINOR] Fix doc for show method

2016-08-24 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 45b786aca -> d2932a0e9 [SPARKR][MINOR] Fix doc for show method ## What changes were proposed in this pull request? The original doc of `show` put methods for multiple classes together but the text only talks about `SparkDataFrame`. This

spark git commit: [SPARKR][MINOR] Fix doc for show method

2016-08-24 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.0 33d79b587 -> 29091d7cd [SPARKR][MINOR] Fix doc for show method ## What changes were proposed in this pull request? The original doc of `show` put methods for multiple classes together but the text only talks about `SparkDataFrame`.

spark git commit: [SPARK-16445][MLLIB][SPARKR] Multilayer Perceptron Classifier wrapper in SparkR

2016-08-24 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master d2932a0e9 -> 2fbdb6063 [SPARK-16445][MLLIB][SPARKR] Multilayer Perceptron Classifier wrapper in SparkR https://issues.apache.org/jira/browse/SPARK-16445 ## What changes were proposed in this pull request? Create Multilayer Perceptron

spark git commit: [SPARK-17499][SPARKR][FOLLOWUP] Check null first for layers in spark.mlp to avoid warnings in test results

2016-09-27 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master b03b4adf6 -> 4a8339568 [SPARK-17499][SPARKR][FOLLOWUP] Check null first for layers in spark.mlp to avoid warnings in test results ## What changes were proposed in this pull request? Some tests in `test_mllib.r` are as below: ```r

spark git commit: [SPARK-17157][SPARKR][FOLLOW-UP] doc fixes

2016-10-27 Thread felixcheung
ung <felixcheun...@hotmail.com> Closes #15650 from felixcheung/logitfix. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/1dbe9896 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/1dbe9896 Diff: http://git-wip-us.a

spark git commit: [SQL][DOC] updating doc for JSON source to link to jsonlines.org

2016-10-27 Thread felixcheung
ted? manual test Author: Felix Cheung <felixcheun...@hotmail.com> Closes #15629 from felixcheung/jsondoc. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/44c8bfda Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/44c8

spark git commit: [SPARK-18007][SPARKR][ML] update SparkR MLP - add initalWeights parameter

2016-10-25 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master c329a568b -> 12b3e8d2e [SPARK-18007][SPARKR][ML] update SparkR MLP - add initalWeights parameter ## What changes were proposed in this pull request? update SparkR MLP, add initalWeights parameter. ## How was this patch tested? test

spark git commit: [SPARK-17961][SPARKR][SQL] Add storageLevel to DataFrame for SparkR

2016-10-26 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master ea3605e82 -> fb0a8a8dd [SPARK-17961][SPARKR][SQL] Add storageLevel to DataFrame for SparkR ## What changes were proposed in this pull request? Add storageLevel to DataFrame for SparkR. This is similar to this PR:

spark git commit: [SPARK-18239][SPARKR] Gradient Boosted Tree for R

2016-11-08 Thread felixcheung
ely isolated I'd like to target this for branch-2.1 ## How was this patch tested? manual tests, unit tests Author: Felix Cheung <felixcheun...@hotmail.com> Closes #15746 from felixcheung/rgbt. (cherry picked from commit 55964c15a7b639f920dfe6c104ae4fdcd673705c) Signed-off-by: Felix Cheung

spark git commit: [SPARK-18239][SPARKR] Gradient Boosted Tree for R

2016-11-08 Thread felixcheung
ely isolated I'd like to target this for branch-2.1 ## How was this patch tested? manual tests, unit tests Author: Felix Cheung <felixcheun...@hotmail.com> Closes #15746 from felixcheung/rgbt. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/

spark git commit: [MINOR][DOCUMENTATION] Fix some minor descriptions in functions consistently with expressions

2016-11-05 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.1 dcbc42658 -> e9f1d4aaa [MINOR][DOCUMENTATION] Fix some minor descriptions in functions consistently with expressions ## What changes were proposed in this pull request? This PR proposes to improve documentation and fix some

spark git commit: [MINOR][DOCUMENTATION] Fix some minor descriptions in functions consistently with expressions

2016-11-05 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 9a87c3138 -> 15d392688 [MINOR][DOCUMENTATION] Fix some minor descriptions in functions consistently with expressions ## What changes were proposed in this pull request? This PR proposes to improve documentation and fix some descriptions

spark git commit: [SPARKR][BRANCH-2.0] R merge API doc and example fix

2016-10-23 Thread felixcheung
il.com> Closes #15589 from felixcheung/rdocmergefix. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/0e0d83a5 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/0e0d83a5 Diff: http://git-wip-us.apache.org/repos/asf/s

spark git commit: [SPARKR][BRANCH-2.0] R merge API doc and example fix

2016-10-23 Thread felixcheung
; Closes #15589 from felixcheung/rdocmergefix. (cherry picked from commit 0e0d83a597885ab1773cb69d6dcc10346d6976a3) Signed-off-by: Felix Cheung <felixche...@apache.org> Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commi

spark git commit: [SPARK-17811] SparkR cannot parallelize data.frame with NA or NULL in Date columns

2016-10-21 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.0 a65d40ab6 -> 78458a7eb [SPARK-17811] SparkR cannot parallelize data.frame with NA or NULL in Date columns ## What changes were proposed in this pull request? NA date values are serialized as "NA" and NA time values are serialized as

spark git commit: [SPARK-18013][SPARKR] add crossJoin API

2016-10-21 Thread felixcheung
hor: Felix Cheung <felixcheun...@hotmail.com> Closes #15559 from felixcheung/rcrossjoin. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/e21e1c94 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/e21e1c94 Diff: http:

spark git commit: [SPARK-17811] SparkR cannot parallelize data.frame with NA or NULL in Date columns

2016-10-21 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master e21e1c946 -> e371040a0 [SPARK-17811] SparkR cannot parallelize data.frame with NA or NULL in Date columns ## What changes were proposed in this pull request? NA date values are serialized as "NA" and NA time values are serialized as NaN

spark git commit: [SPARK-17674][SPARKR] check for warning in test output

2016-10-21 Thread felixcheung
arn" = 2)`), we need to have a way to detect any new warning and fail ## How was this patch tested? manual testing, Jenkins Author: Felix Cheung <felixcheun...@hotmail.com> Closes #15576 from felixcheung/rtestwarning. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Com

spark git commit: [SPARKR] fix warnings

2016-10-20 Thread felixcheung
der adding importFrom("utils", "object.size") to your NAMESPACE file. ``` ## How was this patch tested? unit tests Author: Felix Cheung <felixcheun...@hotmail.com> Closes #15560 from felixcheung/rwarnings. Project: http://git-wip-us.apache.org/repos/asf/spar

spark git commit: [SPARKR] fix warnings

2016-10-20 Thread felixcheung
ame Consider adding importFrom("utils", "object.size") to your NAMESPACE file. ``` ## How was this patch tested? unit tests Author: Felix Cheung <felixcheun...@hotmail.com> Closes #15560 from felixcheung/rwarnings. (cherry picked from commit 3180272d2d49e440516085c0e4aebd5ba

spark git commit: [SPARK-17919] Make timeout to RBackend configurable in SparkR

2016-10-30 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 8ae2da0b2 -> 2881a2d1d [SPARK-17919] Make timeout to RBackend configurable in SparkR ## What changes were proposed in this pull request? This patch makes RBackend connection timeout configurable by user. ## How was this patch tested? N/A

spark git commit: [SPARK-16137][SPARKR] randomForest for R

2016-10-30 Thread felixcheung
ual tests, unit tests Author: Felix Cheung <felixcheun...@hotmail.com> Closes #15607 from felixcheung/rrandomforest. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/b6879b8b Tree: http://git-wip-us.apache.org/repos/asf/s

spark git commit: [SPARK-18110][PYTHON][ML] add missing parameter in Python for RandomForest regression and classification

2016-10-30 Thread felixcheung
Add varianceCol to randomForestRegressor In Python ## How was this patch tested? manual tests Author: Felix Cheung <felixcheun...@hotmail.com> Closes #15638 from felixcheung/pyrandomforest. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/

spark git commit: [SPARKR][TEST] remove unnecessary suppressWarnings

2016-11-03 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 67659c9af -> e89202523 [SPARKR][TEST] remove unnecessary suppressWarnings ## What changes were proposed in this pull request? In test_mllib.R, there are two unnecessary suppressWarnings. This PR just removes them. ## How was this patch

spark git commit: [SPARK-14393][SQL][DOC] update doc for python and R

2016-11-03 Thread felixcheung
Author: Felix Cheung <felixcheun...@hotmail.com> Closes #15747 from felixcheung/pySPARK-14393. (cherry picked from commit a08463b1d32348a81d0f148dfaf22741d5c23b1a) Signed-off-by: Felix Cheung <felixche...@apache.org> Project: http://git-wip-us.apache.org/repos/asf/spark/repo Com

spark git commit: [SPARK-14393][SQL][DOC] update doc for python and R

2016-11-03 Thread felixcheung
: Felix Cheung <felixcheun...@hotmail.com> Closes #15747 from felixcheung/pySPARK-14393. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/a08463b1 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/a08463b1 Diff: h

spark git commit: [SPARK-17838][SPARKR] Check named arguments for options and use formatted R friendly message from JVM exception message

2016-11-01 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master ad4832a9f -> 1ecfafa08 [SPARK-17838][SPARKR] Check named arguments for options and use formatted R friendly message from JVM exception message ## What changes were proposed in this pull request? This PR proposes to - improve the

spark git commit: [SPARK-17817][PYSPARK] PySpark RDD Repartitioning Results in Highly Skewed Partition Sizes

2016-10-11 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 75b9e3514 -> 07508bd01 [SPARK-17817][PYSPARK] PySpark RDD Repartitioning Results in Highly Skewed Partition Sizes ## What changes were proposed in this pull request? Quoted from JIRA description: Calling repartition on a PySpark RDD to

spark git commit: [SPARK-17790][SPARKR] Support for parallelizing R data.frame larger than 2GB

2016-10-12 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.0 5451541d1 -> d55ba3063 [SPARK-17790][SPARKR] Support for parallelizing R data.frame larger than 2GB ## What changes were proposed in this pull request? If the R data structure that is being parallelized is larger than `INT_MAX` we use

spark git commit: [SPARK-17790][SPARKR] Support for parallelizing R data.frame larger than 2GB

2016-10-12 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master d5580ebaa -> 5cc503f4f [SPARK-17790][SPARKR] Support for parallelizing R data.frame larger than 2GB ## What changes were proposed in this pull request? If the R data structure that is being parallelized is larger than `INT_MAX` we use

spark git commit: [SPARK-18349][SPARKR] Update R API documentation on ml model summary

2016-12-08 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 4ac8b20bf -> 86a96034c [SPARK-18349][SPARKR] Update R API documentation on ml model summary ## What changes were proposed in this pull request? In this PR, the document of `summary` method is improved in the format: returns summary

spark git commit: [SPARK-18349][SPARKR] Update R API documentation on ml model summary

2016-12-08 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.1 ef5646b4c -> 4ceed95b4 [SPARK-18349][SPARKR] Update R API documentation on ml model summary ## What changes were proposed in this pull request? In this PR, the document of `summary` method is improved in the format: returns summary

spark git commit: [SPARK-18865][SPARKR] SparkR vignettes MLP and LDA updates

2016-12-14 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.1 280c35af9 -> 0d94201e0 [SPARK-18865][SPARKR] SparkR vignettes MLP and LDA updates ## What changes were proposed in this pull request? When do the QA work, I found that the following issues: 1). `spark.mlp` doesn't include an example;

spark git commit: [SPARK-18865][SPARKR] SparkR vignettes MLP and LDA updates

2016-12-14 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master ffdd1fcd1 -> 324388531 [SPARK-18865][SPARKR] SparkR vignettes MLP and LDA updates ## What changes were proposed in this pull request? When do the QA work, I found that the following issues: 1). `spark.mlp` doesn't include an example; 2).

spark git commit: [SPARK-18849][ML][SPARKR][DOC] vignettes final check reorg

2016-12-17 Thread felixcheung
.io/sparkr-vignettes.html Previous: https://felixcheung.github.io/sparkr-vignettes_old.html Author: Felix Cheung <felixcheun...@hotmail.com> Closes #16301 from felixcheung/rvignettespass2. (cherry picked from commit 38fd163d0d2c44128bf8872d297b79edd7bd4137) Signed-off-by: Felix Cheung

spark git commit: [SPARK-18849][ML][SPARKR][DOC] vignettes final check reorg

2016-12-17 Thread felixcheung
rkr-vignettes.html Previous: https://felixcheung.github.io/sparkr-vignettes_old.html Author: Felix Cheung <felixcheun...@hotmail.com> Closes #16301 from felixcheung/rvignettespass2. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spa

spark git commit: [SPARK-19133][SPARKR][ML][BACKPORT-2.1] fix glm for Gamma, clarify glm family supported

2017-01-10 Thread felixcheung
nit tests Author: Felix Cheung <felixcheun...@hotmail.com> Closes #16532 from felixcheung/rgammabackport. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/1022049c Tree: http://git-wip-us.apache.org/repos/asf/spark/tree

spark git commit: [SPARK-19133][SPARKR][ML] fix glm for Gamma, clarify glm family supported

2017-01-10 Thread felixcheung
ted? manual Author: Felix Cheung <felixcheun...@hotmail.com> Closes #16511 from felixcheung/rdocglmfamily. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/9bc3507e Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/9bc3

spark git commit: [SPARK-12757][CORE] lower "block locks were not released" log to info level

2017-01-12 Thread felixcheung
it is generating a lot of warnings in running ML, graph calls, as pointed out in the JIRA. Author: Felix Cheung <felixcheun...@hotmail.com> Closes #16513 from felixcheung/blocklockswarn. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/a

spark git commit: [SPARK-19126][DOCS] Update Join Documentation Across Languages

2017-01-08 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 1f6ded645 -> 19d9d4c85 [SPARK-19126][DOCS] Update Join Documentation Across Languages ## What changes were proposed in this pull request? - [X] Make sure all join types are clearly mentioned - [X] Make join labeling/style consistent - [X]
