spark git commit: [SPARK-20553][ML][PYSPARK] Update ALS examples with recommend-all methods

2017-05-16 Thread mlnick
Repository: spark Updated Branches: refs/heads/branch-2.2 57c87cf2d -> b8d37ac37 [SPARK-20553][ML][PYSPARK] Update ALS examples with recommend-all methods Update ALS examples illustrating use of "recommendForAllX" methods. ## How was this patch tested? Built and ran examples locally Author:

spark git commit: [SPARK-20553][ML][PYSPARK] Update ALS examples with recommend-all methods

2017-05-16 Thread mlnick
Repository: spark Updated Branches: refs/heads/master dbe81633a -> 6af7b43b3 [SPARK-20553][ML][PYSPARK] Update ALS examples with recommend-all methods Update ALS examples illustrating use of "recommendForAllX" methods. ## How was this patch tested? Built and ran examples locally Author: Nick

spark git commit: [SPARK-20677][MLLIB][ML] Follow-up to ALS recommend-all performance PRs

2017-05-16 Thread mlnick
Repository: spark Updated Branches: refs/heads/master 6af7b43b3 -> 25b4f41d2 [SPARK-20677][MLLIB][ML] Follow-up to ALS recommend-all performance PRs Small clean ups from #17742 and #17845. ## How was this patch tested? Existing unit tests. Author: Nick Pentreath Closes #17919 from MLnick/

spark git commit: [SPARK-20677][MLLIB][ML] Follow-up to ALS recommend-all performance PRs

2017-05-16 Thread mlnick
Repository: spark Updated Branches: refs/heads/branch-2.2 b8d37ac37 -> ee0d2af95 [SPARK-20677][MLLIB][ML] Follow-up to ALS recommend-all performance PRs Small clean ups from #17742 and #17845. ## How was this patch tested? Existing unit tests. Author: Nick Pentreath Closes #17919 from MLn

spark git commit: [SPARK-20529][CORE] Allow worker and master work with a proxy server

2017-05-16 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master 25b4f41d2 -> 9150bca47 [SPARK-20529][CORE] Allow worker and master work with a proxy server ## What changes were proposed in this pull request? In the current codes, when worker connects to master, master will send its address to the work

spark git commit: [SPARK-20529][CORE] Allow worker and master work with a proxy server

2017-05-16 Thread zsxwing
Repository: spark Updated Branches: refs/heads/branch-2.2 ee0d2af95 -> 75e5ea294 [SPARK-20529][CORE] Allow worker and master work with a proxy server ## What changes were proposed in this pull request? In the current codes, when worker connects to master, master will send its address to the

spark git commit: [SPARK-19372][SQL] Fix throwing a Java exception at df.fliter() due to 64KB bytecode size limit

2017-05-16 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master 9150bca47 -> 6f62e9d9b [SPARK-19372][SQL] Fix throwing a Java exception at df.fliter() due to 64KB bytecode size limit ## What changes were proposed in this pull request? When an expression for `df.filter()` has many nodes (e.g. 400), the

spark git commit: [SPARK-20140][DSTREAM] Remove hardcoded kinesis retry wait and max retries

2017-05-16 Thread brkyvz
Repository: spark Updated Branches: refs/heads/master 6f62e9d9b -> 38f4e8692 [SPARK-20140][DSTREAM] Remove hardcoded kinesis retry wait and max retries ## What changes were proposed in this pull request? The pull requests proposes to remove the hardcoded values for Amazon Kinesis - MIN_RETRY

spark git commit: [SPARK-20140][DSTREAM] Remove hardcoded kinesis retry wait and max retries

2017-05-16 Thread brkyvz
Repository: spark Updated Branches: refs/heads/branch-2.2 75e5ea294 -> 7076ab40f [SPARK-20140][DSTREAM] Remove hardcoded kinesis retry wait and max retries ## What changes were proposed in this pull request? The pull requests proposes to remove the hardcoded values for Amazon Kinesis - MIN_R

spark git commit: [SQL][TRIVIAL] Lower parser log level to debug

2017-05-16 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 38f4e8692 -> 69bb7715f [SQL][TRIVIAL] Lower parser log level to debug ## What changes were proposed in this pull request? Currently the parser logs the query it is parsing at `info` level. This is too high, this PR lowers the log level to

spark git commit: [SPARK-20690][SQL] Subqueries in FROM should have alias names

2017-05-16 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 69bb7715f -> 7463a88be [SPARK-20690][SQL] Subqueries in FROM should have alias names ## What changes were proposed in this pull request? We add missing attributes into Filter in Analyzer. But we shouldn't do it through subqueries like thi

spark git commit: [SPARK-20776] Fix perf. problems in JobProgressListener caused by TaskMetrics construction

2017-05-16 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 7463a88be -> 30e0557db [SPARK-20776] Fix perf. problems in JobProgressListener caused by TaskMetrics construction ## What changes were proposed in this pull request? In ``` ./bin/spark-shell --master=local[64] ``` I ran ``` sc.parallel

spark git commit: [SPARK-20776] Fix perf. problems in JobProgressListener caused by TaskMetrics construction

2017-05-16 Thread wenchen
Repository: spark Updated Branches: refs/heads/branch-2.2 7076ab40f -> d42c67a1f [SPARK-20776] Fix perf. problems in JobProgressListener caused by TaskMetrics construction ## What changes were proposed in this pull request? In ``` ./bin/spark-shell --master=local[64] ``` I ran ``` sc.para