spark git commit: [SPARK-12073][STREAMING] backpressure rate controller consumes events preferentially from lagg…

2016-03-04 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master a6e2bd31f -> f19228eed [SPARK-12073][STREAMING] backpressure rate controller consumes events preferentially from lagg… …ing partitions I'm pretty sure this is the reason we couldn't easily recover from an unbalanced Kafka partition u

spark git commit: [SPARK-13255] [SQL] Update vectorized reader to directly return ColumnarBatch instead of InternalRows.

2016-03-04 Thread davies
Repository: spark Updated Branches: refs/heads/master 5f42c28b1 -> a6e2bd31f [SPARK-13255] [SQL] Update vectorized reader to directly return ColumnarBatch instead of InternalRows. ## What changes were proposed in this pull request? (Please fill in changes proposed in this fix) Currently, th

spark git commit: [SPARK-13459][WEB UI] Separate Alive and Dead Executors in Executor Totals Table

2016-03-04 Thread tgraves
Repository: spark Updated Branches: refs/heads/master b7d414742 -> 5f42c28b1 [SPARK-13459][WEB UI] Separate Alive and Dead Executors in Executor Totals Table ## What changes were proposed in this pull request? Now that dead executors are shown in the executors table (#10058) the totals table

spark git commit: [HOTFIX] fix the conflict when cherry-pick

2016-03-04 Thread davies
Repository: spark Updated Branches: refs/heads/branch-1.6 f0cc511ec -> ffaf7c080 [HOTFIX] fix the conflict when cherry-pick Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/ffaf7c08 Tree: http://git-wip-us.apache.org/repos/

spark git commit: [SPARK-13444][MLLIB] QuantileDiscretizer chooses bad splits on large DataFrames

2016-03-04 Thread meng
Repository: spark Updated Branches: refs/heads/branch-1.6 528e37352 -> f0cc511ec [SPARK-13444][MLLIB] QuantileDiscretizer chooses bad splits on large DataFrames ## What changes were proposed in this pull request? Change line 113 of QuantileDiscretizer.scala to `val requiredSamples = math.max

[3/3] spark git commit: [SPARK-13633][SQL] Move things into catalyst.parser package

2016-03-04 Thread andrewor14
[SPARK-13633][SQL] Move things into catalyst.parser package ## What changes were proposed in this pull request? This patch simply moves things to existing package `o.a.s.sql.catalyst.parser` in an effort to reduce the size of the diff in #11048. This is conceptually the same as a recently merge

[2/3] spark git commit: [SPARK-13633][SQL] Move things into catalyst.parser package

2016-03-04 Thread andrewor14
http://git-wip-us.apache.org/repos/asf/spark/blob/b7d41474/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser/CatalystQl.scala -- diff --git a/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/parser/Catalyst

[1/3] spark git commit: [SPARK-13633][SQL] Move things into catalyst.parser package

2016-03-04 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 83302c3bf -> b7d414742 http://git-wip-us.apache.org/repos/asf/spark/blob/b7d41474/sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/util/DataTypeParserSuite.scala --

spark git commit: [SPARK-12941][SQL][MASTER] Spark-SQL JDBC Oracle dialect fails to map string datatypes to Oracle VARCHAR datatype mapping

2016-03-04 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-1.6 5a2712952 -> 528e37352 [SPARK-12941][SQL][MASTER] Spark-SQL JDBC Oracle dialect fails to map string datatypes to Oracle VARCHAR datatype mapping A test suite added for the bug fix -SPARK 12941; for the mapping of the StringType to cor

spark git commit: [SPARK-13036][SPARK-13318][SPARK-13319] Add save/load for feature.py

2016-03-04 Thread meng
Repository: spark Updated Branches: refs/heads/master c8f25459e -> 83302c3bf [SPARK-13036][SPARK-13318][SPARK-13319] Add save/load for feature.py Add save/load for feature.py. Meanwhile, add save/load for `ElementwiseProduct` in Scala side and fix a bug of missing `setDefault` in `VectorSlice

spark git commit: [SPARK-13676] Fix mismatched default values for regParam in LogisticRegression

2016-03-04 Thread meng
Repository: spark Updated Branches: refs/heads/master e61750824 -> c8f25459e [SPARK-13676] Fix mismatched default values for regParam in LogisticRegression ## What changes were proposed in this pull request? The default value of regularization parameter for `LogisticRegression` algorithm is

spark git commit: [SPARK-11515][ML] QuantileDiscretizer should take random seed

2016-03-04 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 51c676e46 -> 5a2712952 [SPARK-11515][ML] QuantileDiscretizer should take random seed cc jkbradley Author: Yu ISHIKAWA Closes #9535 from yu-iskw/SPARK-11515. (cherry picked from commit 574571c87098795a2206a113ee9ed4bafba8f00f) Signed

spark git commit: [SPARK-13673][WINDOWS] Fixed not to pollute environment variables.

2016-03-04 Thread srowen
Repository: spark Updated Branches: refs/heads/master 204b02b56 -> e61750824 [SPARK-13673][WINDOWS] Fixed not to pollute environment variables. ## What changes were proposed in this pull request? This patch fixes the problem that `bin\beeline.cmd` pollutes environment variables. The similar

spark git commit: [SPARK-12925] Improve HiveInspectors.unwrap for StringObjectInspector.…

2016-03-04 Thread srowen
Repository: spark Updated Branches: refs/heads/master c04dc27ce -> 204b02b56 [SPARK-12925] Improve HiveInspectors.unwrap for StringObjectInspector.… Earlier fix did not copy the bytes and it is possible for higher level to reuse Text object. This was causing issues. Proposed fix now copies

spark git commit: [SPARK-13398][STREAMING] Move away from thread pool task support to forkjoin

2016-03-04 Thread srowen
Repository: spark Updated Branches: refs/heads/master 27e88faa0 -> c04dc27ce [SPARK-13398][STREAMING] Move away from thread pool task support to forkjoin ## What changes were proposed in this pull request? Remove old deprecated ThreadPoolExecutor and replace with ExecutionContext using a For

spark git commit: [SPARK-13646][MLLIB] QuantileDiscretizer counts dataset twice in get…

2016-03-04 Thread srowen
Repository: spark Updated Branches: refs/heads/master dd83c209f -> 27e88faa0 [SPARK-13646][MLLIB] QuantileDiscretizer counts dataset twice in get… ## What changes were proposed in this pull request? It avoids counting the dataframe twice. Author: Abou Haydar Elias Author: Elie A Closes

spark git commit: [SPARK-13603][SQL] support SQL generation for subquery

2016-03-04 Thread lian
Repository: spark Updated Branches: refs/heads/master 465c665db -> dd83c209f [SPARK-13603][SQL] support SQL generation for subquery ## What changes were proposed in this pull request? This is support SQL generation for subquery expressions, which will be replaced to a SubqueryHolder inside S