[GitHub] spark pull request #14517: [SPARK-16931][PYTHON] PySpark APIS for bucketBy a...

2016-10-03 Thread GregBowyer
Github user GregBowyer commented on a diff in the pull request: https://github.com/apache/spark/pull/14517#discussion_r81665506 --- Diff: python/pyspark/sql/readwriter.py --- @@ -747,16 +800,25 @@ def _test(): except py4j.protocol.Py4JError: spark

[GitHub] spark issue #9766: [SPARK-11775][PYSPARK][SQL] Allow PySpark to register Jav...

2016-09-22 Thread GregBowyer
Github user GregBowyer commented on the issue: https://github.com/apache/spark/pull/9766 Where do we stand on this, I just reapplied this patch to a spark 2.1-xxx build to get the same behaviour. --- If your project is set up for it, you can reply to this email and have your reply

[GitHub] spark issue #14517: [SPARK-16931][PYTHON] PySpark APIS for bucketBy and sort...

2016-08-26 Thread GregBowyer
Github user GregBowyer commented on the issue: https://github.com/apache/spark/pull/14517 What thoughts do people have about merging in? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request #14517: [SPARK-16931][PYTHON] PySpark APIS for bucketBy a...

2016-08-16 Thread GregBowyer
Github user GregBowyer commented on a diff in the pull request: https://github.com/apache/spark/pull/14517#discussion_r75011876 --- Diff: python/pyspark/sql/readwriter.py --- @@ -692,8 +734,7 @@ def orc(self, path, mode=None, partitionBy=None, compression=None

[GitHub] spark pull request #14517: [SPARK-16931][PYTHON] PySpark APIS for bucketBy a...

2016-08-16 Thread GregBowyer
Github user GregBowyer commented on a diff in the pull request: https://github.com/apache/spark/pull/14517#discussion_r75011758 --- Diff: python/pyspark/sql/readwriter.py --- @@ -692,8 +734,7 @@ def orc(self, path, mode=None, partitionBy=None, compression=None

[GitHub] spark pull request #14517: [SPARK-16931][PYTHON] PySpark APIS for bucketBy a...

2016-08-16 Thread GregBowyer
Github user GregBowyer commented on a diff in the pull request: https://github.com/apache/spark/pull/14517#discussion_r75011462 --- Diff: python/pyspark/sql/readwriter.py --- @@ -733,11 +774,19 @@ def _test(): import os import tempfile import py4j

[GitHub] spark issue #14517: [SPARK-16931][PYTHON] PySpark APIS for bucketBy and sort...

2016-08-08 Thread GregBowyer
Github user GregBowyer commented on the issue: https://github.com/apache/spark/pull/14517 Amended commit with style changes from MLNick. Can someone call the OK to test please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub

[GitHub] spark pull request #14517: [SPARK-16931][PYTHON] PySpark APIS for bucketBy a...

2016-08-05 Thread GregBowyer
GitHub user GregBowyer opened a pull request: https://github.com/apache/spark/pull/14517 [SPARK-16931][PYTHON] PySpark APIS for bucketBy and sortBy ## What changes were proposed in this pull request? API access to allow pyspark to use bucketBy and sortBy in datraframes. You can