[GitHub] spark pull request: [SPARK-8669] [SQL] Fix crash with BINARY (ENUM...

2015-06-29 Thread stevencanopy
Github user stevencanopy commented on the pull request:

https://github.com/apache/spark/pull/7048#issuecomment-116718717
  
Thanks @liancheng for looking into the Parquet spec! Yes, I first 
encountered the bug when I was converting Avro to Parquet with parquet-mr 1.7.0.

I've reverted the change to the two null cases, added ENUM to the UTF8 
case, and updated the test case.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: SPARK-8669: Fix crash with BINARY (ENUM) field...

2015-06-26 Thread stevencanopy
GitHub user stevencanopy opened a pull request:

https://github.com/apache/spark/pull/7048

SPARK-8669: Fix crash with BINARY (ENUM) fields with Parquet 1.7

Patch to fix crash with BINARY fields with ENUM original types.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/stevencanopy/spark SPARK-8669

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/7048.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #7048


commit 4e23d242e68f3060d0f0bfb47acc9564dafa5815
Author: Steven She ste...@canopylabs.com
Date:   2015-06-26T19:30:34Z

SPARK-8669: Fix crash with BINARY (ENUM) fields with Parquet 1.7




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] spark pull request: SPARK-7103: Fix crash with SparkContext.union ...

2015-04-24 Thread stevencanopy
GitHub user stevencanopy opened a pull request:

https://github.com/apache/spark/pull/5679

SPARK-7103: Fix crash with SparkContext.union when RDD has no partitioner

Added a check to the SparkContext.union method to check that a partitioner 
is defined on all RDDs when instantiating a PartitionerAwareUnionRDD. 

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/stevencanopy/spark SPARK-7103

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/spark/pull/5679.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #5679


commit 5a3d84649b46df9fd670e951941e809e1e6d98a7
Author: Steven She ste...@canopylabs.com
Date:   2015-04-24T05:55:25Z

SPARK-7103: Fix crash with SparkContext.union when at least one RDD has no 
partitioner




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org