[GitHub] spark pull request: [SPARK-8669] [SQL] Fix crash with BINARY (ENUM...
Github user stevencanopy commented on the pull request: https://github.com/apache/spark/pull/7048#issuecomment-116718717 Thanks @liancheng for looking into the Parquet spec! Yes, I first encountered the bug when I was converting Avro to Parquet with parquet-mr 1.7.0. I've reverted the change to the two null cases, added ENUM to the UTF8 case, and updated the test case. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: SPARK-8669: Fix crash with BINARY (ENUM) field...
GitHub user stevencanopy opened a pull request: https://github.com/apache/spark/pull/7048 SPARK-8669: Fix crash with BINARY (ENUM) fields with Parquet 1.7 Patch to fix crash with BINARY fields with ENUM original types. You can merge this pull request into a Git repository by running: $ git pull https://github.com/stevencanopy/spark SPARK-8669 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/spark/pull/7048.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #7048 commit 4e23d242e68f3060d0f0bfb47acc9564dafa5815 Author: Steven She ste...@canopylabs.com Date: 2015-06-26T19:30:34Z SPARK-8669: Fix crash with BINARY (ENUM) fields with Parquet 1.7 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org
[GitHub] spark pull request: SPARK-7103: Fix crash with SparkContext.union ...
GitHub user stevencanopy opened a pull request: https://github.com/apache/spark/pull/5679 SPARK-7103: Fix crash with SparkContext.union when RDD has no partitioner Added a check to the SparkContext.union method to check that a partitioner is defined on all RDDs when instantiating a PartitionerAwareUnionRDD. You can merge this pull request into a Git repository by running: $ git pull https://github.com/stevencanopy/spark SPARK-7103 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/spark/pull/5679.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #5679 commit 5a3d84649b46df9fd670e951941e809e1e6d98a7 Author: Steven She ste...@canopylabs.com Date: 2015-04-24T05:55:25Z SPARK-7103: Fix crash with SparkContext.union when at least one RDD has no partitioner --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. --- - To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org