Michael Allman created SPARK-17992:
--------------------------------------

             Summary: HiveClient.getPartitionsByFilter throws an exception for some unsupported filters when hive.metastore.try.direct.sql=false
                 Key: SPARK-17992
                 URL: https://issues.apache.org/jira/browse/SPARK-17992
             Project: Spark
          Issue Type: Bug
          Components: SQL
            Reporter: Michael Allman

We recently added (and enabled by default) table partition pruning for partitioned Hive tables converted to using {{TableFileCatalog}}. When the Hive configuration option {{hive.metastore.try.direct.sql}} is set to {{false}}, Hive throws an exception for unsupported filter expressions. For example, attempting to filter on an integer partition column throws an {{org.apache.hadoop.hive.metastore.api.MetaException}}.

I discovered this behavior because VideoAmp uses the CDH version of Hive with a PostgreSQL metastore DB. In this configuration, CDH sets {{hive.metastore.try.direct.sql}} to {{false}} by default, and queries that filter on a non-string partition column fail. That would be a rather rude surprise for these Spark 2.1 users...

I'm not sure exactly what behavior we should expect, but I suggest that {{HiveClientImpl.getPartitionsByFilter}} catch this metastore exception and return all partitions instead. This is what Spark already does for users of Hive 0.12, which does not support this feature at all.
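
To make the suggested behavior concrete, here is a minimal, self-contained sketch of the catch-and-fall-back idea. It is not Spark's or Hive's actual code: {{MetaException}} below is a local stand-in for {{org.apache.hadoop.hive.metastore.api.MetaException}}, and the {{Metastore}} interface, {{FakeMetastore}}, and {{getPartitionsByFilterOrAll}} are hypothetical names introduced only for illustration. It also assumes the caller still applies the filter to whatever partitions come back, so that returning every partition costs performance but not correctness.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class PartitionPruningFallback {

    // Local stand-in for org.apache.hadoop.hive.metastore.api.MetaException (assumption).
    static class MetaException extends Exception {
        MetaException(String message) { super(message); }
    }

    // Hypothetical, simplified view of the two metastore calls involved.
    interface Metastore {
        List<String> getPartitionsByFilter(String filter) throws MetaException;
        List<String> getAllPartitions();
    }

    // Try server-side pruning first; if the metastore rejects the filter,
    // fall back to fetching every partition.
    static List<String> getPartitionsByFilterOrAll(Metastore metastore, String filter) {
        try {
            return metastore.getPartitionsByFilter(filter);
        } catch (MetaException e) {
            return metastore.getAllPartitions();
        }
    }

    // Toy metastore: "filtering" is an exact match on the partition name,
    // and it can be configured to reject every filter.
    static class FakeMetastore implements Metastore {
        private final List<String> partitions;
        private final boolean supportsFilter;

        FakeMetastore(List<String> partitions, boolean supportsFilter) {
            this.partitions = partitions;
            this.supportsFilter = supportsFilter;
        }

        @Override
        public List<String> getPartitionsByFilter(String filter) throws MetaException {
            if (!supportsFilter) {
                throw new MetaException("unsupported filter: " + filter);
            }
            List<String> matches = new ArrayList<>();
            for (String partition : partitions) {
                if (partition.equals(filter)) {
                    matches.add(partition);
                }
            }
            return matches;
        }

        @Override
        public List<String> getAllPartitions() {
            return new ArrayList<>(partitions);
        }
    }

    public static void main(String[] args) {
        List<String> parts = Arrays.asList("year=2015", "year=2016");
        // Filter accepted: only the matching partition is returned.
        System.out.println(getPartitionsByFilterOrAll(new FakeMetastore(parts, true), "year=2016"));
        // Filter rejected: all partitions are returned instead of failing the query.
        System.out.println(getPartitionsByFilterOrAll(new FakeMetastore(parts, false), "year=2016"));
    }
}
```

The trade-off is that a rejected filter silently degrades to a full partition listing, which may be slow on tables with many partitions; logging a warning in the catch branch would make that visible.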