maswin commented on PR #4415: URL: https://github.com/apache/hive/pull/4415#issuecomment-1599719006
> @maswin could you please check if the below failure is unrelated: http://ci.hive.apache.org/job/hive-precommit/job/PR-4415/3/testReport/junit/org.apache.hadoop.hive.cli/TestMiniTezCliDriver/Testing___split_15___PostProcess___testCliDriver_vector_non_string_partition_/

The test failure was related to the fix.

In VectorizedParquetRecordReader, partition columns are added every time next() is called: https://github.com/apache/hive/blob/f78ca5df80c0bcb566f0915cda65112268df492c/ql/src/java/org/apache/hadoop/hive/ql/io/parquet/vector/VectorizedParquetRecordReader.java#L406

In VectorizedOrcRecordReader, however, partition columns are set once and reused on every next() call: https://github.com/apache/hive/blob/7c83f6babc1a95e7fc26aeaa779d35ce7c91d1c0/ql/src/java/org/apache/hadoop/hive/ql/io/orc/VectorizedOrcInputFormat.java#L130

So when I did a reset, the partition column values were not set again. I have fixed this: partition and virtual columns are no longer reset.

The test now passes locally. I will wait for the full test suite to finish.
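
To illustrate why a full reset breaks a reader that sets partition columns only once, here is a minimal, self-contained sketch. It does not use Hive's classes: `Column`, `resetAll`, `resetDataColumnsOnly`, and `newBatch` are hypothetical names invented for this example, and the actual change in the PR may look different.

```java
import java.util.Arrays;

public class PartitionColumnResetSketch {

    /** Hypothetical stand-in for a column vector; this is NOT Hive's ColumnVector. */
    static final class Column {
        final long[] values;
        // True for partition/virtual columns that a reader fills once and reuses.
        final boolean reusedAcrossBatches;

        Column(int size, boolean reusedAcrossBatches) {
            this.values = new long[size];
            this.reusedAcrossBatches = reusedAcrossBatches;
        }

        void clear() {
            Arrays.fill(values, 0L);
        }
    }

    /** Clears every column. A reader that set partition values once loses them. */
    static void resetAll(Column[] batch) {
        for (Column c : batch) {
            c.clear();
        }
    }

    /** Clears only data columns and leaves partition/virtual columns untouched. */
    static void resetDataColumnsOnly(Column[] batch) {
        for (Column c : batch) {
            if (!c.reusedAcrossBatches) {
                c.clear();
            }
        }
    }

    /** Builds a two-column batch: index 0 is a data column, index 1 a partition column. */
    static Column[] newBatch(long partitionValue) {
        Column data = new Column(4, false);
        Column partition = new Column(4, true);
        Arrays.fill(data.values, 7L);
        Arrays.fill(partition.values, partitionValue);
        return new Column[] { data, partition };
    }

    public static void main(String[] args) {
        Column[] a = newBatch(2023L);
        resetAll(a);
        System.out.println("after resetAll, partition value = " + a[1].values[0]);

        Column[] b = newBatch(2023L);
        resetDataColumnsOnly(b);
        System.out.println("after resetDataColumnsOnly, partition value = " + b[1].values[0]);
    }
}
```

A reader that repopulates partition columns on every next() call tolerates either reset, while a reader that populates them once only works with the selective one.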
