Re: [VOTE] Create apache/orc-rust repository to adopt datafusion-contrib/orc-rust

2024-10-30 Thread Quanlong Huang
+1 On Thu, Oct 31, 2024 at 12:48 PM Gang Wu wrote: > > Hi, > > As a follow up of [1], I'd like to formally propose to adopt the donation > of ORC Rust implementation from datafusion-contrib/orc-rust [2] . > > Specifically, the datafusion-contrib/orc-rust repository will be moved to > apache/orc-r

Re: [VOTE] Release Apache ORC 1.8.3 (RC0)

2023-03-12 Thread Quanlong Huang
+1 Verified the checksum and signature Built and ran tests on Ubuntu 18.04 and MacOS 11.6.7 using JDK8 Thanks, Quanlong On Mon, Mar 13, 2023 at 9:23 AM Gang Wu wrote: > +1 > > Best Regards, > Gang > > On Mon, Mar 13, 2023 at 3:14 AM William H. wrote: > > > +1 > > > > Thank you, > > William >

Re: Apache ORC 1.8/1.7 and 1.6 EOL Release

2022-09-05 Thread Quanlong Huang
Impala currently uses the Java lib in 1.6.2 and the cpp lib in 1.7. I think it makes sense to bump its dependent java version to 1.7 as well. Filed an JIRA for this: https://issues.apache.org/jira/browse/IMPALA-11554 Thanks, Quanlong On Tue, Sep 6, 2022 at 11:42 AM Dongjoon Hyun wrote: > Thank

Re: [VOTE] Release Apache ORC 1.8.0 (RC0)

2022-08-29 Thread Quanlong Huang
+1 - Built and test C++/Java on Ubuntu 16.04 and OpenJDK 1.8.0_275 - Verified signature and checksum Thanks, Quanlong On Sun, Aug 28, 2022 at 3:31 AM Dongjoon Hyun wrote: > +1 > > Thank you. It's great to see 1.8.0 finally. > > Dongjoon > > > On Sat, Aug 27, 2022 at 11:22 AM William H. > wrot

Re: [VOTE] Release Apache ORC 1.7.6 (RC0)

2022-08-15 Thread Quanlong Huang
+1 - Built and tested C++/Java on Ubuntu 16.04 with OpenJDK 1.8.0_275 - Verified checksum and signature Thanks, Quanlong On Mon, Aug 15, 2022 at 9:43 PM Yiqun Zhang wrote: > +1 > > - Built and tested C++/Java on macOS 12.5 and openjdk 11.0.16 > - Shasum and gpg sig checked > > Thanks, > Yiqun

Re: Feature Freeze for Apache ORC 1.8?

2022-05-04 Thread Quanlong Huang
gt; Lastly, if you have any new feature, you can still backport it. Since > the Apache ORC PMC won't block you, don't worry about that. :) > > Dongjoon. > > > On Tue, May 3, 2022 at 1:34 AM Quanlong Huang > wrote: > > > +1 for the feature freeze plan. > > &

Re: Feature Freeze for Apache ORC 1.8?

2022-05-03 Thread Quanlong Huang
+1 for the feature freeze plan. However, 4 months seems like a pretty long testing time. Do we have any plans for the tests? Impala currently uses a snapshot version of ORC. But it can only cover the C++ client part. I'm not sure if there are any other projects using snapshot versions of ORC. So t

[jira] [Created] (ORC-1160) [C++] RowReaderImpl::seekToRow() can't seek within a selected row group

2022-04-27 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1160: --- Summary: [C++] RowReaderImpl::seekToRow() can't seek within a selected row group Key: ORC-1160 URL: https://issues.apache.org/jira/browse/ORC-1160 Project

[jira] [Created] (ORC-1159) [C++] Crash when the last stripe is skipped

2022-04-26 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1159: --- Summary: [C++] Crash when the last stripe is skipped Key: ORC-1159 URL: https://issues.apache.org/jira/browse/ORC-1159 Project: ORC Issue Type: Bug

Re: [VOTE] Release Apache ORC 1.6.14 (RC0)

2022-04-12 Thread Quanlong Huang
+1 - Built and tested C++/Java on Ubuntu 16.04 with OpenJDK 1.8.0_275. - Verified checksum and signature Thanks, Quanlong On Mon, Apr 11, 2022 at 12:13 PM Dongjoon Hyun wrote: > > I'll start with my +1. > > All test results were tracked here during preparing RC0. > - https://github.com/apach

Re: [VOTE] Release Apache ORC 1.7.4 (RC0)

2022-04-12 Thread Quanlong Huang
+1 - Built and tested C++/Java on Ubuntu 16.04 with OpenJDK 1.8.0_275. - Verified checksum and signature. Thanks, Quanlong On Tue, Apr 12, 2022 at 2:41 PM Yiqun Zhang wrote: > > +1 > > - Built and tested C++/Java on macOS 12.3.1 and openjdk 11.0.12 > - Shasum and gpg sig checked > > Best, > Yiq

[jira] [Created] (ORC-1150) Improve RowReaderImpl::computeBatchSize()

2022-04-12 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1150: --- Summary: Improve RowReaderImpl::computeBatchSize() Key: ORC-1150 URL: https://issues.apache.org/jira/browse/ORC-1150 Project: ORC Issue Type: Improvement

[jira] [Created] (ORC-1144) [C++] Provide options to trim duplicated values for reader

2022-03-28 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1144: --- Summary: [C++] Provide options to trim duplicated values for reader Key: ORC-1144 URL: https://issues.apache.org/jira/browse/ORC-1144 Project: ORC Issue Type

[jira] [Created] (ORC-1143) [C++] Support reading the PRESENT stream without reading the column data

2022-03-28 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1143: --- Summary: [C++] Support reading the PRESENT stream without reading the column data Key: ORC-1143 URL: https://issues.apache.org/jira/browse/ORC-1143 Project: ORC

[jira] [Created] (ORC-1142) [C++] Unroll loops in BooleanRleDecoderImpl::next()

2022-03-28 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1142: --- Summary: [C++] Unroll loops in BooleanRleDecoderImpl::next() Key: ORC-1142 URL: https://issues.apache.org/jira/browse/ORC-1142 Project: ORC Issue Type

[jira] [Created] (ORC-1137) [C++] Improve float/double conversion in DoubleColumnReader::next()

2022-03-27 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1137: --- Summary: [C++] Improve float/double conversion in DoubleColumnReader::next() Key: ORC-1137 URL: https://issues.apache.org/jira/browse/ORC-1137 Project: ORC

[jira] [Created] (ORC-1134) [C++] Compile error when switching branches

2022-03-22 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1134: --- Summary: [C++] Compile error when switching branches Key: ORC-1134 URL: https://issues.apache.org/jira/browse/ORC-1134 Project: ORC Issue Type: Bug

[jira] [Created] (ORC-1132) [C++] EncodedStringVectorBatch allocates used buffers

2022-03-20 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1132: --- Summary: [C++] EncodedStringVectorBatch allocates used buffers Key: ORC-1132 URL: https://issues.apache.org/jira/browse/ORC-1132 Project: ORC Issue Type

[jira] [Created] (ORC-1131) [C++] getMemoryUsage() is incorrect on string vector batches

2022-03-20 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1131: --- Summary: [C++] getMemoryUsage() is incorrect on string vector batches Key: ORC-1131 URL: https://issues.apache.org/jira/browse/ORC-1131 Project: ORC Issue

[jira] [Created] (ORC-1129) The build of tool-test should depends on cpp tools

2022-03-15 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1129: --- Summary: The build of tool-test should depends on cpp tools Key: ORC-1129 URL: https://issues.apache.org/jira/browse/ORC-1129 Project: ORC Issue Type: Bug

[jira] [Created] (ORC-1127) [C++] Add missing version of UNSTABLE-PRE-2.0

2022-03-15 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1127: --- Summary: [C++] Add missing version of UNSTABLE-PRE-2.0 Key: ORC-1127 URL: https://issues.apache.org/jira/browse/ORC-1127 Project: ORC Issue Type: Bug

[jira] [Created] (ORC-1125) [C++] Support decoding decimals in RLE

2022-03-06 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1125: --- Summary: [C++] Support decoding decimals in RLE Key: ORC-1125 URL: https://issues.apache.org/jira/browse/ORC-1125 Project: ORC Issue Type: Sub-task

[jira] [Created] (ORC-1124) Add C++ benchmark

2022-03-06 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1124: --- Summary: Add C++ benchmark Key: ORC-1124 URL: https://issues.apache.org/jira/browse/ORC-1124 Project: ORC Issue Type: New Feature Components: C

[jira] [Created] (ORC-1122) Add buffer to decode the whole run in RleDecoderV2

2022-03-04 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1122: --- Summary: Add buffer to decode the whole run in RleDecoderV2 Key: ORC-1122 URL: https://issues.apache.org/jira/browse/ORC-1122 Project: ORC Issue Type

Re: [VOTE] Release Apache ORC 1.6.13 (RC0)

2022-01-20 Thread Quanlong Huang
+1 - C++ build and test on Ubuntu 16.04 - Java build and test on Ubuntu 16.04 with JDK8 - Checksum verified Thanks, Quanlong On 2022/01/20 18:26:42 Dongjoon Hyun wrote: > Thank you, Yiqun, William and Panagiotis. :) > > On Wed, Jan 19, 2022 at 9:03 AM Panos Garefalakis > wrote: > > > +1 (bind

[jira] [Created] (ORC-1098) [C++] Support specifying column names and type ids in tools

2022-01-18 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1098: --- Summary: [C++] Support specifying column names and type ids in tools Key: ORC-1098 URL: https://issues.apache.org/jira/browse/ORC-1098 Project: ORC Issue

[jira] [Created] (ORC-1087) Seek overflow in an uncompressed chunk

2022-01-10 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1087: --- Summary: Seek overflow in an uncompressed chunk Key: ORC-1087 URL: https://issues.apache.org/jira/browse/ORC-1087 Project: ORC Issue Type: Bug

[jira] [Created] (ORC-1081) heap-use-after-free in orc::SearchArgumentBuilderImpl::end()

2022-01-07 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1081: --- Summary: heap-use-after-free in orc::SearchArgumentBuilderImpl::end() Key: ORC-1081 URL: https://issues.apache.org/jira/browse/ORC-1081 Project: ORC Issue

[jira] [Created] (ORC-1025) Add exhaustive test on bloom filter hashing between Java and C++ clients

2021-10-10 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1025: --- Summary: Add exhaustive test on bloom filter hashing between Java and C++ clients Key: ORC-1025 URL: https://issues.apache.org/jira/browse/ORC-1025 Project: ORC

[jira] [Created] (ORC-1024) BloomFilter hash computation is inconsistent between Java and C++ clients

2021-10-10 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1024: --- Summary: BloomFilter hash computation is inconsistent between Java and C++ clients Key: ORC-1024 URL: https://issues.apache.org/jira/browse/ORC-1024 Project: ORC

[jira] [Created] (ORC-1023) Support writing bloom filters in ConvertTool

2021-10-09 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1023: --- Summary: Support writing bloom filters in ConvertTool Key: ORC-1023 URL: https://issues.apache.org/jira/browse/ORC-1023 Project: ORC Issue Type: New Feature

[jira] [Created] (ORC-1021) Add -fno-omit-frame-pointer in RELWITHDEBINFO build

2021-10-07 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1021: --- Summary: Add -fno-omit-frame-pointer in RELWITHDEBINFO build Key: ORC-1021 URL: https://issues.apache.org/jira/browse/ORC-1021 Project: ORC Issue Type

[jira] [Created] (ORC-1020) Improve orc::RleDecoderV2::nextDirect

2021-10-07 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1020: --- Summary: Improve orc::RleDecoderV2::nextDirect Key: ORC-1020 URL: https://issues.apache.org/jira/browse/ORC-1020 Project: ORC Issue Type: Improvement

[jira] [Created] (ORC-1012) Support specifying columns in the orc-scan tool

2021-09-29 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1012: --- Summary: Support specifying columns in the orc-scan tool Key: ORC-1012 URL: https://issues.apache.org/jira/browse/ORC-1012 Project: ORC Issue Type

[jira] [Created] (ORC-999) [C++] Add PredicateDataType in PredicateLeaf::toString() and Literal::toString()

2021-09-15 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-999: -- Summary: [C++] Add PredicateDataType in PredicateLeaf::toString() and Literal::toString() Key: ORC-999 URL: https://issues.apache.org/jira/browse/ORC-999 Project: ORC

[jira] [Created] (ORC-973) [C++] Provide more interfaces for creating IN predicate

2021-08-26 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-973: -- Summary: [C++] Provide more interfaces for creating IN predicate Key: ORC-973 URL: https://issues.apache.org/jira/browse/ORC-973 Project: ORC Issue Type: New

[jira] [Created] (ORC-972) [C++] Evaluate SearchArguments using dictionaries

2021-08-25 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-972: -- Summary: [C++] Evaluate SearchArguments using dictionaries Key: ORC-972 URL: https://issues.apache.org/jira/browse/ORC-972 Project: ORC Issue Type: Improvement

[jira] [Created] (ORC-971) LESS_THAN_EQUALS doesn't handle the case when min=max

2021-08-25 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-971: -- Summary: LESS_THAN_EQUALS doesn't handle the case when min=max Key: ORC-971 URL: https://issues.apache.org/jira/browse/ORC-971 Project: ORC Issue Type

[jira] [Created] (ORC-969) Evaluate SearchArguments using file and stripe level stats

2021-08-25 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-969: -- Summary: Evaluate SearchArguments using file and stripe level stats Key: ORC-969 URL: https://issues.apache.org/jira/browse/ORC-969 Project: ORC Issue Type

[jira] [Created] (ORC-968) Column names used to build SearchArgument should be full path names

2021-08-24 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-968: -- Summary: Column names used to build SearchArgument should be full path names Key: ORC-968 URL: https://issues.apache.org/jira/browse/ORC-968 Project: ORC Issue

[jira] [Created] (ORC-961) Expose metrics of the reader

2021-08-22 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-961: -- Summary: Expose metrics of the reader Key: ORC-961 URL: https://issues.apache.org/jira/browse/ORC-961 Project: ORC Issue Type: New Feature Components

[jira] [Created] (ORC-960) Create SearchArgument using column ids

2021-08-22 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-960: -- Summary: Create SearchArgument using column ids Key: ORC-960 URL: https://issues.apache.org/jira/browse/ORC-960 Project: ORC Issue Type: New Feature

[jira] [Created] (ORC-959) C++ reader crash in resolving nested List columns for SearchArgument

2021-08-22 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-959: -- Summary: C++ reader crash in resolving nested List columns for SearchArgument Key: ORC-959 URL: https://issues.apache.org/jira/browse/ORC-959 Project: ORC

[jira] [Created] (ORC-592) Add test coverage for corrupt protobuf objects

2020-01-19 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-592: -- Summary: Add test coverage for corrupt protobuf objects Key: ORC-592 URL: https://issues.apache.org/jira/browse/ORC-592 Project: ORC Issue Type: Test

[jira] [Created] (ORC-591) orc::readFully crash due to null pointer variable

2020-01-19 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-591: -- Summary: orc::readFully crash due to null pointer variable Key: ORC-591 URL: https://issues.apache.org/jira/browse/ORC-591 Project: ORC Issue Type: Bug

[jira] [Created] (ORC-590) Crash in orc::RleDecoderV2::readByte

2020-01-15 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-590: -- Summary: Crash in orc::RleDecoderV2::readByte Key: ORC-590 URL: https://issues.apache.org/jira/browse/ORC-590 Project: ORC Issue Type: Bug Components

[jira] [Created] (ORC-581) C++ library could crash in orc::TypeImpl::addStructField

2019-12-14 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-581: -- Summary: C++ library could crash in orc::TypeImpl::addStructField Key: ORC-581 URL: https://issues.apache.org/jira/browse/ORC-581 Project: ORC Issue Type: Bug

[jira] [Created] (ORC-580) Crash in StripeStreamsImpl::getEncoding

2019-12-13 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-580: -- Summary: Crash in StripeStreamsImpl::getEncoding Key: ORC-580 URL: https://issues.apache.org/jira/browse/ORC-580 Project: ORC Issue Type: Bug

[jira] [Created] (ORC-452) Support converting MAP column from JSON to ORC

2018-12-25 Thread Quanlong Huang (JIRA)
Quanlong Huang created ORC-452: -- Summary: Support converting MAP column from JSON to ORC Key: ORC-452 URL: https://issues.apache.org/jira/browse/ORC-452 Project: ORC Issue Type: New Feature

[jira] [Created] (ORC-450) [C++] Support selecting list indices without materializing list items

2018-12-21 Thread Quanlong Huang (JIRA)
Quanlong Huang created ORC-450: -- Summary: [C++] Support selecting list indices without materializing list items Key: ORC-450 URL: https://issues.apache.org/jira/browse/ORC-450 Project: ORC

[jira] [Created] (ORC-414) [C++] ORC files with malformed protobuf objects can crash a release build

2018-10-09 Thread Quanlong Huang (JIRA)
Quanlong Huang created ORC-414: -- Summary: [C++] ORC files with malformed protobuf objects can crash a release build Key: ORC-414 URL: https://issues.apache.org/jira/browse/ORC-414 Project: ORC

[jira] [Created] (ORC-403) Should check PostScript length before serializing it

2018-09-15 Thread Quanlong Huang (JIRA)
Quanlong Huang created ORC-403: -- Summary: Should check PostScript length before serializing it Key: ORC-403 URL: https://issues.apache.org/jira/browse/ORC-403 Project: ORC Issue Type: Bug

[jira] [Created] (ORC-400) Support Hadoop3 in C++ lib

2018-09-11 Thread Quanlong Huang (JIRA)
Quanlong Huang created ORC-400: -- Summary: Support Hadoop3 in C++ lib Key: ORC-400 URL: https://issues.apache.org/jira/browse/ORC-400 Project: ORC Issue Type: New Feature Reporter

[jira] [Created] (ORC-322) c++ writer should not adjust gmtOffset when writing timestamps

2018-03-14 Thread Quanlong Huang (JIRA)
Quanlong Huang created ORC-322: -- Summary: c++ writer should not adjust gmtOffset when writing timestamps Key: ORC-322 URL: https://issues.apache.org/jira/browse/ORC-322 Project: ORC Issue Type

[jira] [Created] (ORC-320) C++ writer tests failed when Daylight Saving Time started

2018-03-11 Thread Quanlong Huang (JIRA)
Quanlong Huang created ORC-320: -- Summary: C++ writer tests failed when Daylight Saving Time started Key: ORC-320 URL: https://issues.apache.org/jira/browse/ORC-320 Project: ORC Issue Type

[jira] [Created] (ORC-319) C++ reader crash after Lz4DecompressionStream thrown a ParseError

2018-03-08 Thread Quanlong Huang (JIRA)
Quanlong Huang created ORC-319: -- Summary: C++ reader crash after Lz4DecompressionStream thrown a ParseError Key: ORC-319 URL: https://issues.apache.org/jira/browse/ORC-319 Project: ORC Issue

[jira] [Created] (ORC-317) C++ reader crash on reading corrupt decimal types

2018-03-06 Thread Quanlong Huang (JIRA)
Quanlong Huang created ORC-317: -- Summary: C++ reader crash on reading corrupt decimal types Key: ORC-317 URL: https://issues.apache.org/jira/browse/ORC-317 Project: ORC Issue Type: Bug

[jira] [Created] (ORC-315) More robust on reading random corrupt files

2018-03-05 Thread Quanlong Huang (JIRA)
Quanlong Huang created ORC-315: -- Summary: More robust on reading random corrupt files Key: ORC-315 URL: https://issues.apache.org/jira/browse/ORC-315 Project: ORC Issue Type: Task

[jira] [Created] (ORC-314) C++ Reader crash for out of range scale in parsing decimals

2018-03-04 Thread Quanlong Huang (JIRA)
Quanlong Huang created ORC-314: -- Summary: C++ Reader crash for out of range scale in parsing decimals Key: ORC-314 URL: https://issues.apache.org/jira/browse/ORC-314 Project: ORC Issue Type

[jira] [Created] (ORC-313) Missing checks on SubType count of LIST and MAP types

2018-03-03 Thread Quanlong Huang (JIRA)
Quanlong Huang created ORC-313: -- Summary: Missing checks on SubType count of LIST and MAP types Key: ORC-313 URL: https://issues.apache.org/jira/browse/ORC-313 Project: ORC Issue Type: Bug

[jira] [Created] (ORC-312) C++ Reader crash for corrupt memory

2018-03-02 Thread Quanlong Huang (JIRA)
Quanlong Huang created ORC-312: -- Summary: C++ Reader crash for corrupt memory Key: ORC-312 URL: https://issues.apache.org/jira/browse/ORC-312 Project: ORC Issue Type: Bug Components

[jira] [Created] (ORC-311) C++ Reader crash for corrupt StripeFooter

2018-03-02 Thread Quanlong Huang (JIRA)
Quanlong Huang created ORC-311: -- Summary: C++ Reader crash for corrupt StripeFooter Key: ORC-311 URL: https://issues.apache.org/jira/browse/ORC-311 Project: ORC Issue Type: Bug