[jira] [Created] (ORC-1144) [C++] Provide options to trim duplicated values for reader

2022-03-28 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1144: --- Summary: [C++] Provide options to trim duplicated values for reader Key: ORC-1144 URL: https://issues.apache.org/jira/browse/ORC-1144 Project: ORC Issue Type:

[GitHub] [orc] guiyanakuang commented on a change in pull request #1072: ORC-1138

2022-03-28 Thread GitBox
guiyanakuang commented on a change in pull request #1072: URL: https://github.com/apache/orc/pull/1072#discussion_r837030697 ## File path: java/core/src/java/org/apache/orc/impl/BufferChunk.java ## @@ -89,7 +89,10 @@ public boolean equals(Object other) { return false;

[jira] [Created] (ORC-1143) [C++] Support reading the PRESENT stream without reading the column data

2022-03-28 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1143: --- Summary: [C++] Support reading the PRESENT stream without reading the column data Key: ORC-1143 URL: https://issues.apache.org/jira/browse/ORC-1143 Project: ORC

[jira] [Created] (ORC-1142) [C++] Unroll loops in BooleanRleDecoderImpl::next()

2022-03-28 Thread Quanlong Huang (Jira)
Quanlong Huang created ORC-1142: --- Summary: [C++] Unroll loops in BooleanRleDecoderImpl::next() Key: ORC-1142 URL: https://issues.apache.org/jira/browse/ORC-1142 Project: ORC Issue Type: Improve

[GitHub] [orc] rbalamohan commented on a change in pull request #1072: ORC-1138

2022-03-28 Thread GitBox
rbalamohan commented on a change in pull request #1072: URL: https://github.com/apache/orc/pull/1072#discussion_r836922547 ## File path: java/core/src/java/org/apache/orc/OrcConf.java ## @@ -194,6 +194,18 @@ ORC_MAX_DISK_RANGE_CHUNK_LIMIT("orc.max.disk.range.chunk.limit",

[GitHub] [orc] ArchangeGabriel commented on issue #1068: Tests failure on 1.7.3

2022-03-28 Thread GitBox
ArchangeGabriel commented on issue #1068: URL: https://github.com/apache/orc/issues/1068#issuecomment-1081210199 We built 1.7.1 when it was released. At this time the only test failure was `TestDecompression.testLzoLong`. But rebuilding it again today has the same failure as reported above

[GitHub] [orc] dongjoon-hyun commented on issue #1068: Tests failure on 1.7.3

2022-03-28 Thread GitBox
dongjoon-hyun commented on issue #1068: URL: https://github.com/apache/orc/issues/1068#issuecomment-1081193746 According to the context, the root cause is unclear to me either. > The error is unrelated and new, it did not occur in 1.7.1. > Same failures on 1.7.1 Back to

[GitHub] [orc] ArchangeGabriel commented on issue #1068: Tests failure on 1.7.3

2022-03-28 Thread GitBox
ArchangeGabriel commented on issue #1068: URL: https://github.com/apache/orc/issues/1068#issuecomment-1081120463 Same failures on 1.7.1 (+TestDecompression.testLzoLong that was fixed since). So there has been a dependency related change. Since they are not a lot of dependencies involved (I

[GitHub] [orc] dongjoon-hyun commented on issue #1068: Tests failure on 1.7.3

2022-03-28 Thread GitBox
dongjoon-hyun commented on issue #1068: URL: https://github.com/apache/orc/issues/1068#issuecomment-1081112689 No problem. We are looking forward to seeing your test result. Let me know when you have a new result. -- This is an automated message from the Apache Git Service. To respond to

[GitHub] [orc] ArchangeGabriel commented on issue #1068: Tests failure on 1.7.3

2022-03-28 Thread GitBox
ArchangeGabriel commented on issue #1068: URL: https://github.com/apache/orc/issues/1068#issuecomment-1081107265 Sorry if there has been a misunderstanding about how #1065 in involved here, I meant that we backport this patch on top of current release else tests fails, but [we already patc

[GitHub] [orc] dongjoon-hyun merged pull request #1070: ORC-1141: Bump mockito.version from 4.3.1 to 4.4.0

2022-03-28 Thread GitBox
dongjoon-hyun merged pull request #1070: URL: https://github.com/apache/orc/pull/1070 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr

[jira] [Created] (ORC-1141) Bump mockito.version from 4.3.1 to 4.4.0

2022-03-28 Thread Dongjoon Hyun (Jira)
Dongjoon Hyun created ORC-1141: -- Summary: Bump mockito.version from 4.3.1 to 4.4.0 Key: ORC-1141 URL: https://issues.apache.org/jira/browse/ORC-1141 Project: ORC Issue Type: Test Compo

[GitHub] [orc] dongjoon-hyun commented on pull request #1072: ORC-1138

2022-03-28 Thread GitBox
dongjoon-hyun commented on pull request #1072: URL: https://github.com/apache/orc/pull/1072#issuecomment-1080996069 cc @pgaref and @guiyanakuang , too -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [orc] dongjoon-hyun commented on a change in pull request #1072: ORC-1138

2022-03-28 Thread GitBox
dongjoon-hyun commented on a change in pull request #1072: URL: https://github.com/apache/orc/pull/1072#discussion_r836724337 ## File path: java/core/src/java/org/apache/orc/impl/BufferChunk.java ## @@ -89,7 +89,10 @@ public boolean equals(Object other) { return false;

[GitHub] [orc] dongjoon-hyun commented on a change in pull request #1072: ORC-1138

2022-03-28 Thread GitBox
dongjoon-hyun commented on a change in pull request #1072: URL: https://github.com/apache/orc/pull/1072#discussion_r836720325 ## File path: java/core/src/java/org/apache/orc/OrcConf.java ## @@ -194,6 +194,18 @@ ORC_MAX_DISK_RANGE_CHUNK_LIMIT("orc.max.disk.range.chunk.limit",

[GitHub] [orc] pavibhai opened a new pull request #1072: ORC-1138

2022-03-28 Thread GitBox
pavibhai opened a new pull request #1072: URL: https://github.com/apache/orc/pull/1072 Optimizes the read of streams in ORC by combining multiple nearby reads a single read, optionally allowing the retention or drop of the extra bytes. * minSeekSize: If separation between multiple re

[GitHub] [orc] dongjoon-hyun commented on issue #1068: Tests failure on 1.7.3

2022-03-28 Thread GitBox
dongjoon-hyun commented on issue #1068: URL: https://github.com/apache/orc/issues/1068#issuecomment-1080856636 Thank you for reporting, @ArchangeGabriel . We don't have `Arch Linux` in our CI so far. cc @wzhou-code , @wgtmac , @stiga-huang because this sounds like a regression.

[jira] [Created] (ORC-1139) Benchmark for Seek vs Read

2022-03-28 Thread Pavan Lanka (Jira)
Pavan Lanka created ORC-1139: Summary: Benchmark for Seek vs Read Key: ORC-1139 URL: https://issues.apache.org/jira/browse/ORC-1139 Project: ORC Issue Type: Sub-task Components: Java

[jira] [Created] (ORC-1140) Documentation for Seek vs Read

2022-03-28 Thread Pavan Lanka (Jira)
Pavan Lanka created ORC-1140: Summary: Documentation for Seek vs Read Key: ORC-1140 URL: https://issues.apache.org/jira/browse/ORC-1140 Project: ORC Issue Type: Sub-task Components: Jav

[jira] [Created] (ORC-1138) Seek vs Read Optimization

2022-03-28 Thread Pavan Lanka (Jira)
Pavan Lanka created ORC-1138: Summary: Seek vs Read Optimization Key: ORC-1138 URL: https://issues.apache.org/jira/browse/ORC-1138 Project: ORC Issue Type: Sub-task Components: Java