[jira] [Created] (ORC-838) Implement Improvements to ByteBufferAllocatorPool

2021-07-07 Thread David Mollitor (Jira)
David Mollitor created ORC-838: -- Summary: Implement Improvements to ByteBufferAllocatorPool Key: ORC-838 URL: https://issues.apache.org/jira/browse/ORC-838 Project: ORC Issue Type: Improvement

[GitHub] [orc] belugabehr commented on a change in pull request #736: ORC-831: Do Not Copy String When Flushing Dictionary

2021-07-07 Thread GitBox
belugabehr commented on a change in pull request #736: URL: https://github.com/apache/orc/pull/736#discussion_r665807525 ## File path: java/core/src/java/org/apache/orc/impl/Dictionary.java ## @@ -44,9 +44,22 @@ /** * Given the position index, return the original strin

[GitHub] [orc] belugabehr commented on a change in pull request #736: ORC-831: Do Not Copy String When Flushing Dictionary

2021-07-07 Thread GitBox
belugabehr commented on a change in pull request #736: URL: https://github.com/apache/orc/pull/736#discussion_r665806375 ## File path: java/core/src/java/org/apache/orc/impl/DictionaryUtils.java ## @@ -42,4 +44,29 @@ public static void getTextInternal(Text result, int position

[GitHub] [orc] belugabehr commented on a change in pull request #736: ORC-831: Do Not Copy String When Flushing Dictionary

2021-07-07 Thread GitBox
belugabehr commented on a change in pull request #736: URL: https://github.com/apache/orc/pull/736#discussion_r665805203 ## File path: java/core/src/java/org/apache/orc/impl/DynamicByteArray.java ## @@ -222,10 +222,10 @@ public void setText(Text result, int offset, int length)

[GitHub] [orc] belugabehr commented on pull request #736: ORC-831: Do Not Copy String When Flushing Dictionary

2021-07-07 Thread GitBox
belugabehr commented on pull request #736: URL: https://github.com/apache/orc/pull/736#issuecomment-876025178 > Sorry for adding last-minute comments after approval. No worries at all. Thanks for being engaged and helping me get these PRs reviewed. You are very generous with your ti

[GitHub] [orc] belugabehr commented on a change in pull request #739: Implement Improvements to ByteBufferAllocatorPool

2021-07-07 Thread GitBox
belugabehr commented on a change in pull request #739: URL: https://github.com/apache/orc/pull/739#discussion_r665788810 ## File path: java/core/src/java/org/apache/orc/impl/RecordReaderUtils.java ## @@ -504,38 +504,33 @@ static void readDiskRanges(FSDataInputStream file,

[GitHub] [orc] kbendick commented on a change in pull request #739: Implement Improvements to ByteBufferAllocatorPool

2021-07-07 Thread GitBox
kbendick commented on a change in pull request #739: URL: https://github.com/apache/orc/pull/739#discussion_r665751393 ## File path: java/core/src/java/org/apache/orc/impl/RecordReaderUtils.java ## @@ -504,38 +504,33 @@ static void readDiskRanges(FSDataInputStream file, pr

[GitHub] [orc] dongjoon-hyun commented on pull request #736: ORC-831: Do Not Copy String When Flushing Dictionary

2021-07-07 Thread GitBox
dongjoon-hyun commented on pull request #736: URL: https://github.com/apache/orc/pull/736#issuecomment-875954235 Sorry for adding last-minute comments after approval. Please consider the above comments. -- This is an automated message from the Apache Git Service. To respond to the messag

[GitHub] [orc] dongjoon-hyun commented on a change in pull request #736: ORC-831: Do Not Copy String When Flushing Dictionary

2021-07-07 Thread GitBox
dongjoon-hyun commented on a change in pull request #736: URL: https://github.com/apache/orc/pull/736#discussion_r665721041 ## File path: java/core/src/java/org/apache/orc/impl/DictionaryUtils.java ## @@ -42,4 +44,29 @@ public static void getTextInternal(Text result, int posit

[GitHub] [orc] dongjoon-hyun commented on a change in pull request #736: ORC-831: Do Not Copy String When Flushing Dictionary

2021-07-07 Thread GitBox
dongjoon-hyun commented on a change in pull request #736: URL: https://github.com/apache/orc/pull/736#discussion_r665720389 ## File path: java/core/src/java/org/apache/orc/impl/DynamicByteArray.java ## @@ -222,10 +222,10 @@ public void setText(Text result, int offset, int lengt

[GitHub] [orc] dongjoon-hyun commented on a change in pull request #736: ORC-831: Do Not Copy String When Flushing Dictionary

2021-07-07 Thread GitBox
dongjoon-hyun commented on a change in pull request #736: URL: https://github.com/apache/orc/pull/736#discussion_r665715334 ## File path: java/core/src/java/org/apache/orc/impl/Dictionary.java ## @@ -44,9 +44,22 @@ /** * Given the position index, return the original st

[GitHub] [orc] dongjoon-hyun merged pull request #738: ORC-833: Calculate nextVector Batch Size Once

2021-07-07 Thread GitBox
dongjoon-hyun merged pull request #738: URL: https://github.com/apache/orc/pull/738 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr..

[GitHub] [orc] dongjoon-hyun commented on pull request #738: ORC-833: Calculate nextVector Batch Size Once

2021-07-07 Thread GitBox
dongjoon-hyun commented on pull request #738: URL: https://github.com/apache/orc/pull/738#issuecomment-875940248 Merged to main. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comm

[GitHub] [orc] belugabehr opened a new pull request #740: ORC-836: StringGroupFromDoubleTreeReader Use Long toString

2021-07-07 Thread GitBox
belugabehr opened a new pull request #740: URL: https://github.com/apache/orc/pull/740 ### What changes were proposed in this pull request? When converting from Double to String, use the Double class methods directly. Lower overhead of encoding by using ASCII. ### Why are the

[GitHub] [orc] belugabehr opened a new pull request #739: Implement Improvements to ByteBufferAllocatorPool

2021-07-07 Thread GitBox
belugabehr opened a new pull request #739: URL: https://github.com/apache/orc/pull/739 TODO: CREATE JIRA ### What changes were proposed in this pull request? Simplify ByteBufferAllocatorPool Key class. Cache hash value for Key as it's computed more than once: putBuffer to insert i

[jira] [Created] (ORC-837) Reuse HiveDecimalWritable in ConvertTreeReaderFactory

2021-07-07 Thread David Mollitor (Jira)
David Mollitor created ORC-837: -- Summary: Reuse HiveDecimalWritable in ConvertTreeReaderFactory Key: ORC-837 URL: https://issues.apache.org/jira/browse/ORC-837 Project: ORC Issue Type: Improveme

[jira] [Created] (ORC-836) StringGroupFromDoubleTreeReader Use Long.toString

2021-07-07 Thread David Mollitor (Jira)
David Mollitor created ORC-836: -- Summary: StringGroupFromDoubleTreeReader Use Long.toString Key: ORC-836 URL: https://issues.apache.org/jira/browse/ORC-836 Project: ORC Issue Type: Improvement

[jira] [Created] (ORC-835) Cache TRUE/FALSE Bytes in StringGroupFromBooleanTreeReader

2021-07-07 Thread David Mollitor (Jira)
David Mollitor created ORC-835: -- Summary: Cache TRUE/FALSE Bytes in StringGroupFromBooleanTreeReader Key: ORC-835 URL: https://issues.apache.org/jira/browse/ORC-835 Project: ORC Issue Type: Impr

[jira] [Created] (ORC-834) Do Not Convert to String in DecimalFromTimestampTreeReader

2021-07-07 Thread David Mollitor (Jira)
David Mollitor created ORC-834: -- Summary: Do Not Convert to String in DecimalFromTimestampTreeReader Key: ORC-834 URL: https://issues.apache.org/jira/browse/ORC-834 Project: ORC Issue Type: Impr

[GitHub] [orc] belugabehr opened a new pull request #738: ORC-833: Calculate nextVector Batch Size Once

2021-07-07 Thread GitBox
belugabehr opened a new pull request #738: URL: https://github.com/apache/orc/pull/738 ### What changes were proposed in this pull request? The number of iterations of the loops in RunLengthIntegerReaderV2 nextVector look at two distinct, but known, values. Instead of check each valu

[jira] [Created] (ORC-833) RunLengthIntegerReaderV2 Calculate Batch Size One

2021-07-07 Thread David Mollitor (Jira)
David Mollitor created ORC-833: -- Summary: RunLengthIntegerReaderV2 Calculate Batch Size One Key: ORC-833 URL: https://issues.apache.org/jira/browse/ORC-833 Project: ORC Issue Type: Improvement

[GitHub] [orc] pgaref merged pull request #734: ORC-829: Optimize Serialization percentileBits

2021-07-07 Thread GitBox
pgaref merged pull request #734: URL: https://github.com/apache/orc/pull/734 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr...@orc.a

[GitHub] [orc] pgaref commented on a change in pull request #736: ORC-831: Do Not Copy String When Flushing Dictionary

2021-07-07 Thread GitBox
pgaref commented on a change in pull request #736: URL: https://github.com/apache/orc/pull/736#discussion_r665440165 ## File path: java/core/src/java/org/apache/orc/impl/Dictionary.java ## @@ -47,6 +47,15 @@ */ void getText(Text result, int position); + /** + * Give

[GitHub] [orc] belugabehr commented on a change in pull request #736: ORC-831: Do Not Copy String When Flushing Dictionary

2021-07-07 Thread GitBox
belugabehr commented on a change in pull request #736: URL: https://github.com/apache/orc/pull/736#discussion_r665417540 ## File path: java/core/src/java/org/apache/orc/impl/Dictionary.java ## @@ -47,6 +47,15 @@ */ Review comment: Donr. -- This is an automated

[GitHub] [orc] belugabehr commented on a change in pull request #736: ORC-831: Do Not Copy String When Flushing Dictionary

2021-07-07 Thread GitBox
belugabehr commented on a change in pull request #736: URL: https://github.com/apache/orc/pull/736#discussion_r665397854 ## File path: java/core/src/java/org/apache/orc/impl/Dictionary.java ## @@ -47,6 +47,15 @@ */ void getText(Text result, int position); + /** + *

[GitHub] [orc] belugabehr commented on pull request #734: ORC-829: Optimize Serialization percentileBits

2021-07-07 Thread GitBox
belugabehr commented on pull request #734: URL: https://github.com/apache/orc/pull/734#issuecomment-875591895 > Hey @belugabehr changes LGTM -- just wondering if we can/should incorporate the Perf tests you did when discovered this? @pgaref Hey, thanks for taking a look. Nothi

[jira] [Created] (ORC-832) [C++] Support intermediate file footers

2021-07-07 Thread David Justen (Jira)
David Justen created ORC-832: Summary: [C++] Support intermediate file footers Key: ORC-832 URL: https://issues.apache.org/jira/browse/ORC-832 Project: ORC Issue Type: New Feature Comp

[GitHub] [orc] pgaref commented on a change in pull request #736: ORC-831: Do Not Copy String When Flushing Dictionary

2021-07-07 Thread GitBox
pgaref commented on a change in pull request #736: URL: https://github.com/apache/orc/pull/736#discussion_r665341392 ## File path: java/core/src/java/org/apache/orc/impl/Dictionary.java ## @@ -47,6 +47,15 @@ */ void getText(Text result, int position); + /** + * Give

[GitHub] [orc] pgaref commented on a change in pull request #736: ORC-831: Do Not Copy String When Flushing Dictionary

2021-07-07 Thread GitBox
pgaref commented on a change in pull request #736: URL: https://github.com/apache/orc/pull/736#discussion_r665340919 ## File path: java/core/src/java/org/apache/orc/impl/Dictionary.java ## @@ -47,6 +47,15 @@ */ Review comment: Can we please mention here that this me

[GitHub] [orc] pgaref commented on a change in pull request #734: ORC-829: Optimize Serialization percentileBits

2021-07-07 Thread GitBox
pgaref commented on a change in pull request #734: URL: https://github.com/apache/orc/pull/734#discussion_r665224823 ## File path: java/core/src/java/org/apache/orc/impl/SerializationUtils.java ## @@ -287,27 +293,24 @@ public long zigzagDecode(long val) { * @param p - perce