[ https://issues.apache.org/jira/browse/DRILL-5758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16178075#comment-16178075 ]
ASF GitHub Bot commented on DRILL-5758: --------------------------------------- Github user paul-rogers commented on the issue: https://github.com/apache/drill/pull/932 Added a fix for repeated columns that have a low cardinality. If, say, one row in ten has an array entry, then average cardinality (values per row) is 0.1. This was represented by an int, rounded to 0 and caused a zero-length vector to be allocated. Drill then tried to double the length, which resulted in 0, which was doubled again, and so on forever. The fix has three parts: * The "record batch sizer" uses floats to allow fractional cardinality. * The vector initializer now works with fractional cardinality. * If all else fails, if a fixed-width vector is asked to double from zero, it sizes the vector to 256 bytes. > Rollup of external sort fixes to issues found by QA > --------------------------------------------------- > > Key: DRILL-5758 > URL: https://issues.apache.org/jira/browse/DRILL-5758 > Project: Apache Drill > Issue Type: Task > Affects Versions: 1.12.0 > Reporter: Paul Rogers > Assignee: Paul Rogers > Fix For: 1.12.0 > > > Tracking JIRA to used for the PR that combines fixes for various JIRA > entries. Bugs fixed in this task are given by the linked issues. -- This message was sent by Atlassian JIRA (v6.4.14#64029)