GitHub user jaltekruse opened a pull request: https://github.com/apache/drill/pull/266
DRILL-4056: Avro corruption bug with UTF-8 strings You can merge this pull request into a Git repository by running: $ git pull https://github.com/jaltekruse/incubator-drill 4056-avro-corruption-bug Alternatively you can review and apply these changes as the patch at: https://github.com/apache/drill/pull/266.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #266 ---- commit a3e0cbe3820a0350d58c59f374877a12184850e0 Author: Jason Altekruse <altekruseja...@gmail.com> Date: 2015-11-13T23:46:58Z DRILL-4056: Fix corruption bug reading string data out of Avro commit 44460fd5a72d6a61b232c335bb8beaaff9daad87 Author: Jason Altekruse <altekruseja...@gmail.com> Date: 2015-11-14T00:26:33Z DRILL-4056: Part 2 - Cleanup in Avro reader. Removed use of unnecessary Holder objects. Added restriction on batch size produced by a single call to next. Did not get a chance to confirm but it looks like it was reading an entire file into a single batch, which could have serious performance impacts on very large files. commit dc084c1255a59aead865e641f952e9e162d4c5e5 Author: Jason Altekruse <altekruseja...@gmail.com> Date: 2015-11-17T23:42:44Z DRILL-4056: Part 3 - Adding results verification to avro tests. Task to be finished as part of DRILL-4110. ---- --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---