[jira] [Commented] (DRILL-4119) Skew in hash distribution for varchar (and possibly other) types of data

2015-11-30 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15032791#comment-15032791 ] ASF GitHub Bot commented on DRILL-4119: --- Github user asfgit closed the pull request

[jira] [Commented] (DRILL-4119) Skew in hash distribution for varchar (and possibly other) types of data

2015-11-30 Thread Mehant Baid (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15032229#comment-15032229 ] Mehant Baid commented on DRILL-4119: I think it makes sense to address that as a separ

[jira] [Commented] (DRILL-4119) Skew in hash distribution for varchar (and possibly other) types of data

2015-11-30 Thread Aman Sinha (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15032062#comment-15032062 ] Aman Sinha commented on DRILL-4119: --- [~mehant] would it make sense to open a separate JI

[jira] [Commented] (DRILL-4119) Skew in hash distribution for varchar (and possibly other) types of data

2015-11-24 Thread Zelaine Fong (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15025071#comment-15025071 ] Zelaine Fong commented on DRILL-4119: - Per discussion at today's Drill hangout, Jacque

[jira] [Commented] (DRILL-4119) Skew in hash distribution for varchar (and possibly other) types of data

2015-11-24 Thread Aman Sinha (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15025061#comment-15025061 ] Aman Sinha commented on DRILL-4119: --- Sure, if you want to try out the original version g

[jira] [Commented] (DRILL-4119) Skew in hash distribution for varchar (and possibly other) types of data

2015-11-24 Thread Mehant Baid (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15024951#comment-15024951 ] Mehant Baid commented on DRILL-4119: If we are returning different values from the ori

[jira] [Commented] (DRILL-4119) Skew in hash distribution for varchar (and possibly other) types of data

2015-11-24 Thread Aman Sinha (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15024913#comment-15024913 ] Aman Sinha commented on DRILL-4119: --- Our hash64 implementation looks similar to the orig

[jira] [Commented] (DRILL-4119) Skew in hash distribution for varchar (and possibly other) types of data

2015-11-24 Thread Zelaine Fong (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15024866#comment-15024866 ] Zelaine Fong commented on DRILL-4119: - [~amansinha100] - when you say "the original XX

[jira] [Commented] (DRILL-4119) Skew in hash distribution for varchar (and possibly other) types of data

2015-11-24 Thread Aman Sinha (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15024802#comment-15024802 ] Aman Sinha commented on DRILL-4119: --- I did some more testing with the sample data. Here

[jira] [Commented] (DRILL-4119) Skew in hash distribution for varchar (and possibly other) types of data

2015-11-23 Thread Parth Chandra (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15022868#comment-15022868 ] Parth Chandra commented on DRILL-4119: -- The XXHash C implementation has a XXH32 funct

[jira] [Commented] (DRILL-4119) Skew in hash distribution for varchar (and possibly other) types of data

2015-11-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15022713#comment-15022713 ] ASF GitHub Bot commented on DRILL-4119: --- Github user mehant commented on a diff in t

[jira] [Commented] (DRILL-4119) Skew in hash distribution for varchar (and possibly other) types of data

2015-11-23 Thread Aman Sinha (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15022700#comment-15022700 ] Aman Sinha commented on DRILL-4119: --- As discussed in a prior comment, I have created DRI

[jira] [Commented] (DRILL-4119) Skew in hash distribution for varchar (and possibly other) types of data

2015-11-23 Thread Aman Sinha (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15022404#comment-15022404 ] Aman Sinha commented on DRILL-4119: --- Submitted a PR. [~mehant] could you pls review ?

[jira] [Commented] (DRILL-4119) Skew in hash distribution for varchar (and possibly other) types of data

2015-11-23 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15022389#comment-15022389 ] ASF GitHub Bot commented on DRILL-4119: --- GitHub user amansinha100 opened a pull requ

[jira] [Commented] (DRILL-4119) Skew in hash distribution for varchar (and possibly other) types of data

2015-11-21 Thread Jacques Nadeau (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15020672#comment-15020672 ] Jacques Nadeau commented on DRILL-4119: --- Sounds good. > Skew in hash distribution f

[jira] [Commented] (DRILL-4119) Skew in hash distribution for varchar (and possibly other) types of data

2015-11-21 Thread Aman Sinha (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15020665#comment-15020665 ] Aman Sinha commented on DRILL-4119: --- Yes, it would be useful to have a suite for the has

[jira] [Commented] (DRILL-4119) Skew in hash distribution for varchar (and possibly other) types of data

2015-11-21 Thread Jacques Nadeau (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15020638#comment-15020638 ] Jacques Nadeau commented on DRILL-4119: --- Interesting finding. As we've been stung by

[jira] [Commented] (DRILL-4119) Skew in hash distribution for varchar (and possibly other) types of data

2015-11-21 Thread Aman Sinha (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4119?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15020627#comment-15020627 ] Aman Sinha commented on DRILL-4119: --- The problem comes from the cast to integer after co