Taewoo Kim has posted comments on this change. Change subject: ASTERIXDB-1892: Sets a proper hash table cardinality during hash-group by ......................................................................
Patch Set 3: (2 comments) Thanks Yingyi. Here is my reply. https://asterix-gerrit.ics.uci.edu/#/c/1702/3/hyracks-fullstack/hyracks/hyracks-dataflow-std/src/main/java/org/apache/hyracks/dataflow/std/structures/SerializableHashTable.java File hyracks-fullstack/hyracks/hyracks-dataflow-std/src/main/java/org/apache/hyracks/dataflow/std/structures/SerializableHashTable.java: PS3, Line 250: private > This seems an expensive operation, you need to iterate over all header byte Proportional to the hash table cardinality if all header frames are used. I think this might be better than spilling a data partition to the disk. Any idea that I can improve this? PS3, Line 258: > Why j=j+2? getInt() is called and an integer is 4 bytes? Each pair contains (content frame #, offset in that frame#). So, it's OK to check the first part only. -- To view, visit https://asterix-gerrit.ics.uci.edu/1702 To unsubscribe, visit https://asterix-gerrit.ics.uci.edu/settings Gerrit-MessageType: comment Gerrit-Change-Id: I651139b2b559ad4d2f6137a5c844814606516a90 Gerrit-PatchSet: 3 Gerrit-Project: asterixdb Gerrit-Branch: master Gerrit-Owner: Taewoo Kim <wangs...@gmail.com> Gerrit-Reviewer: Jenkins <jenk...@fulliautomatix.ics.uci.edu> Gerrit-Reviewer: Taewoo Kim <wangs...@gmail.com> Gerrit-Reviewer: Yingyi Bu <buyin...@gmail.com> Gerrit-HasComments: Yes