Aman Sinha created DRILL-4667: --------------------------------- Summary: Improve memory footprint of broadcast joins Key: DRILL-4667 URL: https://issues.apache.org/jira/browse/DRILL-4667 Project: Apache Drill Issue Type: Improvement Components: Execution - Relational Operators Affects Versions: 1.6.0 Reporter: Aman Sinha
For broadcast joins, currently Drill optimizes the data transfer across the network for broadcast table by sending a single copy to the receiving node which then distributes it to all minor fragments running on that particular node. However, each minor fragment builds its own hash table (for a hash join) using this broadcast table. We can substantially improve the memory footprint by having a shared copy of the hash table among multiple minor fragments on a node. -- This message was sent by Atlassian JIRA (v6.3.4#6332)