Joe McDonnell created IMPALA-12906:
--------------------------------------

             Summary: Incorporate run time scan range information into the tuple cache key
                 Key: IMPALA-12906
                 URL: https://issues.apache.org/jira/browse/IMPALA-12906
             Project: IMPALA
          Issue Type: Task
          Components: Backend, Frontend
    Affects Versions: Impala 4.4.0
            Reporter: Joe McDonnell


The tuple cache key currently does not incorporate information about the scan 
ranges of the tables being scanned. Including that information is important 
both for detecting changes in the table and for giving different cache keys to 
fragment instances that are assigned different scan ranges.

To make this deterministic for mt_dop, we need mt_dop to assign scan ranges 
deterministically to individual fragment instances rather than using the shared 
queue introduced in IMPALA-9655.
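
As a rough illustration of what deterministic assignment could look like, the 
sketch below sorts the scan ranges and deals them out round-robin to fragment 
instances, so the same set of ranges always yields the same per-instance lists 
regardless of input order. The ScanRange fields and the assign_scan_ranges 
helper are hypothetical names chosen for this example, and round-robin is only 
one possible policy; this is not Impala's scheduler code.

```python
from typing import List, NamedTuple


class ScanRange(NamedTuple):
    """Hypothetical, simplified scan range: a byte span within one file."""
    path: str
    offset: int
    length: int


def assign_scan_ranges(ranges: List[ScanRange],
                       num_instances: int) -> List[List[ScanRange]]:
    """Assign ranges to instances in a way that depends only on the set of
    ranges and the instance count, never on arrival order or timing."""
    if num_instances <= 0:
        raise ValueError("num_instances must be positive")
    assignment: List[List[ScanRange]] = [[] for _ in range(num_instances)]
    # Sorting gives a canonical order; round-robin then fixes the owner.
    for i, scan_range in enumerate(sorted(ranges)):
        assignment[i % num_instances].append(scan_range)
    return assignment
```

With a shared queue, which instance picks up which range depends on timing, so 
the per-instance input (and therefore any key derived from it) can differ from 
run to run. A fixed assignment like the one above avoids that.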

One way to implement this is to collect information about the scan nodes that 
feed into the tuple cache and pass that information to the tuple cache node. 
At runtime, the tuple cache node can hash the scan ranges assigned to those 
scan nodes and incorporate the hash into the cache key.
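
The runtime hashing step might look roughly like the sketch below. It combines 
a plan-time key with the scan ranges assigned to each feeding scan node, in a 
canonical order, to produce the final key. All names here (ScanRange, 
runtime_cache_key, plan_key) are hypothetical, and including a file 
modification time as the change signal is an assumption made for this example, 
not something the issue specifies.

```python
import hashlib
from typing import Dict, List, NamedTuple


class ScanRange(NamedTuple):
    """Hypothetical scan range; mtime is an assumed change indicator."""
    path: str
    offset: int
    length: int
    mtime: int


def runtime_cache_key(plan_key: str,
                      ranges_by_scan_node: Dict[int, List[ScanRange]]) -> str:
    """Fold the scan ranges of every feeding scan node into the cache key."""
    h = hashlib.sha256()
    h.update(plan_key.encode("utf-8"))
    # Visit scan nodes and their ranges in sorted order so the key does not
    # depend on the order in which ranges were handed to this instance.
    for node_id in sorted(ranges_by_scan_node):
        h.update(f"node:{node_id}\n".encode("utf-8"))
        for r in sorted(ranges_by_scan_node[node_id]):
            # Length-prefix the path so field boundaries are unambiguous.
            record = f"{len(r.path)}:{r.path}|{r.offset}|{r.length}|{r.mtime}\n"
            h.update(record.encode("utf-8"))
    return h.hexdigest()
```

Under these assumptions, two fragment instances with different scan ranges get 
different keys, and a rewritten file (new mtime) changes the key even when the 
plan itself is unchanged.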



--
This message was sent by Atlassian Jira
(v8.20.10#820010)