Michael Smith created IMPALA-13166:
--------------------------------------

             Summary: Queries with many nested views are still slow
                 Key: IMPALA-13166
                 URL: https://issues.apache.org/jira/browse/IMPALA-13166
             Project: IMPALA
          Issue Type: Improvement
          Components: Frontend
    Affects Versions: Impala 4.4.0
            Reporter: Michael Smith


Further profiling of the query from IMPALA-12800, and of a larger version of 
the same query, shows significant time spent in {{Analyzer#registerSlotRef}}. 
The bulk of the work is in checking whether a {{SlotDescriptor}} already exists 
in the current {{TupleDescriptor}}. That check is slow both because 
{{Path#getFullyQualifiedRawPath}} is expensive and because {{equals}} on the 
resulting list is expensive, especially when repeated for every 
{{SlotDescriptor}} in the {{TupleDescriptor}}.

There's room for improvement here:
1. avoid copying {{rawPath_}} every time we call {{getFullyQualifiedRawPath}}
2. use a map to identify whether a {{SlotDescriptor}} already exists at 
{{Path}} in the current {{TupleDescriptor}}
3. other small improvements may exist around memory allocation
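
A minimal sketch of items 1 and 2, to make the idea concrete. This is not 
Impala's code: {{SketchPath}}, {{SketchSlot}}, {{SketchTuple}} and 
{{getOrAddSlot}} are hypothetical stand-ins, and the real {{Path}}, 
{{SlotDescriptor}} and {{TupleDescriptor}} carry far more state. It assumes the 
fully qualified path can be built once and treated as immutable afterwards.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for a path; not Impala's Path class.
final class SketchPath {
  // Built once in the constructor and never copied again (item 1).
  private final List<String> fullPath_;

  SketchPath(String rootAlias, List<String> rawPath) {
    List<String> full = new ArrayList<>(rawPath.size() + 1);
    full.add(rootAlias);
    full.addAll(rawPath);
    fullPath_ = Collections.unmodifiableList(full);
  }

  // Returns the cached, read-only list instead of a fresh copy per call.
  List<String> getFullyQualifiedRawPath() { return fullPath_; }
}

// Hypothetical stand-in for a slot descriptor.
final class SketchSlot {
  final int id;
  final SketchPath path;

  SketchSlot(int id, SketchPath path) {
    this.id = id;
    this.path = path;
  }
}

// Hypothetical stand-in for a tuple descriptor.
final class SketchTuple {
  private final List<SketchSlot> slots_ = new ArrayList<>();
  // Index from fully qualified path to slot (item 2). A lookup hashes the
  // path once instead of comparing it against every existing slot.
  private final Map<List<String>, SketchSlot> slotsByPath_ = new HashMap<>();

  // Returns the slot already registered at this path, or registers a new one.
  SketchSlot getOrAddSlot(SketchPath path) {
    List<String> key = path.getFullyQualifiedRawPath();
    SketchSlot existing = slotsByPath_.get(key);
    if (existing != null) return existing;
    SketchSlot slot = new SketchSlot(slots_.size(), path);
    slots_.add(slot);
    slotsByPath_.put(key, slot);
    return slot;
  }

  int numSlots() { return slots_.size(); }
}
```

Hashing the list still costs time proportional to the path length on each 
lookup, but it replaces a scan over every slot in the tuple with a single 
map probe.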



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
