[GitHub] [iceberg] Zhangg7723 opened a new pull request, #6498: Flink:support lookup join function with sync and async mode

GitBox Tue, 27 Dec 2022 08:05:46 -0800


Zhangg7723 opened a new pull request, #6498:
URL: https://github.com/apache/iceberg/pull/6498


   This PR adds support for Flink lookup joins such as:
   INSERT INTO sink_table
   SELECT s.id, d.name
   FROM (SELECT *, PROCTIME() AS proctime FROM source_table) AS s
   LEFT JOIN dim_table FOR SYSTEM_TIME AS OF s.proctime AS d
   ON s.id = d.id;
   When lookup keys are matched, it loads records from the right-hand table 
   (dim_table above) into a cache, either synchronously or asynchronously. 
   The cache supports these properties:
   - lookup.cache.type
     The value "all" loads all rows during initialization; the value "lru" 
     manages cached rows with an LRU policy.
   - lookup.cache.ttl
   - lookup.cache.max-rows
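
To illustrate how the "lru" mode described above could combine the ttl and max-rows settings, here is a minimal sketch. It is not the code in this PR: the class `LruLookupCache`, the `loader` callback, and the sample `dim_table` dictionary are hypothetical names chosen for illustration, and the sketch assumes max-rows limits the number of cached lookup keys rather than individual rows.

```python
import time
from collections import OrderedDict


class LruLookupCache:
    """Hypothetical LRU cache with TTL; a sketch, not the PR's implementation."""

    def __init__(self, max_rows, ttl_seconds, loader, clock=time.monotonic):
        self.max_rows = max_rows        # assumed to mirror lookup.cache.max-rows
        self.ttl_seconds = ttl_seconds  # assumed to mirror lookup.cache.ttl
        self.loader = loader            # called on a miss to fetch rows for a key
        self.clock = clock              # injectable so expiry can be tested
        self._entries = OrderedDict()   # key -> (expires_at, rows), oldest first

    def get(self, key):
        now = self.clock()
        entry = self._entries.get(key)
        if entry is not None and entry[0] > now:
            # Fresh hit: mark the key as most recently used.
            self._entries.move_to_end(key)
            return entry[1]
        # Miss or expired entry: load lazily from the dimension table.
        rows = self.loader(key)
        self._entries[key] = (now + self.ttl_seconds, rows)
        self._entries.move_to_end(key)
        # Evict least recently used keys until the size limit holds.
        while len(self._entries) > self.max_rows:
            self._entries.popitem(last=False)
        return rows


# Stand-in for the right-hand table of the join (illustrative data only).
dim_table = {1: ["alice"], 2: ["bob"], 3: ["carol"]}
loads = []  # records every lazy load so cache behaviour is visible


def load_rows(key):
    loads.append(key)
    return dim_table.get(key, [])


cache = LruLookupCache(max_rows=2, ttl_seconds=60.0, loader=load_rows)
for lookup_key in (1, 2, 1, 3):
    cache.get(lookup_key)
```

In this sketch every miss goes back to the loader, which is the lazy path; an "all"-style cache would instead fill the map once at initialization and serve every lookup from memory.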
   
   In my tests, lookup.cache.type=all performs well, while loading the cache 
   lazily performs very poorly. This may improve once secondary index support 
   matures.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


