Ádám Szita created HIVE-25948:
---------------------------------
Summary: Optimize Iceberg writes by directing records either
Clustered- or Fanoutwriter
Key: HIVE-25948
URL: https://issues.apache.org/jira/browse/HIVE-25948
Project: Hive
Issue Type: Improvement
Reporter: Ádám Szita
Assignee: Ádám Szita
Currently Hive writes Iceberg tables with ClusteredWriter. This has less memory
footprint as it only keeps one writer open at a time, but requires the records
to be sorted.
However if data cardinality is low Fanoutwriter is a better choice for
performance.
We should add support so that either can be used, and the decision could be
based similarly how currently SortedDynPartitonOptimizer has it.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)