BsoBird commented on code in PR #5934:
URL: https://github.com/apache/hive/pull/5934#discussion_r2191868462


##########
ql/src/java/org/apache/hadoop/hive/ql/exec/tez/SplitGrouper.java:
##########
@@ -71,8 +70,8 @@ public class SplitGrouper {
 
   // TODO This needs to be looked at. Map of Map to Map... Made concurrent for 
now since split generation
   // can happen in parallel.
-  private static final Map<Map<Path, PartitionDesc>, Map<Path, PartitionDesc>> 
cache =
-      new ConcurrentHashMap<>();
+  private final Map<Map<Path, PartitionDesc>, Map<Path, PartitionDesc>> cache =

Review Comment:
   > !!! Currently, this cache isn't useful for Iceberg tables because 
SplitGrouper doesn't receive any partition information. The dir path always 
points to the table directory, and the same applies to the keys in 
pathToPartitionInfo
   
   @deniskuzZ 
   You mean, even if we keep PartitionDesc, we shouldn't encounter a deadlock?
   That's very strange. We did experience a deadlock when reading Iceberg 
tables before. We never encountered this issue when reading regular ORC tables.
   However, at that time we were using HIVE 3.1.3 to read ICEBERG tables. Has 
this issue disappeared in HIVE4?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org
For additional commands, e-mail: gitbox-h...@hive.apache.org

Reply via email to