Csaba Ringhofer has posted comments on this change. ( http://gerrit.cloudera.org:8080/23335 )
Change subject: IMPALA-13957: Enable Hive/Yarn with Ozone ...................................................................... Patch Set 1: (2 comments) http://gerrit.cloudera.org:8080/#/c/23335/1//COMMIT_MSG Commit Message: http://gerrit.cloudera.org:8080/#/c/23335/1//COMMIT_MSG@9 PS1, Line 9: run data : load with Ozone > I've only been using it to manually run load-data.py on specific tables. So just a brain dump: it may be a good direction to separate HBase (and potentially Kudu) from other parts of the dataload, which are simply a bunch of files/directories + HMS rows HBase is also stored as just a bunch of files, but AFAIK SNAPSHOT_FILE doesn't contain /hbase, just /test-warehouse - /hbase was not needed so far, because all filesystems that need to load from snapshot also don't handle HBase. I don't know if it worth to change this - even if HBase could be run on Ozone, is it useful to testthe Impala + HBase + Ozone combination in Impala tests? There are some folders for HBase tables, but they don't contain the actual data and are deleted after loading the snapshot: https://github.com/apache/impala/blob/50926b5d8e941c5cc10fd77d0b4556e3441c41e7/testdata/bin/load-test-warehouse-snapshot.sh#L149 Note that the write side of snapshot creation is not open source. http://gerrit.cloudera.org:8080/#/c/23335/1/testdata/bin/run-all.sh File testdata/bin/run-all.sh: http://gerrit.cloudera.org:8080/#/c/23335/1/testdata/bin/run-all.sh@79 PS1, Line 79: elif [[ ${DEFAULT_FS} == "ofs://${INTERNAL_LISTEN_HOST}:9862" ]]; then optional: I think that it would be clearer if there was an env var like RUN_HBASE and it would be initialized in a different block than Hive -- To view, visit http://gerrit.cloudera.org:8080/23335 To unsubscribe, visit http://gerrit.cloudera.org:8080/settings Gerrit-Project: Impala-ASF Gerrit-Branch: master Gerrit-MessageType: comment Gerrit-Change-Id: I64bd0a8dc71f189becf410731285a9af147e25d2 Gerrit-Change-Number: 23335 Gerrit-PatchSet: 1 Gerrit-Owner: Michael Smith <michael.sm...@cloudera.com> Gerrit-Reviewer: Csaba Ringhofer <csringho...@cloudera.com> Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com> Gerrit-Reviewer: Jason Fehr <jf...@cloudera.com> Gerrit-Reviewer: Michael Smith <michael.sm...@cloudera.com> Gerrit-Reviewer: Riza Suminto <riza.sumi...@cloudera.com> Gerrit-Reviewer: Yida Wu <wydbaggio...@gmail.com> Gerrit-Comment-Date: Tue, 26 Aug 2025 05:15:03 +0000 Gerrit-HasComments: Yes