Csaba Ringhofer has posted comments on this change. ( http://gerrit.cloudera.org:8080/23335 )

Change subject: IMPALA-13957: Enable Hive/Yarn with Ozone
......................................................................


Patch Set 1:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/23335/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/23335/1//COMMIT_MSG@9
PS1, Line 9: run data
           : load with Ozone
> I've only been using it to manually run load-data.py on specific tables. So
just a brain dump:
It may be a good direction to separate HBase (and potentially Kudu) from the 
other parts of the dataload, which are simply a bunch of files/directories + 
HMS rows.

HBase is also stored as just a bunch of files, but AFAIK SNAPSHOT_FILE doesn't 
contain /hbase, just /test-warehouse. /hbase has not been needed so far, because 
none of the filesystems that need to load from a snapshot handle HBase. I 
don't know if it is worth changing this: even if HBase could run on Ozone, is 
it useful to test the Impala + HBase + Ozone combination in Impala tests?

There are some folders for HBase tables, but they don't contain the actual data 
and are deleted after loading the snapshot: 
https://github.com/apache/impala/blob/50926b5d8e941c5cc10fd77d0b4556e3441c41e7/testdata/bin/load-test-warehouse-snapshot.sh#L149

Note that the write side of snapshot creation is not open source.


http://gerrit.cloudera.org:8080/#/c/23335/1/testdata/bin/run-all.sh
File testdata/bin/run-all.sh:

http://gerrit.cloudera.org:8080/#/c/23335/1/testdata/bin/run-all.sh@79
PS1, Line 79: elif [[ ${DEFAULT_FS} == "ofs://${INTERNAL_LISTEN_HOST}:9862" ]]; then
optional: I think that it would be clearer if there was an env var like 
RUN_HBASE and it would be initialized in a different block than Hive
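
A rough sketch of what that could look like, not taken from the patch: RUN_HIVE 
and select_services are made-up names, and the rules for which filesystem gets 
which service (Hive on HDFS and Ozone, HBase only on HDFS) are assumptions for 
illustration. Only the ofs:// URL comes from the line above.

```shell
# Hypothetical sketch: RUN_HIVE, RUN_HBASE and select_services are illustrative
# names, not variables or functions that exist in run-all.sh today.
INTERNAL_LISTEN_HOST="${INTERNAL_LISTEN_HOST:-localhost}"

select_services() {
  # $1 is the filesystem URL to decide for (DEFAULT_FS in run-all.sh).

  # Block 1: Hive/Yarn. Assumption: wanted on HDFS and on Ozone.
  RUN_HIVE=false
  case "$1" in
    hdfs://*|"ofs://${INTERNAL_LISTEN_HOST}:9862") RUN_HIVE=true ;;
  esac

  # Block 2: HBase, decided independently of Hive. Assumption: HDFS only.
  RUN_HBASE=false
  case "$1" in
    hdfs://*) RUN_HBASE=true ;;
  esac
}

select_services "${DEFAULT_FS:-ofs://${INTERNAL_LISTEN_HOST}:9862}"
echo "RUN_HIVE=${RUN_HIVE} RUN_HBASE=${RUN_HBASE}"
```

The service start-up code could then check RUN_HBASE and RUN_HIVE separately 
instead of adding one more elif branch per filesystem.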



--
To view, visit http://gerrit.cloudera.org:8080/23335
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I64bd0a8dc71f189becf410731285a9af147e25d2
Gerrit-Change-Number: 23335
Gerrit-PatchSet: 1
Gerrit-Owner: Michael Smith <michael.sm...@cloudera.com>
Gerrit-Reviewer: Csaba Ringhofer <csringho...@cloudera.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Jason Fehr <jf...@cloudera.com>
Gerrit-Reviewer: Michael Smith <michael.sm...@cloudera.com>
Gerrit-Reviewer: Riza Suminto <riza.sumi...@cloudera.com>
Gerrit-Reviewer: Yida Wu <wydbaggio...@gmail.com>
Gerrit-Comment-Date: Tue, 26 Aug 2025 05:15:03 +0000
Gerrit-HasComments: Yes
