nfarah86 commented on code in PR #7967: URL: https://github.com/apache/hudi/pull/7967#discussion_r1108055435
########## website/docs/metadata_indexing.md: ########## @@ -64,8 +74,23 @@ spark-submit \ From version 0.11.0 onwards, Hudi metadata table is enabled by default and the files index will be automatically created. While the deltastreamer is running in continuous mode, let us schedule the indexing for COLUMN_STATS index. First we need to define a properties file for the indexer. +### Configurations + +As mentioned before, metadata indexes are pluggable. One can add any index at any point in time depending on changing +business requirements. Some configurations to enable particular indexes are listed below. Full set of metadata Review Comment: The full set of metadata ########## website/docs/metadata_indexing.md: ########## @@ -64,8 +74,23 @@ spark-submit \ From version 0.11.0 onwards, Hudi metadata table is enabled by default and the files index will be automatically created. While the deltastreamer is running in continuous mode, let us schedule the indexing for COLUMN_STATS index. First we need to define a properties file for the indexer. +### Configurations + +As mentioned before, metadata indexes are pluggable. One can add any index at any point in time depending on changing +business requirements. Some configurations to enable particular indexes are listed below. Full set of metadata +configurations can be explored [here](/docs/configurations/#Metadata-Configs). + + +|Config| Default | Description | Scope | Since Version | +|---|---|---|---|---| +| hoodie.metadata.enable | true | Metadata table | Set to false to disable metadata table | 0.7.0 | +| hoodie.metadata.index.async | false | Metadata table | Enable async indexing of metadata table. | 0.11.0 | +| hoodie.metadata.index.column.stats.enable | false | Metadata table | Enable indexing column ranges of user data files under metadata table key lookups | 0.11.0 | +| hoodie.metadata.index.bloom.filter.enable | false | Metadata table | Enable indexing bloom filters of user data files under metadata table | 0.11.0 | + :::note -Enabling metadata table and configuring a lock provider are the prerequisites for using async indexer. +Enabling metadata table and configuring a lock provider are the prerequisites for using async indexer. Checkout a sample Review Comment: Enabling the metadata -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org