bhat-vinay opened a new pull request, #10414: URL: https://github.com/apache/hudi/pull/10414
Depends on [HUDI-7243] A new TVF, `hudi_filesystem_view(...)` is added to support querying timeline through spark-sql. The information displayed is influenced by the 'fsview' command of hudi-cli A new relation, `FileSystemRelation`, is added to transparently support this functionality. The relation implements buildScan(...) method of TableScan trait. It does not support filter or predicate push-down. Column filtering and predicate evaluation needs to be done by the execution layer. This seems reasonable for the initial implementation for this tool which is mainly going to be used as a debugging/introspection tool. The relation defines a fixed schema required to display basic file information of a given hudi table ### Change Logs A new TVF, `hudi_filesystem_view(...)` is added to support querying timeline through spark-sql. The information displayed is influenced by the 'fsview' command of hudi-cli A new relation, `FileSystemRelation`, is added to transparently support this functionality. The relation implements buildScan(...) method of TableScan trait. It does not support filter or predicate push-down. Column filtering and predicate evaluation needs to be done by the execution layer. This seems reasonable for the initial implementation for this tool which is mainly going to be used as a debugging/introspection tool. The relation defines a fixed schema required to display basic file information of a given hudi table ### Impact New TVF function is added to introspect fileystem state for a given hudi table through spark-sql ### Risk level (write none, low medium or high below) Low ### Documentation Update TBD ### Contributor's checklist - [ ] Read through [contributor's guide](https://hudi.apache.org/contribute/how-to-contribute) - [ ] Change Logs and Impact were stated clearly - [ ] Adequate tests were added if applicable - [ ] CI passed -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@hudi.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org