GitHub user suryaprasanna added a comment to the discussion: Parquet Tool Interface for File-Level Operations in Clustering
Our use case for column pruning is nullifying unused columns in historical data to save storage space. Users can also leverage the interface to remove already-dropped columns from the data files.

GitHub link: https://github.com/apache/hudi/discussions/17958#discussioncomment-15556776
