[GitHub] [parquet-mr] ggershinsky commented on pull request #1016: PARQUET-2223: Parquet Data Masking Enhancement for Column Encryption

2023-01-16 Thread GitBox
ggershinsky commented on PR #1016: URL: https://github.com/apache/parquet-mr/pull/1016#issuecomment-1384054816 I found the doc. Could you provide me with a "comment" access, so we'll discuss the goals and design there? Thanks. -- This is an automated message from the Apache Git Service.

[GitHub] [parquet-mr] ggershinsky commented on pull request #1016: PARQUET-2223: Parquet Data Masking Enhancement for Column Encryption

2023-01-15 Thread GitBox
ggershinsky commented on PR #1016: URL: https://github.com/apache/parquet-mr/pull/1016#issuecomment-1383552737 As far as I understand, _data masking_ replaces content of sensitive columns; it does not remove the columns (schema and content). The latter is done by _column pruning_ - when