[ https://issues.apache.org/jira/browse/HADOOP-19385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ahmar Suhail updated HADOOP-19385: ---------------------------------- Parent Issue: HADOOP-19559 (was: HADOOP-19363) > S3A: add a file-format-parsing module for testing format parsing > ---------------------------------------------------------------- > > Key: HADOOP-19385 > URL: https://issues.apache.org/jira/browse/HADOOP-19385 > Project: Hadoop Common > Issue Type: Sub-task > Components: fs/azure, fs/s3 > Affects Versions: 3.4.2 > Reporter: Steve Loughran > Priority: Major > Labels: pull-request-available > > Create a cloud-storage/format-parsing module declaring various file formats > as dependencies (parquet, iceberg, orc) purely for integration/regression > testing store support for them. > h2. Parquet > for parquet reading we'd want > * parquet lib > * samples of well formed files > * samples of malformed files. > Test runs would upload the files then open then. > h2. Iceberg > Verify bulk delete through iceberg FileIO api. > *Update: Iceberg needs java17* > It can't be merged until hadoop trunk goes there. parquet stuff we can put in > earlier and backport > does let me set up the module though -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org