emkornfield commented on code in PR #1: URL: https://github.com/apache/parquet-benchmark/pull/1#discussion_r1735005498
########## README.md: ########## @@ -1 +1,28 @@ -# Apache Parquet Benchmarking +# Parquet benchmark data + +This repository contains Parquet benchmark data. Such data is useful to help +optimize Parquet implementations but also advance the Parquet format itself. + +At this point the community requests donation of Parquet footers and especially +footers that are large and slow to parse/process. Typically these are footers of +wide schemata: either coming from lots of individual columns and/or deeply nested +structs. + +To donate Parquet footers we have built a binary `parquet-dump-footer` as part Review Comment: could you add a note that this is now compressed? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
