Some planning has started around this in PARQUET-1985 [1]. It seems it would be relatively easy in the short term for Rust and C++ to reuse archery for this purpose. I agree it is a good thing.
[1] https://issues.apache.org/jira/browse/PARQUET-1985 On Fri, Mar 5, 2021 at 12:42 PM Jorge Cardoso Leitão < jorgecarlei...@gmail.com> wrote: > Hi, > > To run integration with IPC, we have a set of `.arrow` and `.stream` files > that different implementations consume and can compare against "golden" > .json files that contain the corresponding in-memory representation. > > I wonder if we should not have equivalent json files corresponding to a > curated set of parquet files so that implementations can validate parity > with the C++ implementation with respect to how parquet should be converted > from and to arrow. > > Any opinions? > > Best, > Jorge >