Thanks for sharing Julien! I also wanted to point out another online viewer from Xiangpeng Hao that lets you explore all the metadata (and data) as well[1]
Andrew [1]: https://parquet-viewer.xiangpeng.systems/ On Thu, Oct 30, 2025 at 2:38 PM Arnav Balyan <[email protected]> wrote: > Thanks a lot for sharing! This will be really helpful for the FSST testing! > > On Thu, Oct 30, 2025 at 11:41 PM Curt Hagenlocher <[email protected]> > wrote: > > > Thanks, this is great! It's something I'd long mused about creating but > > never got to the point of setting pen to paper. > > > > On Thu, Oct 30, 2025 at 11:02 AM Julien Le Dem <[email protected]> > wrote: > > > > > I mentioned in the sync that I did a little Parquet file metadata > > > visualizer. > > > I wanted to have something to help people who are benchmarking Parquet > > and > > > want to take a look at what the pages look like and verify that the > > choices > > > on page size or encoding are not creating degraded performance. > > > This is made with claude code and doubles as my personal > experimentation > > > with AI coding. > > > Note that I don't really know typescript or front end development, so > > code > > > quality may vary. > > > Requirements: > > > - run entirely in the browser. (It is using hyparquet!) > > > - display the page metadata by iterating through the page headers for > a > > > given column chunk. > > > - it is trying to be careful of memory usage, I have loaded 2GB+ > Parquet > > > files with many columns in it. > > > https://github.com/julienledem/parquet-metadata-vizualizer > > > I have published it here, if you want to play with it: > > > https://julien.ledem.net/experiment/parquet-visualizer.html > > > There is no backend. It is all running in your browser. > > > Here is an example with the NY taxi data > > > [image: Screenshot 2025-10-30 at 10.58.53 AM.png] > > > > > >
