Thanks for sharing them! I added the two tools to 
https://github.com/severo/awesome-parquet (feel free to propose more resources!)




On Thursday, 30 October 2025 at 9:55 PM, Andrew Lamb <[email protected]> 
wrote:

> 
> 
> Thanks for sharing Julien!
> 
> I also wanted to point out another online viewer from Xiangpeng Hao that
> lets you explore all the metadata (and data) as well[1]
> 
> Andrew
> 
> [1]: https://parquet-viewer.xiangpeng.systems/
> 
> 
> 
> On Thu, Oct 30, 2025 at 2:38 PM Arnav Balyan [email protected] wrote:
> 
> > Thanks a lot for sharing! This will be really helpful for the FSST testing!
> > 
> > On Thu, Oct 30, 2025 at 11:41 PM Curt Hagenlocher [email protected]
> > wrote:
> > 
> > > Thanks, this is great! It's something I'd long mused about creating but
> > > never got to the point of setting pen to paper.
> > > 
> > > On Thu, Oct 30, 2025 at 11:02 AM Julien Le Dem [email protected]
> > > wrote:
> > > 
> > > > I mentioned in the sync that I did a little Parquet file metadata
> > > > visualizer.
> > > > I wanted to have something to help people who are benchmarking Parquet
> > > > and
> > > > want to take a look at what the pages look like and verify that the
> > > > choices
> > > > on page size or encoding are not creating degraded performance.
> > > > This is made with claude code and doubles as my personal
> > > > experimentation
> > > > with AI coding.
> > > > Note that I don't really know typescript or front end development, so
> > > > code
> > > > quality may vary.
> > > > Requirements:
> > > > - run entirely in the browser. (It is using hyparquet!)
> > > > - display the page metadata by iterating through the page headers for
> > > > a
> > > > given column chunk.
> > > > - it is trying to be careful of memory usage, I have loaded 2GB+
> > > > Parquet
> > > > files with many columns in it.
> > > > https://github.com/julienledem/parquet-metadata-vizualizer
> > > > I have published it here, if you want to play with it:
> > > > https://julien.ledem.net/experiment/parquet-visualizer.html
> > > > There is no backend. It is all running in your browser.
> > > > Here is an example with the NY taxi data
> > > > [image: Screenshot 2025-10-30 at 10.58.53 AM.png]

Reply via email to