Hello,

I have been writing some code to read Parquet files, and it would be useful if
there were an easy way to get the total number of bytes in a string column as
well as the null indices of that column. I would have expected this to be
available in the file metadata, but I have not found a way to query it from the
API, and I do not see anything like it when inspecting the files with
`parquet-tools`.

Is there any way to get the null indices of a Parquet string column besides 
reading the whole file and manually checking for nulls?

Is there any way to get the byte lengths of string columns without reading each 
string and summing the number of bytes of each string?
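
For concreteness, the manual scan described in both questions amounts to
something like the sketch below. It is only an illustration: a plain Python
list stands in for an already-decoded string column, `scan_string_column` is a
hypothetical helper and not part of any Parquet API, and the strings are
assumed to be UTF-8 encoded.

```python
from typing import List, Optional, Tuple


def scan_string_column(values: List[Optional[str]]) -> Tuple[List[int], int]:
    """Return (null_indices, total_utf8_bytes) for a decoded string column.

    Hypothetical helper: every value has to be visited once, which is
    exactly the cost the questions above are hoping to avoid.
    """
    null_indices = []
    total_bytes = 0
    for i, value in enumerate(values):
        if value is None:
            # Record the row position of each null entry.
            null_indices.append(i)
        else:
            # Count encoded bytes, not characters.
            total_bytes += len(value.encode("utf-8"))
    return null_indices, total_bytes


# Stand-in data; a real column would come from a Parquet reader.
column = ["parquet", None, "", "naïve", None]
null_indices, total_bytes = scan_string_column(column)
print(null_indices, total_bytes)
```

The byte count is taken from the encoded form because a character such as "ï"
occupies more than one byte in UTF-8.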

Thank you.

Best,
Ben McDonald
