What does "deprecated" entail here? Do we plan to remove this field from the format? If not, is the deprecation purely a documentation change?

On Mon, 1 Dec 2025 12:09:18 -0800
Micah Kornfield <[email protected]> wrote:
> This has come up a few times in the sync and other forums. I wanted to
> start the conversation about deprecating file_path
> <https://github.com/apache/parquet-format/blob/3ab52ff2e4e1cbe4c52a3e25c0512803e860c454/src/main/thrift/parquet.thrift#L962>
> [1] in the parquet footer.
>
> Outside of the "_metadata" file index use-case I don't think this is used
> or implemented in any reader (effectively a poor man's table format).
>
> With the rise of file formats, it seems like a reasonable design choice to
> push complexity of referencing columns across files to the table level and
> keep parquet focused on single file storage (encodings, indexing, etc).
>
> Implementing this at a file level also can be challenging in the context of
> knowing all credentials one might need to read from different objects on
> object storage?
>
> Thoughts/Objections?
>
> Thanks,
> Micah
>
>
> [1]
> https://github.com/apache/parquet-format/blob/3ab52ff2e4e1cbe4c52a3e25c0512803e860c454/src/main/thrift/parquet.thrift#L962
