[ https://issues.apache.org/jira/browse/SPARK-43039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17710381#comment-17710381 ]
Snoot.io commented on SPARK-43039:
----------------------------------

User 'ryan-johnson-databricks' has created a pull request for this issue:
https://github.com/apache/spark/pull/40677

> Support custom fields in the file source _metadata column
> ---------------------------------------------------------
>
>                 Key: SPARK-43039
>                 URL: https://issues.apache.org/jira/browse/SPARK-43039
>             Project: Spark
>          Issue Type: New Feature
>          Components: Spark Core
>    Affects Versions: 3.4.0
>            Reporter: Ryan Johnson
>            Priority: Major
>
> Today, the schema of the file source _metadata column depends on the file
> format (e.g. parquet file format supports {{_metadata.row_index}}) but
> this is hard-wired into the {{FileFormat}} itself. Not only is this an ugly
> design, it also prevents custom file formats from adding their own fields to
> the {{_metadata}} column.

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
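For readers who want a concrete picture of the request, here is a minimal sketch in plain Python (not Spark code) of what letting each file format declare its own _metadata fields could look like. Every name in it (FileFormat.metadata_fields, ParquetLikeFormat, CustomFormat, my_custom_field, metadata_schema) is hypothetical and chosen for illustration, and the list of base fields is an assumption. It does not reflect the API in the linked pull request.

```python
# Toy model only: all names here are hypothetical, not Spark's classes or API.

# Assumed set of fields that every format exposes in _metadata.
BASE_METADATA_FIELDS = (
    "file_path",
    "file_name",
    "file_size",
    "file_modification_time",
)


class FileFormat:
    """Stand-in for a file format that declares its own _metadata fields."""

    def metadata_fields(self):
        # Default: only the common fields.
        return list(BASE_METADATA_FIELDS)


class ParquetLikeFormat(FileFormat):
    """Adds row_index through the same hook any other format would use."""

    def metadata_fields(self):
        return super().metadata_fields() + ["row_index"]


class CustomFormat(FileFormat):
    """A third-party format contributing its own (made-up) field."""

    def metadata_fields(self):
        return super().metadata_fields() + ["my_custom_field"]


def metadata_schema(fmt):
    """Build the _metadata field list by asking the format, with no per-format special cases."""
    return fmt.metadata_fields()
```

The point of the sketch is that metadata_schema never inspects which format it was given, so a custom format can extend the column without changes to shared code.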