Re: [PR] [SPARK-48495][SQL][DOCS] Describe shredding scheme for Variant [spark]

via GitHub Wed, 10 Jul 2024 06:27:41 -0700


cashmand commented on PR #46831:
URL: https://github.com/apache/spark/pull/46831#issuecomment-2220511842


   Hi @shaeqahmed, I think the ability to read with older engines is not a 
goal, and I don't think it's worth adding extra complexity to the scheme to 
allow it. I can update the document to make that clear.
   
   I also don't think there's much benefit to collapsing of the levels in the 
schema. It adds extra complexity to parse and handle that case, and ensure that 
the meaning of a name can't be ambiguous. I don't think it really saves much, 
if anything - the number of column chunks won't change, and in the current 
scheme, marking the intermediate groups as required instead of optional should 
result in the same column chunk size as if they had been collapsed.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Re: [PR] [SPARK-48495][SQL][DOCS] Describe shredding scheme for Variant [spark]

Reply via email to