gatorsmile opened a new pull request #30463:
URL: https://github.com/apache/spark/pull/30463


   ### What changes were proposed in this pull request?
   
   This reverts commit 065f17386d1851d732b4c1badf1ce2e14d0de338, which is not 
part of any released version. That is, this is an unreleased feature
   
   ### Why are the changes needed?
   
   I like the concept of Trash, but I think this PR might just resolve a very 
specific issue by introducing a mechanism without a proper design doc. This 
could make the usage more complex.
   
   I think we need to consider the big picture. Trash directory is an important 
concept. If we decide to introduce it, we should consider all the code paths of 
Spark SQL that could delete the data, instead of Truncate only. We also need to 
consider what is the current behavior if the underlying file system does not 
provide the API `Trash.moveToAppropriateTrash`. Is the exception good? How 
about the performance when users are using the object store instead of HDFS? 
Will it impact the GDPR compliance? 
   
   In sum, I think we should not merge the PR 
https://github.com/apache/spark/pull/29552 without the design doc and 
implementation plan. That is why I reverted it before the code freeze of Spark 
3.1
   
   ### Does this PR introduce _any_ user-facing change?
   Reverted the original commit
   
   
   ### How was this patch tested?
   The existing tests.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to