MaxGekk opened a new pull request, #38454:
URL: https://github.com/apache/spark/pull/38454

   ### What changes were proposed in this pull request?
   In this PR, I propose to migrate `failAnalysis()` errors without a context 
onto temporary error classes with the prefix `_LEGACY_ERROR_TEMP_24xx`. The 
error messages will not include the error class names, so the existing 
behaviour is preserved.
   
   ### Why are the changes needed?
   Migrating to temporary error classes makes it possible to gather statistics 
about errors and detect the most popular error classes. After that, we can 
prioritise the remaining migration work.
   
   The new error class name prefix `_LEGACY_ERROR_TEMP_` proposed here 
marks the error as developer-facing, not user-facing. Developers can still get 
the error class programmatically via the `SparkThrowable` interface, so that 
they can build error infra with it. End users won't see the error class in the 
message. This allows us to do the error migration very quickly, and we can 
refine the error classes and mark them as user-facing later (naming them 
properly, adding tests, etc.).
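
To illustrate the idea, here is a minimal, self-contained sketch in plain Python. It is not Spark code: `AnalysisError`, `get_error_class()`, `tally_error_classes()` and the class name `_LEGACY_ERROR_TEMP_2400` are hypothetical stand-ins chosen for the example. It only models the behaviour described above: the error class stays available programmatically, while the rendered message omits it when the class carries the temporary prefix.

```python
from collections import Counter

# Prefix described in this PR; everything else in this sketch is hypothetical.
LEGACY_PREFIX = "_LEGACY_ERROR_TEMP_"


class AnalysisError(Exception):
    """Hypothetical stand-in for an exception that carries an error class."""

    def __init__(self, error_class, message):
        self.error_class = error_class
        # Temporary legacy classes are hidden from the user-visible message;
        # any other class is shown as a "[CLASS] " prefix.
        shown = "" if error_class.startswith(LEGACY_PREFIX) else f"[{error_class}] "
        super().__init__(shown + message)

    def get_error_class(self):
        # Programmatic access, loosely modelled on the accessor idea.
        return self.error_class


def tally_error_classes(actions):
    """Run each callable and count the error classes of raised errors."""
    counts = Counter()
    for action in actions:
        try:
            action()
        except AnalysisError as err:
            counts[err.get_error_class()] += 1
    return counts


legacy = AnalysisError("_LEGACY_ERROR_TEMP_2400", "something failed in analysis")
print(str(legacy))               # message text only, no class name
print(legacy.get_error_class())  # class is still available to tooling
```

A counter such as `tally_error_classes` is one way the statistics mentioned above could be gathered; how that is actually done is outside the scope of this PR.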
   
   ### Does this PR introduce _any_ user-facing change?
   No. The error messages should be almost the same by default.
   
   ### How was this patch tested?
   By running the affected test suites:
   ```
   $ PYSPARK_PYTHON=python3 build/sbt "sql/testOnly org.apache.spark.sql.SQLQueryTestSuite"
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

