Github user gatorsmile commented on the issue:

https://github.com/apache/spark/pull/18865

From the end user's point of view, `dfFromFile.select($"_corrupt_record").show()` might not return all the expected records. `_corrupt_record` should return every record that Spark SQL fails to parse. If we cannot output the expected results, we need to disallow users from running this query. cc @cloud-fan