Github user HyukjinKwon commented on the issue:

    https://github.com/apache/spark/pull/18865
  
    Just to summarise my opinion:
    
    I am not sure we should treat selecting `_corrupt_record` alone as a bug 
to be disallowed. If I understood correctly, we are basically saying:
    
    - Results from selecting `_corrupt_record` alone are useless and weird:
    
      The output does not look completely useless or weird, as shown in 
https://github.com/apache/spark/pull/18865#issuecomment-326832181
    
    - Selecting `_corrupt_record` alone should produce results as if all 
columns were parsed, but this appears to violate the designed behavior. So we 
should rather disallow it and describe the workaround in the exception message.
    
      I think this is unclear because the flip side also seems valid: it 
should work as if no columns are parsed, apparently as designed. We should 
allow it, but describe the workaround in logging or documentation.
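
To make the two readings above concrete, here is a minimal sketch in plain Python. It is a toy model, not Spark code: the one-column schema, the `corrupt_record` helper, and the rule that a type mismatch marks the whole record as corrupt are all assumptions made for illustration.

```python
import json

# Hypothetical toy model, not Spark's JSON reader. Assumption: a record is
# "corrupt" if it is not a valid JSON object, or if a field that is actually
# parsed does not match its declared type.
SCHEMA = {"a": int}  # assumed one-column schema for illustration


def corrupt_record(line, parsed_fields):
    """Return the raw line if it is treated as corrupt, else None."""
    try:
        obj = json.loads(line)
    except ValueError:
        return line  # malformed JSON is corrupt under either reading
    if not isinstance(obj, dict):
        return line
    for name in parsed_fields:
        value = obj.get(name)
        if value is not None and not isinstance(value, SCHEMA[name]):
            return line  # type mismatch, visible only if the field is parsed
    return None


lines = ['{"a": 1}', '{"a": "x"}', '{"a": ']

# Reading 1: behave as if all columns were parsed.
all_parsed = [corrupt_record(l, list(SCHEMA)) for l in lines]

# Reading 2: behave as if no columns were parsed, which is what happens in
# this model when `_corrupt_record` is the only selected column.
none_parsed = [corrupt_record(l, []) for l in lines]
```

In this model the two readings disagree only on the second line: the type mismatch is reported when field `a` is parsed and goes unnoticed when no fields are parsed, while the truncated third line is corrupt either way.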
    
    I am -0 on disallowing because I prefer the conservative approach, and it 
does not look challenging or demanding, though both sides have some valid 
arguments.
    
    If I understood correctly, we are not saying that the current behavior 
when selecting `_corrupt_record` alone is incorrect, as it might be as designed.
    
    If we don't think this behavior is as designed and think it should depend 
on the selected JSON fields, and if we can't produce the expected results (not 
only when selecting `_corrupt_record` alone but also when selecting a few 
columns together), I think we should deprecate this option first and explain 
this somewhere, given the opinions here.
    
    My thinking has been that we disallow or change a behaviour when, for 
example, it is clearly a bug, does not make sense, or is not as designed.

