Github user juliuszsompolski commented on the issue:

    https://github.com/apache/spark/pull/21403
  
    I think the way the columns are defined in the subquery should determine the semantics.
    E.g.:
    `(a, b) IN (select c, d from ...)` - unpack `(a, b)` and treat it as a multi-column comparison, as in the current semantics.
    `(a, b) IN (select (c, d) from ...)` - keep it packed and treat it as a single-column IN.
    `(a, b, c) IN (select (d, e), f from ...)` or similar combinations - catch it in analysis as ambiguous.
    `(a, b, c) IN (select (d, e), f, g from ...)` - this is valid, though, as long as `a` matches the type of `(d, e)`.
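
The four cases above can be read as one rule keyed on the number of columns the subquery outputs. Below is a rough sketch of that reading in Python, not Spark's analyzer code. The function name `resolve_in_subquery`, the return labels, and the encoding of types as strings and tuples are all hypothetical, and the exact rule (equal counts unpack, a single subquery column stays packed, anything else is ambiguous) is an assumption inferred from the examples.

```python
from typing import List, Tuple, Union

# Hypothetical type encoding: a simple type is a string such as "int",
# a struct type is a tuple of field types such as ("int", "string").
Type = Union[str, tuple]


def resolve_in_subquery(lhs: Tuple[Type, ...], subquery_cols: List[Type]) -> str:
    """Decide how `(lhs...) IN (select subquery_cols...)` would be interpreted.

    Returns "unpacked" for a multi-column comparison and "packed" for a
    single-column IN on the whole left-hand struct. Raises ValueError for
    ambiguous shapes and TypeError for mismatched types.
    """
    n, m = len(lhs), len(subquery_cols)

    if m == n:
        # Same number of values on both sides: compare pairwise.
        # A struct column on the right must face a struct of the same type.
        for i, (left, right) in enumerate(zip(lhs, subquery_cols)):
            if left != right:
                raise TypeError(f"column {i}: {left!r} does not match {right!r}")
        return "unpacked"

    if m == 1:
        # One subquery column: keep the left-hand side packed as a struct.
        if tuple(lhs) != subquery_cols[0]:
            raise TypeError(f"{tuple(lhs)!r} does not match {subquery_cols[0]!r}")
        return "packed"

    # Counts differ and the subquery has several columns: no clear reading.
    raise ValueError(f"ambiguous IN: {n} values vs {m} subquery columns")


# (a, b) IN (select c, d from ...)
print(resolve_in_subquery(("int", "string"), ["int", "string"]))
# (a, b) IN (select (c, d) from ...)
print(resolve_in_subquery(("int", "string"), [("int", "string")]))
# (a, b, c) IN (select (d, e), f, g from ...), with `a` itself a struct
print(resolve_in_subquery((("int", "string"), "int", "int"),
                          [("int", "string"), "int", "int"]))
```

Under this sketch, `(a, b, c) IN (select (d, e), f from ...)` has three values against two subquery columns and so falls through to the `ValueError` branch, which corresponds to rejecting it during analysis.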

