Re: WholeStageCodeGen + DSv2

2021-05-19 Thread Andrew Melo
As it turns out, I also commented on the same Jira (and forgot about it until just now). On Wed, May 19, 2021 at 8:32 AM Shubham Chaurasia wrote: > > Hi, > > I remember creating one for a similar scenario in the past - > https://issues.apache.org/jira/browse/SPARK-29372. > > Thanks, > Shubham >

Re: WholeStageCodeGen + DSv2

2021-05-19 Thread Shubham Chaurasia
Hi, I remember creating one for a similar scenario in the past - https://issues.apache.org/jira/browse/SPARK-29372. Thanks, Shubham On Wed, May 19, 2021 at 5:34 PM Takeshi Yamamuro wrote: > hi, Andrew, > > Welcome any improvement proposal for that. > Could you file an issue in jira first to

Re: WholeStageCodeGen + DSv2

2021-05-19 Thread Takeshi Yamamuro
hi, Andrew, Welcome any improvement proposal for that. Could you file an issue in jira first to show us your idea and an example query to reproduce the issue you described? Bests, Takeshi On Wed, May 19, 2021 at 11:38 AM Andrew Melo wrote: > Hello, > > When reading a very wide (> 1000 cols)

WholeStageCodeGen + DSv2

2021-05-18 Thread Andrew Melo
Hello, When reading a very wide (> 1000 cols) input, WholeStageCodeGen blows past the 64kB source limit and fails. Looking at the generated code, a big part of the code is simply the DSv2 convention that the codegen'd variable names are the same as the columns instead of something more compact