Acero performs poorly and core dumps frequently?

In my scenario, I read one Parquet file and then several other Parquet files.
All of these files share a column named UUID. I need to join the results of
the two reads on UUID, project out the UUID column, and then apply some custom
filtering. The only way I found to do the join is Acero, but in my tests its
performance was very poor and it was very unstable: the process frequently
crashed with a core dump. Is there another way to do all of this, or at least
another way to do the join?
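
To make the intended result concrete, here is a minimal plain-Python sketch of
the join, project, and filter steps. It is not Acero code and not what I
actually ran. The function name, the columns "a" and "b", and the filter
predicate are hypothetical and only illustrate the semantics I am after.

```python
# Plain-Python sketch of the intended pipeline; this is NOT Acero code.
# The columns "a" and "b" and the predicate are hypothetical placeholders.

def join_project_filter(left_rows, right_rows, key="UUID",
                        predicate=lambda row: True):
    """Inner-join two lists of dict rows on `key`, drop `key`, then filter."""
    # Build phase: index the left rows by their key value.
    index = {}
    for row in left_rows:
        index.setdefault(row[key], []).append(row)

    result = []
    # Probe phase: look up each right row's key in the index.
    for right in right_rows:
        for left in index.get(right[key], []):
            merged = {**left, **right}
            merged.pop(key)        # project: remove the join column
            if predicate(merged):  # filter: keep rows passing the predicate
                result.append(merged)
    return result


# Rows standing in for the single Parquet file.
left = [{"UUID": "u1", "a": 1}, {"UUID": "u2", "a": 2}, {"UUID": "u3", "a": 3}]
# Rows standing in for the several other Parquet files, already concatenated.
right = [{"UUID": "u1", "b": 10}, {"UUID": "u3", "b": 30}, {"UUID": "u4", "b": 40}]

rows = join_project_filter(left, right, predicate=lambda r: r["b"] > 10)
print(rows)
```

The sketch assumes an inner join, so rows whose UUID appears on only one side
are dropped.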

1057445597
[email protected]