Hello:
    Currently we cannot add a shuffle operation to a SQL job.
For example, suppose I have a Kafka source with three partitions, read with
parallelism three, followed by a lookup join. I want the data to be
distributed by id before the lookup join, so that it is split evenly across
the three parallel subtasks (the source may be seriously skewed).
In the DataStream API I can do this with keyBy(), but unfortunately there is
no way to do it in SQL.
Maybe we could support something like
'select id, f1, f2 from sourceTable distribute by id',
as in Spark SQL.
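
To illustrate the behaviour I am asking for, here is a minimal sketch in plain Python. It is not Flink code and it does not use Flink's real key hashing; the function names, the CRC32 hash and the sample records are all assumptions made up for this example. It only shows the idea: every record is routed to a subtask chosen from its id, regardless of which source partition it came from.

```python
import zlib

PARALLELISM = 3  # assumed number of downstream subtasks


def target_subtask(record_id, parallelism=PARALLELISM):
    """Pick a subtask from the id with a stable hash (hypothetical helper).

    CRC32 is used only because it is deterministic across runs; a real
    engine would use its own partitioner.
    """
    return zlib.crc32(str(record_id).encode("utf-8")) % parallelism


def distribute_by_id(records, parallelism=PARALLELISM):
    """Route each record to the bucket selected by its id."""
    buckets = [[] for _ in range(parallelism)]
    for record in records:
        buckets[target_subtask(record["id"], parallelism)].append(record)
    return buckets


# A skewed source: almost all records arrive on source partition 0.
records = [{"id": i % 50, "f1": i, "source_partition": 0} for i in range(900)]
records += [{"id": i % 50, "f1": i, "source_partition": 1} for i in range(50)]
records += [{"id": i % 50, "f1": i, "source_partition": 2} for i in range(50)]

buckets = distribute_by_id(records)
for index, bucket in enumerate(buckets):
    print("subtask", index, "receives", len(bucket), "records")
```

After the shuffle, the load on each subtask depends on how the ids hash, not on how the records were spread over the source partitions, and all records with the same id reach the same subtask.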

That way we could make the change shown in the picture in SQL mode.

    



lpengdr...@163.com
