Hi all, I’m looking at the execution process of several operations and have a question that may be naive; I hope someone can help me. For operations like Order By, why do we use an extra MR job to sample the data? In the Java version of the implementation, we can always use one MR job to implement the operation.
Thank you for your time!! Best, Ruoyu
