Hey Guys,
I am wondering what is the best way to get logs for driver in the cluster
mode on standalone cluster? Normally I used to run client mode so I could
capture logs from the console.
Now I've started running jobs in cluster mode and obviously driver is
running on worker and can't see the
Hi Magnus,
Yes, I was thinking also about partitioning approach. And I think this is
the best solution in this type of scenario.
Also my scenario is relevant to your last paragraph, the dates which are
coming are very random. I can get updated from 2012 and from 2019.
Therefore, this strategy mi