[GitHub] spark issue #19456: [SPARK] [Scheduler] Configurable default scheduling mode

2018-05-30 Thread varuvish
Github user varuvish commented on the issue: https://github.com/apache/spark/pull/19456 Can you guarantee that your users share an even workload? With this setup you could potentially have a user with a high workload using 1/#users resources. Wouldn't using the queue feature

[GitHub] spark issue #21333: [SPARK-23778][CORE] Avoid unneeded shuffle when union ge...

2018-05-30 Thread varuvish
Github user varuvish commented on the issue: https://github.com/apache/spark/pull/21333 Nice change! I tested this out as well and verified that the shuffle doesn't happen. I did notice that this change wasn't reflected in the dataset API. Is that something that should be addressed