I've ran a simple WordCount example with a very small List<String> as input
lines and ran it in standalone (local[*]), and Datasets is very slow..
We're talking ~700 msec for RDDs while Datasets takes ~3.5 sec.
Is this just start-up overhead ? please note that I'm not timing the
context creation...

And in general, is there a way to run with local[*] "lightweight" mode for
testing ? something like without the WebUI server for example (and anything
else that's not needed for testing purposes)

Thanks,
Amit

Reply via email to