I've ran a simple WordCount example with a very small List<String> as input lines and ran it in standalone (local[*]), and Datasets is very slow.. We're talking ~700 msec for RDDs while Datasets takes ~3.5 sec. Is this just start-up overhead ? please note that I'm not timing the context creation...
And in general, is there a way to run with local[*] "lightweight" mode for testing ? something like without the WebUI server for example (and anything else that's not needed for testing purposes) Thanks, Amit