Hi team,
On a single Linux node, I would like to set up RStudio with sparklyr. The
dev team consists of three to four people.
I am aware of the constraints of a single-node Spark cluster. I want to know
when Spark runs into resource problems as more users join to use sparklyr
in RStudio. It
FYI, the Apache Spark version is 3.1.3.
On Wed, Mar 15, 2023 at 4:34 PM karan alang wrote:
> Hi Mich, this doesn't seem to be working for me. The watermark seems to
> be getting ignored!
>
> Here is the data put into Kafka:
>
> ```
>
>
>
All else being equal, it is better to have the same resources in fewer,
larger executors. More tasks are then local to other tasks, which helps
performance, and a task has more opportunity to 'borrow' extra memory and
CPU.
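
To make the comparison concrete, here is a rough sketch (not taken from the
thread) that splits one fixed resource budget into either many small or a few
large executors. The helper name `executor_conf` and the 40 GB / 5 core total
are assumptions for illustration, based on the numbers in the question; the
three config keys are standard Spark properties.

```python
def executor_conf(total_mem_gb, total_cores, cores_per_executor):
    """Split a fixed budget into equally sized executors (hypothetical helper)."""
    if total_cores % cores_per_executor != 0:
        raise ValueError("total_cores must be a multiple of cores_per_executor")
    instances = total_cores // cores_per_executor
    mem_per_executor = total_mem_gb // instances
    return {
        "spark.executor.instances": str(instances),
        "spark.executor.cores": str(cores_per_executor),
        "spark.executor.memory": f"{mem_per_executor}g",
    }


# Assumed total budget: 40 GB of executor memory and 5 cores.
small = executor_conf(40, 5, 1)  # five executors, 8g and 1 core each
large = executor_conf(40, 5, 5)  # one executor, 40g and 5 cores

# Print the settings as spark-submit style --conf flags.
for name, conf in (("small", small), ("large", large)):
    flags = " ".join(f"--conf {key}={value}" for key, value in conf.items())
    print(name, flags)
```

Both layouts use the same total memory and cores. In the second one, all five
task slots share a single 40g heap, which is where the 'borrowing' can happen.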
On Thu, Mar 16, 2023, 2:14 PM Nikhil Goyal wrote:
> Hi folks,
> I am trying to understand what
Hi folks,
I am trying to understand what the difference would be between running
8 GB, 1-core executors and 40 GB, 5-core executors. I see that on YARN it
can cause bin-packing issues, but other than that, are there any pros and
cons to using either?
Thanks
Nikhil