Re: [EXTERNAL] Re: Re: Re: Stage level scheduling - lower the number of executors when using GPUs

2022-11-14 Thread Shay Elbaz
: Artemis User Sent: Thursday, November 3, 2022 8:35 PM To: user@spark.apache.org Subject: [EXTERNAL] Re: Re: Re: Stage level scheduling - lower the number of executors when using GPUs ATTENTION: This email originated from outside of GM. Now I see what you want to do. If you have access to the cl

Re: [EXTERNAL] Re: Re: Re: Re: Re: Stage level scheduling - lower the number of executors when using GPUs

2022-11-06 Thread Shay Elbaz
November 6, 2022 4:19 PM To: Shay Elbaz Cc: Artemis User ; Tom Graves ; Tom Graves ; user@spark.apache.org Subject: [EXTERNAL] Re: Re: Re: Re: Re: Stage level scheduling - lower the number of executors when using GPUs ATTENTION: This email originated from outside of GM. May I ask why the ETL

Re: [EXTERNAL] Re: Re: Re: Re: Stage level scheduling - lower the number of executors when using GPUs

2022-11-06 Thread ayan guha
0 GPUs. > > I hope that was more clear. > Thank you very much for helping. > > Shay > > -- > *From:* Tom Graves > *Sent:* Friday, November 4, 2022 4:19 PM > *To:* Tom Graves ; Artemis User < > arte...@dtechspace.com>; user@spark.apache.org ; >

Re: [EXTERNAL] Re: Re: Re: Re: Stage level scheduling - lower the number of executors when using GPUs

2022-11-05 Thread Shay Elbaz
ves ; Artemis User ; user@spark.apache.org ; Shay Elbaz Subject: [EXTERNAL] Re: Re: Re: Re: Stage level scheduling - lower the number of executors when using GPUs ATTENTION: This email originated from outside of GM. So I'm not sure I completely follow. Are you asking for a way to change

Re: [EXTERNAL] Re: Re: Re: Stage level scheduling - lower the number of executors when using GPUs

2022-11-04 Thread Tom Graves
utes, if not much more. Any failure during this long time is pretty expensive. Shay From: Tom Graves Sent: Thursday, November 3, 2022 7:56 PM To: Artemis User ; user@spark.apache.org ; Shay Elbaz Subject: [EXTERNAL] Re: Re: Re: Stage level scheduling - lower the number of executors when using

Re: [EXTERNAL] Re: Re: Re: Stage level scheduling - lower the number of executors when using GPUs

2022-11-03 Thread Shay Elbaz
rsday, November 3, 2022 7:56 PM To: Artemis User ; user@spark.apache.org ; Shay Elbaz Subject: [EXTERNAL] Re: Re: Re: Stage level scheduling - lower the number of executors when using GPUs ATTENTION: This email originated from outside of GM. Stage level scheduling does not allow you to change co

Re: [EXTERNAL] Re: Re: Stage level scheduling - lower the number of executors when using GPUs

2022-11-03 Thread Artemis User
2 1:16 AM *To:* user@spark.apache.org *Subject:* [EXTERNAL] Re: Stage level scheduling - lower the number of executors when using GPUs *ATTENTION:* This email originated from outside of GM. Are you using Rapids for GPU s

Re: [EXTERNAL] Re: Re: Stage level scheduling - lower the number of executors when using GPUs

2022-11-03 Thread Tom Graves
he profile with 1 GPU per executor. So, the question is how do I limit the stage resources to 20 GPUs total? Thanks again, Shay From: Artemis User Sent: Thursday, November 3, 2022 5:23 PM To: user@spark.apache.org Subject: [EXTERNAL] Re: Re: Stage level scheduling - lower the number of executors

Re: [EXTERNAL] Re: Re: Stage level scheduling - lower the number of executors when using GPUs

2022-11-03 Thread Sean Owen
>> Shay >> >> -- >> *From:* Artemis User >> *Sent:* Thursday, November 3, 2022 5:23 PM >> *To:* user@spark.apache.org >> *Subject:* [EXTERNAL] Re: Re: Stage level scheduling - lower the number >> of executors when using GPUs

Re: [EXTERNAL] Re: Re: Stage level scheduling - lower the number of executors when using GPUs

2022-11-03 Thread bo yang
ces to 20 GPUs total? > > Thanks again, > Shay > > -- > *From:* Artemis User > *Sent:* Thursday, November 3, 2022 5:23 PM > *To:* user@spark.apache.org > *Subject:* [EXTERNAL] Re: Re: Stage level scheduling - lower the number > of executors when using GPUs

Re: [EXTERNAL] Re: Re: Stage level scheduling - lower the number of executors when using GPUs

2022-11-03 Thread Shay Elbaz
to 20 GPUs total? Thanks again, Shay From: Artemis User Sent: Thursday, November 3, 2022 5:23 PM To: user@spark.apache.org Subject: [EXTERNAL] Re: Re: Stage level scheduling - lower the number of executors when using GPUs ATTENTION: This email originated from out

Re: [EXTERNAL] Re: Stage level scheduling - lower the number of executors when using GPUs

2022-11-03 Thread Artemis User
*From:* Artemis User *Sent:* Thursday, November 3, 2022 1:16 AM *To:* user@spark.apache.org *Subject:* [EXTERNAL] Re: Stage level scheduling - lower the number of executors when using GPUs *ATTENTION:* This email originated from outside of GM. Are you using Rapids for GPU support in Spark? Cou

Re: [EXTERNAL] Re: Stage level scheduling - lower the number of executors when using GPUs

2022-11-02 Thread Shay Elbaz
ResourceProfile, directly (API) or indirectly (some advanced workaround). Thanks, Shay From: Artemis User Sent: Thursday, November 3, 2022 1:16 AM To: user@spark.apache.org Subject: [EXTERNAL] Re: Stage level scheduling - lower the number of executors when using GPUs

Re: Stage level scheduling - lower the number of executors when using GPUs

2022-11-02 Thread Artemis User
Are you using Rapids for GPU support in Spark? A couple of options you may want to try: 1. In addition to turning on dynamic allocation, you may also need to turn on the external shuffle service. 2. Sounds like you are using Kubernetes. In that case, you may also need to turn on shuffle track
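
The options in this reply map onto standard Spark configuration keys. The following is a minimal sketch, assuming the truncated second option refers to `spark.dynamicAllocation.shuffleTracking.enabled`. The helper `to_submit_flags` and the split into two dictionaries are illustrative and do not come from the thread.

```python
# Option 1 (assumed reading): dynamic allocation plus the external shuffle service.
OPTION_1 = {
    "spark.dynamicAllocation.enabled": "true",
    "spark.shuffle.service.enabled": "true",
}

# Option 2 (assumed reading): dynamic allocation plus shuffle tracking,
# which is the usual choice when no external shuffle service is available.
OPTION_2 = {
    "spark.dynamicAllocation.enabled": "true",
    "spark.dynamicAllocation.shuffleTracking.enabled": "true",
}


def to_submit_flags(conf):
    """Turn a {key: value} mapping into a flat list of spark-submit --conf flags."""
    flags = []
    for key, value in sorted(conf.items()):
        flags += ["--conf", f"{key}={value}"]
    return flags


if __name__ == "__main__":
    # Print the flags so they can be pasted after `spark-submit`.
    print(" ".join(to_submit_flags(OPTION_2)))
```

Neither set of flags limits how many executors a stage asks for. They only make dynamic allocation usable in the first place.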

Stage level scheduling - lower the number of executors when using GPUs

2022-11-02 Thread Shay Elbaz
Hi, Our typical applications need fewer executors for a GPU stage than for a CPU stage. We are using dynamic allocation with stage level scheduling, and Spark tries to maximize the number of executors during the GPU stage as well, causing some resource chaos in the cluster. This forces us to u
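
The behavior described in this message follows from how dynamic allocation sizes its request: roughly one executor per group of task slots needed to run every pending task, capped by `spark.dynamicAllocation.maxExecutors`. The sketch below is a simplified model of that rule, not Spark's actual code. It ignores `spark.dynamicAllocation.executorAllocationRatio`, and all task counts, slot counts, and the cap are hypothetical numbers chosen for illustration.

```python
import math


def executors_requested(pending_tasks, task_slots_per_executor, max_executors):
    """Simplified model: enough executors to run all pending tasks at once,
    capped by the configured maximum (hypothetical values in the demo below)."""
    needed = math.ceil(pending_tasks / task_slots_per_executor)
    return min(needed, max_executors)


if __name__ == "__main__":
    # CPU stage: assume 8 task slots per executor.
    cpu_stage = executors_requested(2000, 8, 500)
    # GPU stage: assume 1 GPU per executor and 1 GPU per task, so 1 slot each.
    gpu_stage = executors_requested(2000, 1, 500)
    print(cpu_stage, gpu_stage)  # 250 500
```

Under this model a GPU profile with one task slot per executor drives the request up to the cap for the same task count, which matches the complaint that Spark maximizes executors during the GPU stage.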