[jira] [Updated] (SPARK-54553) Supports receiving podgroup JSON format configurations when using Volcano.

Wenjun Ruan (Jira) Wed, 03 Dec 2025 02:19:06 -0800


     [ 
https://issues.apache.org/jira/browse/SPARK-54553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Wenjun Ruan updated SPARK-54553:
--------------------------------
    Description: 
I submit Spark jobs to Kubernetes using Volcano as the scheduler.

According to the documentation:
[https://spark.apache.org/docs/latest/running-on-kubernetes.html#volcano-feature-step]

I found that we should create a {{podGroupTemplate file first, and then using 
it in spark job.}}

I hope we ca does not overwrite the PodGroup name when it is already defined in 
the {{{}podGroupTemplateFile{}}}.

 

I can submit a PR to make this behavior configurable or to preserve the 
PodGroup name if it is already provided in {{{}podGroupTemplateFile{}}}.

  was:
I submit Spark jobs to Kubernetes using Volcano as the scheduler.

According to the documentation:
[https://spark.apache.org/docs/latest/running-on-kubernetes.html#volcano-feature-step]

I found that the actual PodGroup name is not the one I specified in the 
{{{}podGroupTemplateFile{}}}. Instead, it is overwritten by 
{{{}VolcanoFeatureStep{}}}.

I hope Spark does not overwrite the PodGroup name when it is already defined in 
the {{{}podGroupTemplateFile{}}}.

Here is the relevant code:

```scala

override def getAdditionalPreKubernetesResources(): Seq[HasMetadata] = {
  if (kubernetesConf.isInstanceOf[KubernetesExecutorConf]) {
    logWarning(
      "VolcanoFeatureStep#getAdditionalPreKubernetesResources() is not 
supported for executor."
    )
    return Seq.empty
  }

  lazy val client = new DefaultVolcanoClient
  val template = kubernetesConf.getOption(POD_GROUP_TEMPLATE_FILE_KEY)
  val pg = template.map(client.podGroups.load(_).item).getOrElse(new PodGroup())

  var metadata = pg.getMetadata
  if (metadata == null) metadata = new ObjectMeta
  metadata.setName(podGroupName)
  metadata.setNamespace(namespace)
  pg.setMetadata(metadata)

  var spec = pg.getSpec
  if (spec == null) spec = new PodGroupSpec
  pg.setSpec(spec)

  Seq(pg)
}

```

I can submit a PR to make this behavior configurable or to preserve the 
PodGroup name if it is already provided in {{{}podGroupTemplateFile{}}}.


> Supports receiving podgroup JSON format configurations when using Volcano.
> --------------------------------------------------------------------------
>
>                 Key: SPARK-54553
>                 URL: https://issues.apache.org/jira/browse/SPARK-54553
>             Project: Spark
>          Issue Type: Improvement
>          Components: Kubernetes
>    Affects Versions: 4.0.1
>            Reporter: Wenjun Ruan
>            Priority: Major
>
> I submit Spark jobs to Kubernetes using Volcano as the scheduler.
> According to the documentation:
> [https://spark.apache.org/docs/latest/running-on-kubernetes.html#volcano-feature-step]
> I found that we should create a {{podGroupTemplate file first, and then using 
> it in spark job.}}
> I hope we ca does not overwrite the PodGroup name when it is already defined 
> in the {{{}podGroupTemplateFile{}}}.
>  
> I can submit a PR to make this behavior configurable or to preserve the 
> PodGroup name if it is already provided in {{{}podGroupTemplateFile{}}}.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Updated] (SPARK-54553) Supports receiving podgroup JSON format configurations when using Volcano.

Reply via email to