Hi, Georg, This is being tracked by https://issues.apache.org/jira/browse/SPARK-32017 You can leave comments in the JIRA.
Thanks, Xiao On Sun, Aug 30, 2020 at 3:06 PM Georg Heiler <georg.kf.hei...@gmail.com> wrote: > Hi, > > I want to use pyspark as distributed via conda in headless mode. > It looks like the hadoop binaries are bundles (= pip distributes a default > version) > https://stackoverflow.com/questions/63661404/bootstrap-spark-itself-on-yarn > . > > I want to ask if it would be possible to A) distribute the headless > version (=without hadoop) instead or B) distribute the headless version > additionally for pip & conda-forge distribution channels. > > Best, > Georg > -- <https://databricks.com/sparkaisummit/north-america>