GitHub user ifilonenko commented on a diff in the pull request:

    https://github.com/apache/spark/pull/22608#discussion_r225630500
  
    --- Diff: bin/docker-image-tool.sh ---
    @@ -71,18 +71,29 @@ function build {
         --build-arg
         base_img=$(image_ref spark)
       )
    -  local BASEDOCKERFILE=${BASEDOCKERFILE:-"$IMG_PATH/spark/Dockerfile"}
    -  local PYDOCKERFILE=${PYDOCKERFILE:-"$IMG_PATH/spark/bindings/python/Dockerfile"}
    -  local RDOCKERFILE=${RDOCKERFILE:-"$IMG_PATH/spark/bindings/R/Dockerfile"}
    +  local BASEDOCKERFILE=${BASEDOCKERFILE:-"$IMG_PATH/main/dockerfiles/spark/Dockerfile"}
    +  local PYDOCKERFILE=${PYDOCKERFILE:-"$IMG_PATH/main/dockerfiles/spark/bindings/python/Dockerfile"}
    +  local KDOCKERFILE=${KDOCKERFILE:-"$IMG_PATH/test/dockerfiles/spark/kerberos/Dockerfile"}
    +  local RDOCKERFILE=${RDOCKERFILE:-"$IMG_PATH/main/dockerfiles/spark/bindings/R/Dockerfile"}
     
    +  # Spark Base
       docker build $NOCACHEARG "${BUILD_ARGS[@]}" \
         -t $(image_ref spark) \
         -f "$BASEDOCKERFILE" .
     
    +  # PySpark
       docker build $NOCACHEARG "${BINDING_BUILD_ARGS[@]}" \
         -t $(image_ref spark-py) \
         -f "$PYDOCKERFILE" .
     
    +  # The following are optional docker builds for Kerberos Testing
    +  docker pull ifilonenko/hadoop-base:latest
    --- End diff --
    
    I am using this “personal” image purely because the HDFS nodes (kdc, nn, dn) 
themselves require hadoop-2.7.3.tgz. Unless that tarball can be packaged in the 
distribution, in which case building the image would be trivial, pulling is 
easier. I am open to opinions on build strategies. The Dockerfile in question 
can be found here: 
https://github.com/ifilonenko/hadoop-kerberos-helm/blob/master/Dockerfile 
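
The build-versus-pull trade-off described above could be sketched roughly as follows. This is only an illustration, not code from the pull request: the function names, the tarball argument, and the Dockerfile argument are all hypothetical, and only the image name `ifilonenko/hadoop-base:latest` comes from the diff. The idea is to build the base image locally when hadoop-2.7.3.tgz is present and to fall back to pulling otherwise.

```shell
# Hypothetical helper (not part of bin/docker-image-tool.sh): report which
# strategy applies, based on whether the Hadoop tarball exists locally.
hadoop_base_strategy() {
  local tarball="${1:-hadoop-2.7.3.tgz}"   # assumed location of the tarball
  if [ -f "$tarball" ]; then
    echo "build"   # tarball available, so a local build is possible
  else
    echo "pull"    # no tarball, so pull the prebuilt image instead
  fi
}

# Hypothetical wrapper: act on the chosen strategy.
# $1 = tarball path, $2 = Dockerfile path (both assumptions for illustration).
obtain_hadoop_base() {
  local image="ifilonenko/hadoop-base:latest"   # image name taken from the diff
  case "$(hadoop_base_strategy "$1")" in
    build) docker build -t "$image" -f "$2" . ;;
    pull)  docker pull "$image" ;;
  esac
}
```

Keeping the decision in a function separate from the docker calls means the choice can be checked without a Docker daemon, which may matter for an optional, test-only step like this one.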


---
