koert kuipers created SPARK-27513:
-------------------------------------

             Summary: Spark tarball with binaries should have files owned by 
uid 0
                 Key: SPARK-27513
                 URL: https://issues.apache.org/jira/browse/SPARK-27513
             Project: Spark
          Issue Type: Bug
          Components: Build
    Affects Versions: 2.4.1
            Reporter: koert kuipers
             Fix For: 3.0.0


currently the tarball is created in dev/make-distribution.sh like this:

{code:bash}
tar czf "spark-$VERSION-bin-$NAME.tgz" -C "$SPARK_HOME" "$TARDIR_NAME"
{code}

the problem with this is that if root unpacks this tarball the files are owned 
by whatever the uid is of the person that created the tarball. this uid 
probably doesnt exist or belongs to a different unrelated user. this is 
unlikely to be what anyone wants.

for other users this problem doesnt exist since tar is now allowed to change 
uid. so when they unpack the tarball the files are owned by them.

it is more typical to set the uid and gid to 0 for a tarball. that way when 
root unpacks it the files are owned by root. so like this:

{code:bash}
tar czf "spark-$VERSION-bin-$NAME.tgz" --numeric-owner --owner=0 --group=0 -C 
"$SPARK_HOME" "$TARDIR_NAME
{code}





--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to